- Community
- Model
- asr-wav2vec2-large-xlsr-japanese
asr-wav2vec2-large-xlsr-japanese
Audio transcription model for converting Japanese audio to Japanese text
759b8ae2d3f8442c919d0ff5e42851e0
759b8ae2d3f8442c919d0ff5e42851e0
Notes
huggingface model id: vumichien/wav2vec2-large-xlsr-japanese
Wav2Vec2-Large-XLSR-53-Japanese
Fine-tuned facebook/wav2vec2-large-xlsr-53 on Japanese using the Common Voice and Japanese speech corpus of Saruwatari-lab, University of Tokyo JSUT. When using this model, make sure that your speech input is sampled at 16kHz.
Evaluation
The model can be evaluated on the Japanese test data of Common Voice.
Test Result
WER: 30.84%, CER: 17.85%
Training
The Common Voice train, validation datasets and Japanese speech corpus basic5000 datasets were used for training.
- ID
- Namewav2vec2-large-xlsr-japanese
- Model Type IDAudio To Text
- DescriptionAudio transcription model for converting Japanese audio to Japanese text
- Last UpdatedJun 28, 2022
- PrivacyPUBLIC
- Use Case
- Toolkit
- License
- Share
- Badge