• Community
  • Model
  • asr-wav2vec2-large-xlsr-japanese

asr-wav2vec2-large-xlsr-japanese

Audio transcription model for converting Japanese audio to Japanese text

Notes

huggingface model id: vumichien/wav2vec2-large-xlsr-japanese

Wav2Vec2-Large-XLSR-53-Japanese

Fine-tuned facebook/wav2vec2-large-xlsr-53 on Japanese using the Common Voice and Japanese speech corpus of Saruwatari-lab, University of Tokyo JSUT. When using this model, make sure that your speech input is sampled at 16kHz.

Evaluation

The model can be evaluated on the Japanese test data of Common Voice.

Test Result

WER: 30.84%, CER: 17.85%

Training

The Common Voice train, validation datasets and Japanese speech corpus basic5000 datasets were used for training.

  • ID
  • Name
    wav2vec2-large-xlsr-japanese
  • Model Type ID
    Audio To Text
  • Description
    Audio transcription model for converting Japanese audio to Japanese text
  • Last Updated
    Jun 28, 2022
  • Privacy
    PUBLIC
  • Use Case
  • Toolkit
  • License
  • Share
    • Badge
      asr-wav2vec2-large-xlsr-japanese