
asr-wav2vec2-large-xlsr-jap-hiragana

Audio transcription model that converts Japanese speech to Hiragana text

Notes

huggingface model id: vumichien/wav2vec2-large-xlsr-japanese-hiragana

Wav2Vec2-Large-XLSR-53-Japanese

This model is facebook/wav2vec2-large-xlsr-53 fine-tuned on Japanese using Common Voice and JSUT, the Japanese speech corpus of Saruwatari Lab, University of Tokyo. When using this model, make sure that your speech input is sampled at 16 kHz.
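The 16 kHz requirement can be checked and enforced before inference. The following is a minimal sketch, not the author's reference code. It assumes the `torch`, `torchaudio`, and `transformers` packages are installed and that the checkpoint loads through the Hugging Face model id listed in the Notes. The helper names `wav_sample_rate` and `transcribe` are hypothetical and chosen only for illustration.

```python
import wave

MODEL_ID = "vumichien/wav2vec2-large-xlsr-japanese-hiragana"  # id from the Notes section
TARGET_SAMPLE_RATE = 16_000  # the model expects 16 kHz input


def wav_sample_rate(source) -> int:
    """Return the sample rate of a PCM WAV file (path or binary file object)."""
    with wave.open(source, "rb") as wav:
        return wav.getframerate()


def transcribe(path: str) -> str:
    """Transcribe one audio file, resampling to 16 kHz when needed (assumed workflow)."""
    # Heavy dependencies are imported lazily so wav_sample_rate works without them.
    import torch
    import torchaudio
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
    model = Wav2Vec2ForCTC.from_pretrained(MODEL_ID)
    model.eval()

    waveform, sample_rate = torchaudio.load(path)  # shape: (channels, samples)
    waveform = waveform.mean(dim=0)  # mix down to mono
    if sample_rate != TARGET_SAMPLE_RATE:
        waveform = torchaudio.functional.resample(
            waveform, sample_rate, TARGET_SAMPLE_RATE
        )

    inputs = processor(
        waveform.numpy(), sampling_rate=TARGET_SAMPLE_RATE, return_tensors="pt"
    )
    with torch.no_grad():
        logits = model(**inputs).logits
    predicted_ids = torch.argmax(logits, dim=-1)  # greedy CTC decoding
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe("sample.wav")` (a hypothetical file name) downloads the checkpoint on first use and returns the decoded Hiragana string. `wav_sample_rate` only reads the WAV header, so it is a cheap way to spot files that will need resampling.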

Test Result

WER: 24.74%, CER: 10.99%
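For readers unfamiliar with the two metrics, WER and CER are both edit distances normalized by the reference length, computed over words and characters respectively. The sketch below illustrates the definitions only. It is not the evaluation script behind the figures above, and it assumes words are separated by spaces, which this page does not specify for the Japanese test set. All function names are hypothetical.

```python
def edit_distance(ref, hyp) -> int:
    """Levenshtein distance between two sequences (substitute, insert, delete)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(ref_tokens, hyp_tokens) -> float:
    """Edit distance divided by the number of reference tokens."""
    if not ref_tokens:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate, assuming space-separated words."""
    return error_rate(reference.split(), hypothesis.split())


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate, ignoring spaces."""
    return error_rate(list(reference.replace(" ", "")),
                      list(hypothesis.replace(" ", "")))


if __name__ == "__main__":
    reference = "きょう は いい てんき です"
    hypothesis = "きょう は いい げんき です"
    print(f"WER: {wer(reference, hypothesis):.2%}")
    print(f"CER: {cer(reference, hypothesis):.2%}")
```

In this toy pair one word out of five is wrong but only one character out of eleven differs, which shows why CER is typically lower than WER for the same output.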

Training

The Common Voice train and validation splits and the JSUT Japanese speech corpus were used for training.

  • ID
  • Name
    wav2vec2-large-xlsr-jap-hiragana
  • Model Type ID
    Audio To Text
  • Description
    Audio transcription model that converts Japanese speech to Hiragana text
  • Last Updated
    Jun 28, 2022
  • Privacy
    PUBLIC
  • Toolkit
  • License