asr-wav2vec2-large-xlsr-53-vietnamese

Audio transcription model for converting Vietnamese audio to Vietnamese text

Notes

Hugging Face model ID: not-tanh/wav2vec2-large-xlsr-53-vietnamese

Wav2Vec2-Large-XLSR-53-Vietnamese

facebook/wav2vec2-large-xlsr-53 fine-tuned on Vietnamese using the Common Voice, VIVOS, and FOSD datasets. When using this model, make sure that your speech input is sampled at 16 kHz.
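The 16 kHz requirement can be handled in code before inference. The following is a minimal sketch, not the author's own script: it assumes `torch`, `torchaudio`, and `transformers` are installed, that the checkpoint loads with the standard `Wav2Vec2Processor` and `Wav2Vec2ForCTC` classes, and the helper names `needs_resampling` and `transcribe` are hypothetical.

```python
TARGET_SAMPLE_RATE = 16_000  # the model expects 16 kHz input
MODEL_ID = "not-tanh/wav2vec2-large-xlsr-53-vietnamese"


def needs_resampling(sample_rate: int) -> bool:
    """Return True when audio at `sample_rate` must be converted to 16 kHz."""
    return sample_rate != TARGET_SAMPLE_RATE


def transcribe(path: str) -> str:
    """Transcribe one audio file (hypothetical helper, assumes a standard CTC checkpoint)."""
    # Heavy dependencies are imported lazily so the helper above works without them.
    import torch
    import torchaudio
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
    model = Wav2Vec2ForCTC.from_pretrained(MODEL_ID)

    waveform, sample_rate = torchaudio.load(path)  # shape: (channels, samples)
    waveform = waveform.mean(dim=0)                # downmix to mono
    if needs_resampling(sample_rate):
        waveform = torchaudio.functional.resample(
            waveform, sample_rate, TARGET_SAMPLE_RATE
        )

    inputs = processor(
        waveform.numpy(),
        sampling_rate=TARGET_SAMPLE_RATE,
        return_tensors="pt",
        padding=True,
    )
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy CTC decoding: pick the most likely token at each frame.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe("sample.wav")` (the file name is a placeholder) downloads the checkpoint on first use and returns the decoded Vietnamese text.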

Evaluation

The model can be evaluated on the Vietnamese test data of Common Voice.

Test Result: 39.571823% (word error rate)
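Speech recognition results of this kind are usually reported as word error rate (WER); that the figure above is a WER is an assumption here, not something this page states. As an illustration of the metric only, the sketch below computes WER for one reference/hypothesis pair with a word-level edit distance. The function name `word_error_rate` is hypothetical, and this is not the evaluation script used to produce the reported number.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / words in the reference."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

For example, scoring the hypothesis "xin chào bạn" against the reference "xin chào các bạn" counts one deleted word out of four. A corpus-level score would normally sum the edit counts and reference lengths over all test utterances rather than average per-utterance rates.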

Training

The Common Voice train and validation splits, together with the VIVOS and FOSD datasets, were used for training.

  • ID
  • Name
    wav2vec2-large-xlsr-53-vietnamese
  • Model Type ID
    Audio To Text
  • Description
    Audio transcription model for converting Vietnamese audio to Vietnamese text
  • Last Updated
    Jun 28, 2022
  • Privacy
    PUBLIC