- Community
- Model
- asr-wav2vec2-large-xlsr-53-vietnamese
asr-wav2vec2-large-xlsr-53-vietnamese
Audio transcription model for converting Vietnamese audio to Vietnamese text
cd34475a58504dd8b94cd98af751211b
cd34475a58504dd8b94cd98af751211b
Notes
huggingface model id: not-tanh/wav2vec2-large-xlsr-53-vietnamese
Wav2Vec2-Large-XLSR-53-Vietnamese
Fine-tuned facebook/wav2vec2-large-xlsr-53 on Vietnamese using the Common Voice, Vivos dataset and FOSD dataset. When using this model, make sure that your speech input is sampled at 16kHz.
Evaluation
The model can be evaluated on the Vietnamese test data of Common Voice.
Test Result: 39.571823%
Training
The Common Voice train, validation, the VIVOS and FOSD datasets were used for training
- ID
- Namewav2vec2-large-xlsr-53-vietnamese
- Model Type IDAudio To Text
- DescriptionAudio transcription model for converting Vietnamese audio to Vietnamese text
- Last UpdatedJun 28, 2022
- PrivacyPUBLIC
- Use Case
- Toolkit
- License
- Share
- Badge
Concept | Date |
---|