Karakalpak ASR
Collection
The collection of the Fine tuned Karakalpak models β’ 6 items β’ Updated β’ 1
This model is a fine-tuned version of openai/whisper-medium for Automatic Speech Recognition (ASR) in the Karakalpak language.
Quyashbek Allanazarov
Evaluation was performed on a held-out test set.
| Metric | Score |
|---|---|
| WER (Word Error Rate) | 17.63% |
| CER (Character Error Rate) | 3.82% |
import torch
from transformers import WhisperForConditionalGeneration, WhisperProcessor
model_id = "your-username/whisper-medium-karakalpak"
processor = WhisperProcessor.from_pretrained(model_id)
model = WhisperForConditionalGeneration.from_pretrained(model_id)
# inputs = processor(audio, sampling_rate=16000, return_tensors="pt")
# generated_ids = model.generate(inputs.input_features)
# transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
Base model
openai/whisper-medium