Tooling
Developer Fine-Tunes Cohere Transcribe for Speaker Diarization and Timestamps
A developer has extended Cohere's open-source speech-to-text model with speaker identification and timestamp support, achieving 0.097-second average accuracy across up to 32 speakers.
1 min read
Sourcer/localllama
A developer has successfully fine-tuned Cohere Transcribe, the leading open-source speech-to-text model, to add speaker diarization and timestamp support—features the original model lacked despite having tokenizer tokens reserved for the...
Sign in to read the full analysis
Free account. Full analysis on LLM unit economics, plus the weekly Cost-of-Inference column.
Method & sources
- Source type
- Primary publication (lab/vendor blog) — our analysis + implication
- Source link
- r/localllama
- Published
- UTC
- Byline
- By the gotcontext.ai team (editorial standards)
- Correction?
- corrections@gotcontext.ai