We Used an AudioLLM's Speaker Tags to Guide Diarization. Here's What We Learned.
Mar 18 · 13 min read · Building LLM-guided speaker diarization for production ASR — the architecture, the algorithm, and the failures that taught us the most — by Yingxu He Sirui Lewis Won You're building a speech transcrip
Join discussion