We Used an AudioLLM's Speaker Tags to Guide Diarization. Here's What We Learned.
Building LLM-guided speaker diarization for production ASR — the architecture, the algorithm, and the failures that taught us the most — by Yingxu He Sirui Lewis Won
You're building a speech transcrip
cliolabs.hashnode.dev13 min read