VibeVoice 1.5B: Four Distinct Voices, 90 Minutes and a Leap Forward for Text‑to‑Speech
Last week, while waiting for a tram in Lisbon, I pulled up an audio article on my phone. It was one of those long‑form pieces converted to speech by a synthetic voice. The words were accurate, but the delivery felt robotic-monotone, with awkward paus...