A real-time foundation model that brings human-like digital presence to customer conversations, virtual assistants, training, and interactive experiences

Today, we are publicly releasing Higgs Audio 3.0, a state-of-the-art Speech-to-Text (STT / ASR) foundation model. It supports 94 languages with sophisticated language detection, advanced sentiment and semantic understanding, and outperforms whisper-v3-large by a large margin on key languages.

Today, we are proud to launch Higgs Audio 2.5, the latest iteration of Boson AI’s audio model, designed to bring high-fidelity generation into production environments. Building on Higgs Audio 2, this release combines improved efficiency with the stability required for real-world deployment.

Announcing Version 2 of Higgs Audio Generation, our latest advancement in audio generation technology with enhanced multi-speaker and dialog capabilities. Now open source.
