Nari Labs launches Dia, an open text-to-speech generator

Community-Team · April 29, 2025, 2:17pm


Nari Labs, a two-person startup, released Dia, a 1.6-billion-parameter text-to-speech model that generates naturalistic dialogue directly from text prompts. The model supports emotional tone, speaker tagging, and nonverbal audio cues such as laughs and coughs, capabilities that co-creator Toby Kim claims surpass competing offerings from ElevenLabs and Google’s NotebookLM. In side-by-side comparisons, Dia handles natural timing, nonverbal expressions, and emotional range well, correctly interpreting cues that other models read aloud or skip entirely. The model is available under an Apache 2.0 license, which permits commercial use, and it runs on consumer-grade GPUs with about 10GB of VRAM. (GitHub)
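To make the speaker tagging and nonverbal cues concrete, here is a minimal sketch of how a dialogue prompt for such a model might be assembled. It assumes a convention of `[S1]`/`[S2]` speaker tags and parenthesized cues like `(laughs)`, which should be checked against the project's GitHub README. The `Turn` class and `build_script` helper are hypothetical names for illustration and are not part of Dia. The sketch only builds the text prompt and does not call the model.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Turn:
    """One line of dialogue (hypothetical helper type, not a Dia class)."""
    speaker: int                 # 1-based speaker index, rendered as [S1], [S2], ...
    text: str                    # what the speaker says
    cue: Optional[str] = None    # optional nonverbal cue, e.g. "laughs" or "coughs"


def build_script(turns: List[Turn]) -> str:
    """Join turns into a single prompt string.

    Assumed format: "[S<n>] text (cue)", with turns separated by spaces.
    """
    parts = []
    for turn in turns:
        line = f"[S{turn.speaker}] {turn.text.strip()}"
        if turn.cue:
            line += f" ({turn.cue})"
        parts.append(line)
    return " ".join(parts)


script = build_script([
    Turn(1, "Did you hear about the new open speech model?"),
    Turn(2, "I did. It even handles laughter.", cue="laughs"),
])
print(script)
```

The resulting string is what would be passed to the model as its text prompt. Keeping the tags in the prompt itself, rather than in separate metadata, is what lets one generation pass produce a multi-speaker exchange.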
