Hume AI
Category
TTS
Voters
Kirk
Rakin Ishraq
Francisco Castillo
Egil Sandfeld
Richard Marmorstein
Powered by Canny
Add support for vocal bursts and non-verbal output
Richard Marmorstein
April 18, 2025
Kirk
I have a Story Builder application that lets parents design customized stories for their children. The user can choose a "Read to Me" feature, which uses TTS by Hume AI. Children find it very entertaining when a story read aloud incorporates onomatopoeia or other kinds of vocal bursts, so including this feature would drive engagement and help hold their attention.
2 days ago
Richard Marmorstein
User @kirkrock in Discord reports this would be extremely useful for their TTS application that reads children's stories.
2 days ago
Rakin Ishraq
Being able to laugh when the user makes a joke is, in my opinion, essential for immersive "empathic" voices. As Francisco mentioned, Dia has this capability. It's certainly imperfect at times, but it'd be great if this were considered for Hume.
May 11, 2025
Francisco Castillo
Yeah, what Dia (https://fal.ai/models/fal-ai/dia-tts) does with its non-verbal sounds is awesome!
Generate non-verbal like (laughs), (coughs), (clears throat), (sighs), (gasps), (singing), (sings), (mumbles), (beep), (groans), (sniffs), (claps), (screams), (inhales), (exhales), (applause), (burps), (humming), (sneezes), (chuckle), (whistles)
Source: https://github.com/nari-labs/dia/blob/main/README.md
April 23, 2025