Given their excellent record, it's strange that ElevenLabs would make a basic transcription mistake in the embedded introduction video[1].
At around 30 seconds in[2], an AI agent types and says, "Hi! Welcome to our design studio." to which a customer responds「すみません、日本語で話してもいいですか?」(Sorry, may I speak Japanese?).
The AI agent then says 「もちろんです。日本語に切り替えましょう。」(Of course. Let's switch to Japanese.) but types「ご心配なく。数量は柔軟に対応できます。」which means "No worries. We can be flexible with the quantity."
Yep that one's on me, we originally were doing an interpreter agent demo here, and then switched to the one in the video, but somehow this text from the interpreter agent was kept in the published video. something for us to learn from
rolph · 6h ago
that could have serious consequences during a contract negotiation.
wild_egg · 2h ago
What does SOTA tool use with voice agents look like these days? Any providers have MCP support?
It'd be incredible to get to a point where we can have natural conversations and the AI is running tools in the background and keeping tabs on things.
gorgoiler · 3h ago
The play button widget will read the article aloud. Alas, it called the product “conversational AI two dot zero comma” and the company “vertical bar elevenlabs”.
Their TTS is good but unfortunately the exemplar widget is not nearly up to the same quality.
giaccoangelo · 5h ago
a lot of awesome features in this ship, comment your feedback for me:)
flyingspaceship · 1h ago
for elevenlabs in general, pricing too high. if we put the price to punch ratio of kokoro at 100, elevenlabs is probably a 35
piterrro · 5h ago
who are you?
frankvdwaal · 3h ago
He's a "Growth Engineer" from ElevenLabs. I'm not sure what that entails, but then I'm not familiar with that area of tech, so maybe someone else can explain it.
ldenoue · 5h ago
How is the turn detection working? LLM prompting or a special AI audio plus text model?
At around 30 seconds in[2], an AI agent types and says, "Hi! Welcome to our design studio." to which a customer responds「すみません、日本語で話してもいいですか?」(Sorry, may I speak Japanese?).
The AI agent then says 「もちろんです。日本語に切り替えましょう。」(Of course. Let's switch to Japanese.) but types「ご心配なく。数量は柔軟に対応できます。」which means "No worries. We can be flexible with the quantity."
[1] https://www.youtube.com/watch?v=TlclS4wLWgY
[2] https://imgur.com/18U9uUU
It'd be incredible to get to a point where we can have natural conversations and the AI is running tools in the background and keeping tabs on things.
Their TTS is good but unfortunately the exemplar widget is not nearly up to the same quality.