Amazon introduces new AI model, Nova Sonic, for voice applications

Jeffrey Preston Bezos Executive Chairman of Amazon | Amazon

Amazon today introduced Amazon Nova Sonic, a new AI model intended to make it easier to build voice applications and agents. The model, available through Amazon Bedrock's bidirectional streaming API, is meant to streamline the development of voice-enabled applications in industries such as travel, education, and healthcare.

Rohit Prasad, SVP of Amazon Artificial General Intelligence, stated, “With Amazon Nova Sonic, we are releasing a new foundation model in Amazon Bedrock that makes it simpler for developers to build voice-powered applications that can complete tasks for customers with higher accuracy, while being more natural, and engaging.”

Traditional approaches to voice applications typically require multiple models for tasks like speech recognition and text-to-speech conversion. Nova Sonic integrates these processes into a single model, enabling natural dialogue that adapts to spoken input and acoustic context.

Amazon says the model has been tested on standard benchmarks, with results indicating high accuracy in real-time voice conversations. It reportedly outperforms comparable models from OpenAI and Google, achieving notable win rates on the Common Eval dataset.

In terms of speech recognition, Nova Sonic achieves a lower word error rate in multilingual settings and handles noisy conditions effectively.
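
For readers unfamiliar with the metric, word error rate is conventionally the number of word substitutions, deletions, and insertions needed to turn the recognized transcript into the reference transcript, divided by the number of words in the reference. The sketch below is a minimal illustration of that standard definition, not Amazon's evaluation code; the function name and the sample transcripts are hypothetical, and the article does not describe how Amazon normalized text before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: simple lowercasing and whitespace tokenization; real
    evaluations usually apply more careful text normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or a match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical transcripts: one substitution plus one deletion over six words.
wer = word_error_rate("book a flight to boston tomorrow", "book a flight to austin")
print(f"WER: {wer:.3f}")
```

A lower value means fewer recognition mistakes; a value of 0 means the recognized transcript matches the reference exactly.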

Nova Sonic also offers expressive voices and can generate speech in American and British English accents. Nirmal Mukhi of ASAPP praised its performance, noting, “We’ve been particularly impressed by Amazon Nova Sonic’s highly accurate speech understanding capabilities which allow for more natural voice interactions and precise dialog handling over telephony.”

Education First's Tim Hesse highlighted its benefits for language learning, citing the model's accuracy in understanding non-native accents and its interactive nature as key advantages.

Mike Perez, COO of Stats Perform, said the company has tested the model for sports data applications and was satisfied with its low latency and prompt responses. He also cited its ability to generate responses in different formats and languages as a benefit.

Amazon emphasizes the responsible development of Nova models, which come with built-in safety measures and AWS AI Service Cards outlining usage guidelines.

Further information on Amazon Nova models is available on Amazon's official website.
