Amazon introduces new AI model, Nova Sonic, for voice applications

Jeffrey Preston Bezos Executive Chairman of Amazon - Amazon
Jeffrey Preston Bezos Executive Chairman of Amazon - Amazon
0Comments

Today, Amazon introduced a new AI model, Amazon Nova Sonic, aiming to enhance the experience of building voice applications and agents. The model, accessible through Amazon Bedrock’s bi-directional streaming API, aims to streamline the development of voice-enabled applications for various industries such as travel, education, and healthcare.

Rohit Prasad, SVP of Amazon Artificial General Intelligence, stated, “With Amazon Nova Sonic, we are releasing a new foundation model in Amazon Bedrock that makes it simpler for developers to build voice-powered applications that can complete tasks for customers with higher accuracy, while being more natural, and engaging.”

Traditional approaches to voice applications typically require multiple models for tasks like speech recognition and text-to-speech conversion. Nova Sonic integrates these processes into a single model, enabling natural dialogue that adapts to spoken input and acoustic context.

The model has undergone testing on standard benchmarks, indicating high accuracy for real-time voice conversations. It reportedly surpasses OpenAI and Google models, achieving notable win rates based on the Common Eval data set.

In terms of speech recognition, Nova Sonic achieves a lower word error rate in multilingual settings and handles noisy conditions effectively.

Furthermore, Nova Sonic offers expressive voices and can generate speech in American and British English accents. A spokesperson from ASAPP, Nirmal Mukhi, praised its performance, noting, “We’ve been particularly impressed by Amazon Nova Sonic’s highly accurate speech understanding capabilities which allow for more natural voice interactions and precise dialog handling over telephony.”

Education First’s Tim Hesse highlighted its benefits for language learning, citing the model’s accuracy in understanding non-native accents and its interactive nature as key advantages.

Stats Perform has tested the model for sports data applications and reported satisfaction with its low latency and prompt responses, according to Mike Perez, COO of Stats Perform. The capability to generate responses in different formats and languages was a noted benefit.

Amazon emphasizes the responsible development of Nova models, which come with built-in safety measures and AWS AI Service Cards outlining usage guidelines.

Further information on Amazon Nova models is available on their official website.



Related

Steve Corcoran co-founder and CEO at LawnStarter

Customer experiences with LawnStarter vary widely depending on contractor assignment

LawnStarter, a digital platform that connects homeowners with local lawn care providers, has received mixed feedback from customers according to recent reviews.

Chelsea Miller, Director of Partnerships at RPM Living

RPM Living expands Airbnb partnership with operator-approved hosting across portfolio

RPM Living, a major multifamily property manager in the United States, has expanded its involvement with Airbnb by joining the Airbnb-friendly Apartment program.

Bryan Clayton, President at GreenPal

GreenPal provides instant transparent quotes for lawn care without requiring on-site estimates

Many homeowners face challenges when searching for reliable lawn care services, particularly when it comes to obtaining quotes.

Trending

The Weekly Newsletter

Sign-up for the Weekly Newsletter from Flexible Work News.