Jeffrey P. Bezos | Executive Chair & founder of Amazon.com | Amazon website
Amazon Web Services (AWS) has announced the launch of DeepSeek-R1 as a fully managed, serverless large language model in Amazon Bedrock. This makes AWS the first cloud service provider to offer this model as generally available. DeepSeek-R1 is part of a series of models developed by artificial intelligence startup DeepSeek and is designed for tasks requiring sophisticated reasoning capabilities.
DeepSeek has gained attention recently due to its cost-effective training techniques, reportedly making their models 90-95% more affordable than similar ones. The availability of DeepSeek-R1 in Amazon Bedrock allows customers to deploy it at an enterprise scale without worrying about technical setup or maintenance. The model also includes security features such as data encryption and access controls to ensure data privacy and compliance.
Customers can use DeepSeek-R1 for various applications, including problem-solving, coding, data analysis, and more. It is available in the Amazon Bedrock Marketplace for self-managed infrastructure and through Amazon Bedrock Custom Model Import for customized versions.
Vasi Philomin, VP of generative AI at AWS, stated: "We are excited to bring DeepSeek-R1, a cutting-edge model with frontier reasoning performance at significantly lower inference costs, to Amazon Bedrock. When paired with features like Amazon Bedrock Guardrails, customers can implement AI safety guardrails while benefiting from the built-in security and privacy that Amazon Bedrock provides."
AWS recommends integrating Amazon Bedrock Guardrails with the DeepSeek-R1 model to prevent harmful content production and protect user privacy. These tools help block offensive material and remove personal data while allowing customers to set rules based on company policies or industry regulations.
By offering a wide selection of fully managed models from leading AI companies, AWS aims to provide businesses with the necessary tools for building and scaling generative AI applications efficiently.