TLDR: Atul Deo, head of Amazon Bedrock and generative AI for AWS, is spearheading a major initiative to make artificial intelligence both more affordable and more intelligent. AWS is focusing on developing cheaper AI solutions and a new generation of ‘agentic’ software that can perform complex, multi-step tasks. This strategy involves leveraging custom silicon like Trainium 2, expanding the Bedrock platform with diverse models, and enabling autonomous AI agents to drive significant efficiency gains for businesses.
Amazon Web Services (AWS) is making a significant strategic bet on the future of artificial intelligence, focusing on dual goals: making AI software more affordable and simultaneously enhancing its intelligence. This ambitious direction is being led by Atul Deo, who heads Amazon Bedrock and generative AI for AWS. Deo aims to demonstrate substantial progress in these areas within the next six months, ahead of the company’s annual re:Invent conference in December.
Deo emphasizes the rapid pace of AI development, stating, ‘The AI space is moving faster than anything I’ve seen. Models get better every few weeks — but customers won’t deploy them unless the economics pencil out.’ This highlights a critical tension in the industry: balancing technological advancements with the practical costs of deployment. For AWS, this means not only offering the most accurate models but also providing the underlying infrastructure and tools to prevent AI expenses from spiraling out of control.
A key shift for corporate Chief Information Officers (CIOs) is moving beyond simple chatbots to ‘agentic’ software capable of executing complex, multi-step tasks that justify their cost. Amazon Bedrock, a two-year-old service that hosts both third-party and Amazon-built AI models, is central to this strategy and is one of the most closely watched products within AWS’s $100-billion-a-year unit.
Since January, Bedrock has significantly expanded its model offerings, including Anthropic’s Claude Sonnet 4 and Opus 4, Meta’s open-source Llama 4, Chinese upstart DeepSeek, and three versions of Amazon’s own Nova family. Among these, Nova Premier is touted by Deo for its ‘state-of-the-art accuracy at a discount.’ A crucial feature of Bedrock is its single API, which allows AWS customers to seamlessly swap between these diverse models.
AWS’s commitment to cost efficiency is further underscored by its investment in custom silicon. Bedrock’s newest Nova models, for instance, were trained entirely on Trainium 2 hardware, reducing Amazon’s reliance on Nvidia’s often scarce GPUs. Deo asserts, ‘Custom silicon is how we bend the curve. It’s the reason we can drop price while pushing capability.’ While rivals like Microsoft and Google Cloud also develop their own AI chips (Maia and TPUs, respectively), AWS continues to lead in data-center investments.
Deo’s team positions Bedrock as the middle layer of a three-tier AI strategy:
Infrastructure: Comprising custom chips like Trainium and Graviton, alongside Amazon SageMaker for customers who wish to build or fine-tune their own models.
Bedrock Platform: Offering off-the-shelf and third-party models, complemented by tooling for prompt caching and multi-agent collaboration.
Applications: Fully managed software such as Q Developer and Q Business, designed to enable coders and business analysts to interact with AI using plain English.
Also Read:
- Cloud Giants Intensify AI Agent Race with New Marketplaces and Strategic Partnerships
- Amazon S3 Undergoes Major AI Transformation to Power Next-Generation Data Infrastructure
Cost discipline is also seen as a prerequisite for the next evolution of AI: autonomous agents that can perform tasks spanning minutes, hours, or even days. Early applications are already demonstrating their potential. A mortgage startup is utilizing Bedrock agents to streamline document collection, error scanning, and underwriting processes, reducing timelines from weeks to days. Similarly, real-estate firms are leveraging these bots to shorten property-sale timelines from three months to a mere fortnight by delegating diligence chores, showcasing the transformative power of these new AI agents.


