Tag

Trainium

All articles tagged with #trainium

Maia 200 AI chip promises threefold FP4 power, edging out TPU and Trainium in inference
technology1 month ago

Maia 200 AI chip promises threefold FP4 power, edging out TPU and Trainium in inference

Microsoft unveiled Maia 200, an AI inference accelerator for Azure, claiming it delivers over 10 petaflops at FP4 and 5 PFLOPS at FP8, with 3x FP4 performance versus Amazon’s Trainium Gen3 and FP8 performance above Google’s TPU Gen7. Built on TSMC’s 3-nanometer process with about 100 billion transistors, Maia 200 is designed for data-center inference to speed Copilot and Azure OpenAI workloads, featuring a memory system to keep model weights local and is currently deployed in a US data center with broader Azure availability planned in the future.

Amazon Unveils AI Supercomputer to Challenge Nvidia's Dominance
technology1 year ago

Amazon Unveils AI Supercomputer to Challenge Nvidia's Dominance

Amazon is developing an AI supercomputer using its in-house Trainium chips and a new server for AI startup Anthropic, following a $4 billion investment in the company. This move by Amazon Web Services aims to position Trainium as a competitive alternative to Nvidia's GPUs. The "Ultracluster," designed by Amazon's chip lab, will feature hundreds of thousands of Trainium2 chips, significantly enhancing AI model training capabilities.

Amazon Invests $4 Billion in Anthropic to Challenge Nvidia and OpenAI
technology1 year ago

Amazon Invests $4 Billion in Anthropic to Challenge Nvidia and OpenAI

Amazon has announced a $4 billion investment in AI startup Anthropic, aiming to increase the use of its Trainium AI chips and challenge Nvidia's dominance in the AI chip market. This brings Amazon's total investment in Anthropic to $8 billion, with Anthropic agreeing to use AWS as its primary cloud and training partner. Despite the investment, Anthropic CEO Dario Amodei expressed a balanced approach, using chips from Nvidia, Google, and Amazon. The collaboration includes developing Amazon's AI-model platform, AWS Neuron, to compete with Nvidia's CUDA software stack.