Tag

Inference

All articles tagged with #inference

Nvidia-OpenAI $100B plan fizzles into non-binding talks
technology · 24 days ago

A September 2025 letter of intent for Nvidia to invest up to $100 billion in OpenAI’s AI infrastructure has not turned into a binding deal. Nvidia’s Jensen Huang says the figure was never a commitment, and Reuters reports that OpenAI has been seeking alternatives, citing concerns about the speed of Nvidia’s chips for inference. OpenAI has since struck deals with Cerebras, Groq, AMD, and Broadcom to diversify its compute supply, while Nvidia says it still expects to make a large investment, though not at that scale. The news triggered a dip in Nvidia’s stock and raised questions about the deal’s timing and strategic fit.

OpenAI weighs chip alternatives after Nvidia inference gaps
technology · 25 days ago

OpenAI is seeking alternatives to Nvidia GPUs because it is dissatisfied with their inference performance, according to a report citing eight sources. The move follows reports that Nvidia’s plan to invest up to $100 billion in OpenAI has stalled. OpenAI has previously struck deals with AMD and Broadcom to develop custom AI accelerators, signaling a push to diversify its hardware sources even as Nvidia remains a major partner.

Nvidia CEO Jensen Huang Envisions Unprecedented AI and Computing Growth
technology · 1 year ago

Nvidia CEO Jensen Huang addressed investor concerns about the company's future growth amid new methods for improving AI models, such as "test-time scaling," which improves a model's output by giving it more compute during inference. Despite competition from startups developing fast AI inference chips, Huang emphasized Nvidia's strong market position, noting that most workloads today focus on pretraining but that inference will account for a growing share. He reassured investors of Nvidia's scale and reliability, echoing industry leaders such as Microsoft's Satya Nadella on the significance of these developments.

"Intel CEO Aims to Dethrone NVIDIA's CUDA Dominance, Open to Rival Chip Manufacturing"
technology · 2 years ago

Intel CEO Pat Gelsinger said that the entire industry is motivated to end the dominance of NVIDIA's CUDA in the AI market. Intel believes that the future of AI lies in inference rather than in training models, and it plans to prioritize inference development. Gelsinger sees NVIDIA's success as a temporary "bubble" and expects the industry to adopt new training methods that open the field to a broader set of technologies. Intel praised its OpenVINO toolkit and aims to move into next-generation markets. However, Intel has more work to do before it can challenge CUDA, and for now NVIDIA remains the leader in the AI segment.

"Nvidia's New AI Chip Promises Lower Costs and Reinvents Computing"
technology · 2 years ago

Nvidia has unveiled its new AI chip, the GH200, designed for running artificial intelligence models. The chip pairs a powerful GPU with 141GB of HBM3e memory and a 72-core Arm central processor. Nvidia aims to address the increasing demand for GPU capacity by offering a chip that lets larger AI models fit on a single system, reducing the need for multiple GPUs. The company expects the new chip to significantly lower the cost of running large language models for inference, making inference more accessible for a range of applications. Nvidia's announcement comes as it faces competition from rivals such as AMD, Google, and Amazon in the AI hardware space.