Tag

Inference

All articles tagged with #inference

technology24 days ago•5 min saved

Nvidia-OpenAI $100B plan fizzles into non-binding talks

A September 2025 letter of intent for Nvidia to invest up to $100 billion in OpenAI’s AI infrastructure has not materialized; Nvidia’s Jensen Huang says the figure was never a commitment, and Reuters reports OpenAI has been seeking alternatives and citing Nvidia chip speed concerns for inference. OpenAI has since struck deals with Cerebras, Groq, AMD, and Broadcom to diversify compute, while Nvidia emphasizes a large future investment but not at that scale. The news triggered a stock dip for Nvidia and highlighted questions about timing and strategic fit.

via Ars Technica|

#ai-chips #inference #investment

technology25 days ago•13 min saved

OpenAI weighs chip alternatives after Nvidia inference gaps

OpenAI is reportedly seeking alternatives to Nvidia GPUs due to dissatisfaction with inference performance, citing eight sources. The move follows reports that Nvidia’s plan to invest up to $100 billion in OpenAI has stalled. OpenAI has previously struck deals with AMD and Broadcom to develop custom AI accelerators, signaling a push to diversify hardware sources even as Nvidia remains a major partner.

via Sherwood News|

#amd #broadcom #inference

technology1 month ago•3 min saved

NVIDIA and VAST Data Advance AI Storage and Inference Technologies

NVIDIA has announced the BlueField-4 data processor powering a new AI-native storage platform designed to enhance long-term memory and context sharing for large-scale AI inference, boosting performance and efficiency for multi-agent AI systems, with availability expected in late 2026.

via NVIDIA Newsroom|

#ai-native-storage #bluefield-4 #inference

technology1 year ago•2 min saved

Nvidia CEO Jensen Huang Envisions Unprecedented AI and Computing Growth

Nvidia's CEO Jensen Huang addressed investor concerns about the company's future growth amid new AI model improvement methods like "test-time scaling," which enhances AI inference by adding more compute power. Despite competition from startups developing fast AI inference chips, Huang emphasized Nvidia's strong position in the market, noting that while most workloads currently focus on pretraining, the future will see increased AI inference. He reassured investors of Nvidia's scale and reliability, aligning with industry leaders like Microsoft's Satya Nadella on the significance of these developments.

via TechCrunch|

#ai #chips #inference

technology2 years ago•4 min saved

"Intel CEO Aims to Dethrone NVIDIA's CUDA Dominance, Open to Rival Chip Manufacturing"

Intel's CEO, Pat Gelsinger, stated that the entire industry is motivated to eliminate NVIDIA's CUDA dominance in the AI market. Intel believes that the future of AI lies in inference rather than training models and aims to prioritize inference developments. Gelsinger sees NVIDIA's success as a temporary "bubble" and believes that the industry will adopt new training methods to bring a broader set of technologies. Intel praised its OpenVINO model and aims to transition towards next-gen markets. However, Intel needs to do more work to challenge CUDA's dominance, and for now, NVIDIA remains the leader in the AI segment.

via Wccftech|

#ai #cuda #inference

technology2 years ago•2 min saved

"Nvidia's New AI Chip Promises Lower Costs and Reinvents Computing"

Nvidia has unveiled its new AI chip, the GH200, designed for running artificial intelligence models. The chip features a powerful GPU paired with 141GB of cutting-edge memory and a 72-core ARM central processor. Nvidia aims to address the increasing demand for GPU capacity by offering a chip that allows larger AI models to fit on a single system, reducing the need for multiple GPUs. The company expects the new chip to significantly lower the costs of running large language models for inference, making it more accessible for various applications. Nvidia's announcement comes as it faces competition from rivals such as AMD, Google, and Amazon in the AI hardware space.

via CNBC|

#ai-chips #ai-models #gh200