Nvidia plans new chip to speed AI processing
Technology
Nvidia plans to launch a new processor for faster AI systems at its GTC conference, aimed at improving inference computing. The platform will include a chip from startup Groq, sources say.
(Reuters) - Nvidia (NVDA.O) plans to launch a new processor designed to help OpenAI and other customers build faster, more efficient AI systems, the Wall Street Journal reported on Friday, citing people familiar with the matter.
Nvidia is developing a new system for "inference" computing, a form of processing that allows AI models to respond to queries, the report said.
The new platform is set to be unveiled at Nvidia’s GTC developer conference in San Jose next month and will incorporate a chip designed by startup Groq, the report added citing people familiar.
Reuters could not immediately verify the report. Nvidia and OpenAI did not immediately respond to Reuters request for comment.
Reuters earlier this month reported OpenAI is unsatisfied with the speed at which Nvidia’s hardware can spit out answers to ChatGPT users for specific types of problems such as software development and AI communicating with other software.
It needs new hardware that would eventually provide about 10% of OpenAI’s inference computing needs in the future, one of the sources told Reuters.
The ChatGPT maker has discussed working with startups including Cerebras and Groq to provide chips for faster inference, two sources said. But Nvidia struck a $20-billion licensing deal with Groq that shut down OpenAI’s talks, one of the sources told Reuters.
In September, Nvidia said it intended to pour as much as $100 billion into OpenAI as part of a deal that gave the chipmaker a stake in the startup and gave OpenAI the cash it needed to buy the advanced chips.