Inference Models - Search News

17m

MiroMind’s MiroThinker 1.5 delivers trillion-parameter performance from a 30B model — at 1/20th the cost

Joining the ranks of a growing number of smaller, powerful reasoning models is MiroThinker 1.5 from MiroMind, with just 30 ...

New ‘Test-Time Training’ method lets AI keep learning without exploding inference costs

By allowing models to actively update their weights during inference, Test-Time Training (TTT) creates a "compressed memory" ...

Guru3D

AMD Details Single-Node and Distributed Inference Performance on Instinct MI355X

AMD has published new technical details outlining how its AMD Instinct MI355X accelerator addresses the growing inference ...

Axios on MSN

Nvidia deal shows why inference is AI's next battleground

Chipmakers Nvidia and Groq entered into a non-exclusive tech licensing agreement last week aimed at speeding up and lowering ...

Business Wire

Vultr Launches Cloud Inference to Simplify Model Deployment and Automatically Scale AI Applications Globally

WEST PALM BEACH, Fla.--(BUSINESS WIRE)--Vultr, the world’s largest privately-held cloud computing platform, today announced the launch of Vultr Cloud Inference. This new serverless platform ...

Variant Bio Launches Inference, the World's First Agentic AI Genomic Drug Discovery Platform

Variant Bio, a genomics-driven AI drug discovery company, today announced the launch of Inference, the world's first agentic genomic drug dis ...

Forbes

The Inference Economy: How Sparse Computing And Model Optimization Are Reshaping Enterprise AI Deployment

The AI industry stands at an inflection point. While the previous era pursued larger models—GPT-3's 175 billion parameters to PaLM's 540 billion—focus has shifted toward efficiency and economic ...

Nvidia unveils inference AI models for accelerating self-driving car development

US semiconductor giant Nvidia has unveiled a new artificial intelligence platform technology designed to accelerate the ...

Opinion

The Daily Overview on MSNOpinion

Nvidia deal proves inference is AI's next war zone

The race to build bigger AI models is giving way to a more urgent contest over where and how those models actually run. Nvidia's multibillion dollar move on Groq has crystallized a shift that has been ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results