LLM Inference Optimization

Researchers automated LLM reasoning strategy design and cut token usage by 69.5%

Researchers from Meta and Google built AutoTTS to automatically discover optimal LLM reasoning strategies, cutting token ...

VentureBeat

New LLM optimization technique slashes memory costs up to 75%

Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...

Crypto Briefing

MIT’s MeMo framework boosts LLM performance by 26% without retraining

MIT's MeMo framework trains a compact memory model that boosts LLM performance by up to 26.73% without retraining, with major implications for crypto AI agents.

Search Engine Land

LLM optimization in 2026: Tracking, visibility, and what’s next for AI discovery

Marketing, technology, and business leaders today are asking an important question: how do you optimize for large language models (LLMs) like ChatGPT, Gemini, and Claude? LLM optimization is taking ...

Semiconductor Engineering

HW-SW Co-Designed System With 3 Core Optimization Pathways For Long-Context Agentic LLM Inference (Cambridge, ICL)

A new technical paper titled “Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference” was published by researchers at University of Cambridge, Imperial College London ...

Business Wire

MangoBoost Launches Mango LLMBoost™: AI Inference Optimization Software with Up to 12.6x Relative Performance Improvement and 92% Cost Savings

BELLEVUE, Wash.--(BUSINESS WIRE)--MangoBoost, a provider of cutting-edge system solutions designed to maximize AI data center efficiency, is announcing the launch of Mango LLMBoost™, system ...

Forbes

Hide inaccessible results

Researchers automated LLM reasoning strategy design and cut token usage by 69.5%

New LLM optimization technique slashes memory costs up to 75%

MIT’s MeMo framework boosts LLM performance by 26% without retraining

LLM optimization in 2026: Tracking, visibility, and what’s next for AI discovery

HW-SW Co-Designed System With 3 Core Optimization Pathways For Long-Context Agentic LLM Inference (Cambridge, ICL)

MangoBoost Launches Mango LLMBoost™: AI Inference Optimization Software with Up to 12.6x Relative Performance Improvement and 92% Cost Savings

The New Frontier Of LLM Inference: Where The Next Tenfold Gains Will Come From

Defeating Nondeterminism in LLM Inference by Thinking Machines

ChatGPT Can ‘Infer’ Personal Details From Anonymous Text