SMRTR AI• Dec 11, 2025• Daily.dev

Inception Labs: Making LLMs Faster and More Cost-Efficient

SMRTR summary

Inception Labs has developed Mercury, a diffusion-based language model that generates tokens in parallel rather than sequentially like traditional autoregressive models such as ChatGPT and Claude, making it 5-10 times faster and more cost-efficient at 25 cents per million input tokens, with the company focusing on speed-sensitive applications like coding auto-complete and voice agents.

SMRTR provides this summary for quick context. The original article belongs to Daily.dev.

Read the original article

Inception Labs: Making LLMs Faster and More Cost-Efficient

Get the next batch of curated summaries in your inbox.