SMRTR AIDec 11, 2025Daily.dev

Inception Labs: Making LLMs Faster and More Cost-Efficient

SMRTR summary

Inception Labs has developed Mercury, a diffusion-based language model that generates tokens in parallel rather than sequentially like traditional autoregressive models such as ChatGPT and Claude, making it 5-10 times faster and more cost-efficient at 25 cents per million input tokens, with the company focusing on speed-sensitive applications like coding auto-complete and voice agents.

SMRTR provides this summary for quick context. The original article belongs to Daily.dev.

Read the original article
SMRTR AI

Get the next batch of curated summaries in your inbox.

This archive is built from SMRTR newsletter summaries. Subscribe for hand-picked stories without the extra noise.