SMRTR AI• Feb 16, 2025• Daily.dev

Run LLMs on macOS using llm-mlx and Apple’s MLX framework

SMRTR summary

Apple's MLX framework enables fast local language models on Macs. The new llm-mlx plugin for Python allows easy access to over 1,000 MLX-compatible models, with the 1.8GB Llama 3.2 3B generating 152 tokens per second and larger models like the 40GB Llama 3.3 70B running at 8.8 tokens per second.

SMRTR provides this summary for quick context. The original article belongs to Daily.dev.

Read the original article

Run LLMs on macOS using llm-mlx and Apple’s MLX framework

Get the next batch of curated summaries in your inbox.