DeepSeek’s new AI model appears to be one of the best ‘open’ challengers yet
SMRTR summary
DeepSeek, a Chinese AI firm, has released DeepSeek V3, a powerful open AI model. The model outperforms both open and closed AI models in coding and text-based tasks, according to internal benchmarks. DeepSeek V3 boasts 671 billion parameters and was trained on 14.8 trillion tokens. The company claims it cost only $5.5 million to develop, using Nvidia H800 GPUs. However, the model avoids politically sensitive topics due to Chinese regulations.
SMRTR provides this summary for quick context. The original article belongs to TechCrunch.
Read the original article