DeepSeek announced the release and open-source launch of its latest AI model, DeepSeek-V3, via a WeChat post on Tuesday. Users can now interact with the V3 model on DeepSeek’s official website. According to the post, DeepSeek-V3 boasts 671 billion parameters, with 37 billion activated, and was pre-trained on 14.8 trillion tokens. Compared to the V2.5 version, the new model’s generation speed has tripled, with a throughput of 60 tokens per second. Although it currently lacks multi-modal input and output support, DeepSeek-V3 excels in multilingual processing, particularly in algorithmic code and mathematics. In multiple benchmark tests, DeepSeek-V3 outperformed open-source models such as Qwen2.5-72B and Llama-3.1-405B, matching the performance of top proprietary models such as GPT-4o and Claude-3.5-Sonnet. [DeepSeek official WeChat account, in Chinese]