On March 6, Alibaba released and open-sourced its new reasoning model, QwQ-32B, which has 32 billion parameters. Despite being far smaller than DeepSeek-R1, which has 671 billion parameters (37 billion active), QwQ-32B matches its performance on various benchmarks. QwQ-32B excelled in math and coding tests, outperforming OpenAI’s o1-mini and the distilled versions of DeepSeek-R1. It also scored higher than DeepSeek-R1 in some evaluations, such as LiveBench and IFEval. The model leverages reinforcement learning and integrates agent capabilities for critical thinking and adaptive reasoning. Notably, QwQ-32B requires much less computational power, making it deployable on consumer-grade hardware. The release aligns with Alibaba’s AI strategy, which includes significant investments in cloud and AI infrastructure. Following the release, Alibaba’s US stock rose 8.61% to $141.03, with Hong Kong shares up over 7%. [Jiemian, in Chinese]