Alibaba has claimed that its open-source large model Qwen2-Math delivers “state-of-the-art” math competency,Too Naughty to Say No (1985) - Remastered saying it handles mathematical problems in algebra and geometry with 84% accuracy, and has outperformed OpenAI’s GPT 4o and Google’s Gemini 1.5 Pro. The math-centered model, which was trained on Alibaba’s foundation Qwen2 model by feeding it a large-scale high-quality math corpus, is currently only supported in English, with a bilingual version coming soon, according to the tech giant. The Alibaba team has challenged Qwen2-Math with multiple benchmarks including China’s latest GaoKao (college entrance exam) math problems and questions from US math competition AIME, which all proved the model’s proficiency in dealing with advanced mathematical problems. [QbitAI, in Chinese]
Related Articles
2025-06-26 18:38
315 views
NYT Strands hints, answers for April 14
If you're reading this, you're looking for a little help playing Strands, the New York Times' elevat
Read More
2025-06-26 17:01
1024 views
10 best tweets of the week, including this photo of my adorable new puppy
The work week is gone and it's the weekend. So at least there's that.I can't promise you'll have a g
Read More
2025-06-26 16:41
1894 views
Chaucer Lived and Wrote in Squalor, a New Book Says
Chaucer’s Bachelor Pad, and Other NewsBy Dan PiepenbringJanuary 19, 2015On the ShelfFrom Portrait an
Read More