🔴 LIVE — Updated every 10 minutes
👤 -- reading now 🌡 Nairobi
Breaking
HomeTechnologyWhy Weibo’s tiny VibeThinker-3B has the…
Technology

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

VentureBeat Jun 17, 2026 2h ago ⏱ 1 min read 👁 5 views
Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again
Image via VentureBeat
📋 Article Summary
193 words
On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14-page technical report to arXiv that sent shockwaves through… On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14-page technical report to arXiv that sent shockwaves through the AI research community. Their claim: a language model with just 3 billion parameters can match or exceed the reasoning performance of flagship systems from Google DeepMind, OpenAI, Anthropic, and DeepSeek that are hundreds of times larger.The model, called VibeThinker-3B, scored 94.3 on AIME 2026 — the American Invitational Mathematics Examination, one of the most demanding standardized math competitions in the world. That figure places it alongside DeepSeek V3.2, a model with 671 billion parameters, and ahead of Gemini 3 Pro, Google's high-performance flagship reasoning system, which scored 91.7. With a test-time scaling technique the team calls Claim-Level Reliability Assessment, the score climbs to 97.1, edging past virtually every system in the public record.Within hours of publication, the paper had…
Continue Reading
Full story on VentureBeat
Read Full Story →
🔗 Clicking will take you to venturebeat.com
Share this story: WhatsApp X/Twitter Facebook
👁 People Also Read
KE
👨🏿‍🚀TechCabal Daily – EVs crash SA’s road fund
Technology

👨🏿‍🚀TechCabal Daily – EVs crash SA’s road fund

In today's edition: DRC unveils National digital ID system || South Africa's EV transition || Kenya's fuel prices just dropped…

Read
KE
Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
Technology

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

Today, Chinese AI startup Z.ai (formerly Zhipu AI) announced the immediate release of GLM-5.2, a 753-billion parameter open-weights large language…

Read
SpaceX is public: Everything you need to know post-IPO
Technology

SpaceX is public: Everything you need to know post-IPO

TechCrunch has followed SpaceX's start, struggles, and successes from the early days. And we're here for what happens next too.…

Read
Meta’s new ‘AI Mode’ on Facebook pulls from public info across its platforms
Technology

Meta’s new ‘AI Mode’ on Facebook pulls from public info across its platforms

Meta announced Monday that it's rolling out a wave of new AI features on Facebook, the latest sign of the…

Read