Technology

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

VentureBeat Jun 17, 2026 2h ago ⏱ 1 min read 👁 5 views

Image via VentureBeat

📋 Article Summary

193 words

On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14-page technical report to arXiv that sent shockwaves through… On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14-page technical report to arXiv that sent shockwaves through the AI research community. Their claim: a language model with just 3 billion parameters can match or exceed the reasoning performance of flagship systems from Google DeepMind, OpenAI, Anthropic, and DeepSeek that are hundreds of times larger.The model, called VibeThinker-3B, scored 94.3 on AIME 2026 — the American Invitational Mathematics Examination, one of the most demanding standardized math competitions in the world. That figure places it alongside DeepSeek V3.2, a model with 671 billion parameters, and ahead of Gemini 3 Pro, Google's high-performance flagship reasoning system, which scored 91.7. With a test-time scaling technique the team calls Claim-Level Reliability Assessment, the score climbs to 97.1, edging past virtually every system in the public record.Within hours of publication, the paper had…

This is a summary. Read the complete story on VentureBeat below.

Full story on VentureBeat

Read Full Story →

🔗 Clicking will take you to venturebeat.com

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

👨🏿‍🚀TechCabal Daily – EVs crash SA’s road fund

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

SpaceX is public: Everything you need to know post-IPO

Meta’s new ‘AI Mode’ on Facebook pulls from public info across its platforms