deep-think - Sean Kim — Arts and Tech

January 12, 2026

Published by Sean Kim on January 12, 2026

Gemini 3 Pro Preview Deep Think Mode: Is Google’s $250 AI Ultra Worth the 93.8% GPQA Score?

GPQA Diamond 93.8%. Humanity’s Last Exam 41.0%. ARC-AGI-2 45.1%. These are the numbers Google’s Gemini 3 Pro Preview Deep Think mode posted — and accessing them […]

November 6, 2025

Published by Sean Kim on November 6, 2025

Google Gemini 3 Launch: Multimodal Reasoning Over Text, Code, and Video

1501 Elo. That single number just rewrote the AI leaderboard — and Google Gemini 3 is the model behind it. After months of leaks, vague social […]

May 30, 2025

Published by Sean Kim on May 30, 2025

Google I/O 2025 Gemini 2.5 Pro & Flash — Complete Keynote Breakdown 10 Days Later

Every category on LMArena — swept. A math-olympiad-level reasoning mode that thinks in parallel. A lightweight model that uses 30% fewer tokens while actually getting better. […]