LLMs & Foundation Models

Google Rolls Out Gemini 3 Flash as Its New Default AI Model

The cheaper, faster variant now powers the Gemini app and Search globally.

Andrés Martínez
Andrés MartínezAI Content Writer
December 18, 20252 min read
Share:
Abstract visualization of fast AI processing with lightning and neural network patterns in Google brand colors

Google shipped Gemini 3 Flash on Wednesday, immediately making it the default model across the Gemini app and AI Mode in Search. The model slots in below Gemini 3 Pro in the lineup but matches or beats it on several benchmarks, which raises an obvious question: why pay more for Pro?

The company positions Flash as the "workhorse model," per Tulsee Doshi, senior director of Gemini product. It costs $0.50 per million input tokens and $3 per million output, compared to Pro's $2 and $12 respectively. Google claims it outperforms the previous Gemini 2.5 Pro while running three times faster, citing third-party benchmarks from Artificial Analysis.

On Google's own numbers, Flash scores 90.4% on GPQA Diamond and 33.7% on Humanity's Last Exam without tools. For context, Gemini 3 Pro hit 37.5% on HLE while GPT-5.2 scored 34.5%, though all these figures come from the companies themselves. Flash actually beat Pro on SWE-bench Verified for coding tasks (78%), which seems backwards until you consider the benchmarks might not capture what makes Pro worth the premium.

JetBrains, Figma, Cursor, and Harvey are already using Flash in production. Google says the API has been processing over a trillion tokens daily since launching Gemini 3 Pro last month.

The Bottom Line: Google now defaults users to a model that nearly matches Pro performance at a quarter of the price, betting volume matters more than margin.


QUICK FACTS

  • API pricing: $0.50/1M input tokens, $3/1M output tokens
  • Released: December 17, 2025
  • GPQA Diamond: 90.4% (Google-reported)
  • SWE-bench Verified: 78% (Google-reported)
  • Context window: 1M input tokens, 65K output tokens
  • Knowledge cutoff: January 2025
Tags:GoogleGemini 3 FlashAI modelsGemini APIAI pricingmachine learning
Andrés Martínez

Andrés Martínez

AI Content Writer

Andrés reports on the AI stories that matter right now. No hype, just clear, daily coverage of the tools, trends, and developments changing industries in real time. He makes the complex feel routine.

Related Articles

Stay Ahead of the AI Curve

Get the latest AI news, reviews, and deals delivered straight to your inbox. Join 100,000+ AI enthusiasts.

By subscribing, you agree to our Privacy Policy. Unsubscribe anytime.

Google Rolls Out Gemini 3 Flash as Its New Default AI Model | aiHola