Measured savings across 11 LLMs — Claude Opus 4.7 to Gemini Flash.→ See per-model data
Get free API key →
Industry News

Google's Gemini 3.5 Flash costs 6x more than predecessor, users report token

Google's new Gemini 3.5 Flash model is generating significantly longer responses than prior versions, pushing per-token costs to near Gemini 3.1 Pro pricing despite lower reasoning capability, according to heavy users of

1 min read

Google has released Gemini 3.5 Flash as a cost-efficient alternative to its Pro tier, but early adopters report the model is substantially more expensive than its predecessor due to extreme token verbosity across multi-turn document processing workflows.

According to [users reporting on Reddit](htt...

Sign in to read the full analysis

Free — just an email. Get full analysis on LLM unit economics, plus the weekly Cost-of-Inference column.

Method & sources
Source type
Primary publication (lab/vendor blog) — our analysis + implication
Source link
r/geminiai
Published
UTC
Byline
By the gotcontext.ai team (editorial standards)
Correction?
corrections@gotcontext.ai
Google's Gemini 3.5 Flash costs 6x more than predecessor, users report token — gotcontext.ai