Google's Gemini 3.1 Pro underperforms newer models in benchmark tests

Google's Gemini 3.1 Pro is now scoring below Gemini 3.5 Flash and Claude 3.5 Sonnet in recent benchmarks, raising questions about the model's competitive positioning.

Google's Gemini 3.1 Pro has fallen behind both Gemini 3.5 Flash and Claude 3.5 Sonnet in recent benchmark evaluations, according to user reports shared on Reddit. The performance gap marks a shift in Google's model lineup and suggests that the company's newer, lighter-weight offerings now outperform...

Method & sources
Source type: Primary publication (lab/vendor blog); our analysis + implication
Source link: r/geminiai
Published: UTC
Byline: By the gotcontext.ai team (editorial standards)
Correction? corrections@gotcontext.ai
