Skip to main content
Measured savings across 11 LLMs, from Claude Opus 4.7 to Gemini Flash.→ See per-model data
Connect your client
Industry News

The 100B-120B model gap widens as labs chase extremes

A critical size class is disappearing from open-source releases. Labs are shipping either 25B-35B models or 200B+ variants, leaving the 100B-120B sweet spot abandoned for months.

1 min read

The open-source model market is experiencing a puzzling absence. The 100B to 120B parameter range, once a reliable tier for production deployments, has stalled. The last meaningful release in this window was GPT-OSS-120B over 10 months ago, followed by scattered models like GLM-4.5-Air, Nemotron-3-S...

Sign in to read the full analysis

Free account. Full analysis on LLM unit economics, plus the weekly Cost-of-Inference column.

Try it on your own context

You just read the writeup. Now run the thing. Paste a doc or some verbose tool output and watch it shrink — free, no signup.

2,912/12,000 chars
Compressed
Compressed text will appear here…
Method & sources
Source type
Primary publication (lab/vendor blog) — our analysis + implication
Source link
r/localllama
Published
UTC
Byline
By the gotcontext.ai team (editorial standards)
Correction?
corrections@gotcontext.ai