
GPT-5.2 matches top human reviewers in Nature peer review study

A 45-scientist study comparing 82 peer reviews found GPT-5.2 performed at parity with top-rated human reviewers, though with measurable gaps in certain evaluation dimensions.

Source: r/openai

OpenAI's GPT-5.2 matched the performance of top-tier human peer reviewers in a rigorous comparative study published in Nature, according to research conducted by 45 scientists who spent 469 hours evaluating AI and human reviews across 82 papers.

The study's...


Method & sources
Source type: Primary publication (lab/vendor blog); our analysis + implication
Source link: r/openai
Published: UTC
Byline: By the gotcontext.ai team (editorial standards)
Corrections: corrections@gotcontext.ai