The AI Brief

Timeline

Sunday, Feb 8

Recent AI developments at a glance — scroll to explore

Feb 5HIGH
New Model

Claude Opus 4.6 — 1M context, agent teams

Features a 1-million-token context window and introduces agent teams — groups of agents that can split larger...

Sep 29HIGH
New Model

Claude Sonnet 4.5 — autonomous agent benchmark leader

State-of-the-art on SWE-bench Verified and OSWorld (61.4%). Can work autonomously for 30+ hours (up from 7 hou...

May 22HIGH
New Model

Claude Sonnet 4 and Opus 4 launch

Claude Sonnet 4 delivers superior coding and reasoning as a hybrid model. Claude Opus 4 is declared the world'...

Apr 16HIGH
New Model

o3 — OpenAI's smartest reasoning model with tool use

OpenAI releases o3, its most capable reasoning model, making 20% fewer major errors than o1. The first reasoni...

Apr 5HIGH
Open Source

Llama 4 Scout and Maverick — MoE + 10M context

First Llama models with native multimodality and mixture-of-experts architecture. Scout supports a 10M-token c...

Mar 25HIGH
Breakthrough

Gemini 2.5 Pro — Google's first thinking model

Google's first 'thinking model' that reasons through steps before responding. Debuted at #1 on LMArena by a si...

Feb 24HIGH
Breakthrough

Claude 3.7 Sonnet — first hybrid reasoning model

The first hybrid reasoning model on the market, able to produce near-instant responses or engage in visible ex...

Feb 17HIGH
New Model

Grok 3 — trained on 200K-GPU Colossus supercluster

Trained with 10x the compute of Grok-2 on the 200,000-GPU Colossus supercluster. Features a 1M-token context w...

Jan 31
New Model

o3-mini — fast, affordable reasoning

OpenAI releases o3-mini, a cost-efficient reasoning model optimized for STEM. Excels in science, math, and cod...

Jan 20HIGH
Open Source

DeepSeek R1 — open-source reasoning model shakes markets

An open-source reasoning model rivaling OpenAI o1. Its mobile app briefly surpassed ChatGPT as #1 free app on...

Dec 26HIGH
New Model

DeepSeek V3 — trained for $6M, rivals GPT-4o

A 671B MoE model (37B active per token) trained for only ~$6 million. Competitive with GPT-4o and Claude 3.5 S...

Dec 11HIGH
New Model

Gemini 2.0 Flash — built for the agentic era

Outperforms Gemini 1.5 Pro on key benchmarks at twice the speed. First Gemini with multimodal output (native i...

Dec 6
Open Source

Llama 3.3 70B — 405B performance at 70B cost

A 70B text-only model delivering performance comparable to the much larger Llama 3.1 405B at a fraction of the...

Dec 5HIGH
Product Launch

o1 full release with ChatGPT Pro ($200/mo)

Full release of o1 with 34% fewer major mistakes and 50% faster than the preview, now multimodal. Launched alo...

Nov 4
New Model

Claude 3.5 Haiku — Opus performance at Haiku speed

Claude 3.5 Haiku matches the performance of the prior Claude 3 Opus at the speed and cost of the smaller Haiku...

Oct 22HIGH
Breakthrough

Upgraded Claude 3.5 Sonnet + computer use beta

Upgraded Claude 3.5 Sonnet with across-the-board improvements, especially in coding. Introduces computer use i...

Sep 25HIGH
Open Source

Llama 3.2 — first multimodal + edge models

Meta's first multimodal open models (11B and 90B vision LLMs) plus lightweight models (1B and 3B) for edge and...

Sep 12HIGH
Breakthrough

o1-preview — OpenAI's first reasoning model

OpenAI launches o1-preview, its first model trained with reinforcement learning to think step-by-step before a...

Aug 14
New Model

Grok 2 — xAI enters the frontier race

xAI's first competitive frontier model, achieving performance on par with leading models on graduate-level sci...

Jul 23HIGH
Open Source

Llama 3.1 405B — first open frontier model

The first openly available frontier-level model at 405 billion parameters, rivaling top proprietary models in...

Everything happening in AI· Tracking 200+ sources so you don't have to

This Week's Signal

The biggest AI news from the last 7 days

New Model

Claude Opus 4.6 — 1M context, agent teams

Features a 1-million-token context window and introduces agent teams — groups of agents that can split larger tasks into segmented jobs. Outperformed GPT-5.2 on several benchmarks.

Feb 5
New ModelSep 29

Claude Sonnet 4.5 — autonomous agent benchmark leader

State-of-the-art on SWE-bench Verified and OSWorld (61.4%). Can work autonomously for 30+ hours (up from 7 hours with Op...

New ModelMay 22

Claude Sonnet 4 and Opus 4 launch

Claude Sonnet 4 delivers superior coding and reasoning as a hybrid model. Claude Opus 4 is declared the world's best cod...

New ModelApr 16

o3 — OpenAI's smartest reasoning model with tool use

OpenAI releases o3, its most capable reasoning model, making 20% fewer major errors than o1. The first reasoning model t...

Open SourceApr 5

Llama 4 Scout and Maverick — MoE + 10M context

First Llama models with native multimodality and mixture-of-experts architecture. Scout supports a 10M-token context win...

BreakthroughMar 25

Gemini 2.5 Pro — Google's first thinking model

Google's first 'thinking model' that reasons through steps before responding. Debuted at #1 on LMArena by a significant...

BreakthroughFeb 24

Claude 3.7 Sonnet — first hybrid reasoning model

The first hybrid reasoning model on the market, able to produce near-instant responses or engage in visible extended ste...

New ModelFeb 17

Grok 3 — trained on 200K-GPU Colossus supercluster

Trained with 10x the compute of Grok-2 on the 200,000-GPU Colossus supercluster. Features a 1M-token context window and...

New ModelJan 31

o3-mini — fast, affordable reasoning

OpenAI releases o3-mini, a cost-efficient reasoning model optimized for STEM. Excels in science, math, and coding, showi...

Open SourceJan 20

DeepSeek R1 — open-source reasoning model shakes markets

An open-source reasoning model rivaling OpenAI o1. Its mobile app briefly surpassed ChatGPT as #1 free app on the US App...

Trending Models

The AI models making the most noise right now

Quick Take

What's shaping AI this week in three lines

The race is on

Top AI labs are competing harder than ever to build the smartest model

Free AI is catching up

Open-source models are getting close to matching the paid ones

AI that works alone

Autonomous AI agents are going from experiment to everyday tool

Explore

Go deeper into each area

Get this in your Telegram, daily

Briefer sends you the top AI news every morning. Free.

Open Briefer

Get your daily AI briefing

The most important AI news, scored and summarized. Free.

Or get it on TelegramOpen Briefer