SIGNAL//SYNTH

The next big breakthrough will be AIs learning on the job

Dwarkesh Podcast

aired Jun 26, 2026 · 19.0m

▸ Listen ↗ Source

Signal

92.1/ 100

Essential

confidence 0.99

Orig60.1

Actn100.0

Dens100.0

Dpth100.0

Clty80.6

Summary

So here's the big research bet that all the labs are making. Because this kind of training will have created a kind of problem-solving agent, the kind of thing that can make progress on open-ended tasks for weeks on end in the face of errors and mistakes and ambiguity.

Why listen

It goes beyond the title with direct discussion of model, learning, training, including: They think that if we train AIs to accomplish millions of verifiable tasks across thousands of diverse RL environments, then we will have basically built AGI.

Key takeaways

01Because this kind of training will have created a kind of problem-solving agent, the kind of thing that can make progress on open-ended tasks for weeks on end in the face of errors
02And the people who are optimistic about this vision will say that all these things that we talk about as the fundamental deficits in the current training paradigm, for example, the
03So in the previous essay, I talked about how these models are one-to-one-millionth as sample efficient as humans

Best for

platform teams improving retrieval and memoryresearch-minded practitioners comparing model behavior