SIGNAL//SYNTH
Ai

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

aired Jun 04, 2026 · 75.0m
Signal
76.5/ 100
High signal
confidence 0.99
Orig59.5
Actn100.0
Dens100.0
Dpth100.0
Clty48.0
Summary

Thank you for having us. Yeah, I'm Lucas and I'm Axel.

Why listen

It goes beyond the title with direct discussion of like, yeah, think, including: Thank you for having us.

Key takeaways
  1. 01Yeah, so we did work with, like, Anthropic was one of our early customers in doing evals
  2. 02So we did, like, dangerous capability evals, nothing we published openly
  3. 03But then we started thinking about doing some kind of public benchmark
Best for
research-minded practitioners comparing model behavior