Pittsburgh — The top coding benchmark changed leaders 5 times in one month as all major labs competed. Reinforcement learning on code drove the shift, making agents daily-driver tools by November for
Hacker News