Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025) ...
Students haven't lost the ability to work through confusion, but they've developed a reflex to avoid it. Remove the escape hatch, and the ability returns within minutes.