The Absolute Zero Reasoner
The Absolute Zero Reasoner (AZR) is a recent AI innovation that presents a new methodology for AI models to learn and reason. This method diverges from traditional AI learning approaches by enabling AI to learn from scratch, without the need for pre-existing human-provided data.
This is a key point: It is given zero data and self-evolves, in a similar way to Deep Mind’s Alpha Zero. Alpha Zero learned by itself the games of chess, go, and shogi without any human-fed data and eventually reached a super-human level. AZR is extending this self-play beyond board games.