01. What It Is
A world model is a system's internal, learned approximation of how its environment works. Given a current state and an action, the model predicts what the next state will be. The agent can then use this model to mentally simulate candidate action sequences, evaluate their consequences, and choose actions without needing to physically execute each possibility first.
The term entered machine learning through Juergen Schmidhuber's 1990 work on recurrent neural networks that predict future states. David Ha and Schmidhuber's 2018 paper "World Models" (arXiv:1803.10122) brought the concept to widespread attention, demonstrating agents that learned to play video games by training almost entirely inside hallucinated simulations generated by their own world model.
World models are distinct from systems that only classify inputs or generate outputs. They model dynamics: causality, physics, object persistence, and temporal transitions. The question they answer is not "what is this?" but "what will happen next if I do X?"