@DrJimFan
Let's talk about which papers *actually* show a hint of LLM's internal world models. There're quite a few, but I'll highlight 2 in game AI. 1. Voyager (shameless self-plug). In Minecraft, Voyager is able to make decisions by world modeling. Example: "hunger bar is low -> if I don't get food soon I'll die -> I see a cat, a pig, and a villager nearby -> which one should I hunt? -> pig, because killing the other 2 wouldn't give me food even if I succeed -> check inventory, no good weapon -> [go craft stone sword] -> ugh pig ran away -> [start hunting sheep]" This trace of thought involves counterfactual reasoning and active intervention given the agent and the world's current state. Voyager anticipates what it needs by mentally simulating the future, and plan against that "imagined future" accordingly. It does extensive exploration and acquires new embodied skills along the way via the skill library mechanism. It makes mistakes but adjusts course of action to avoid them in the future. Now this fits perfectly with @ylecun's characterization. We did not mention world models in the paper, but now I think we should have. I'll update Arxiv accordingly. https://t.co/1d3YocozsI 2. Othello-GPT: https://t.co/VcKbmKDPG2. This is a much simpler game than Minecraft, but it shows that LLM can develop a world model of the game by training on histories of game moves. The model has no a priori knowledge of the game rules. Now you can use it to answer questions like "what would the opponent do had I made a different move?", or "is this move legal given the current world state?". The authors also discuss an intervention technique that suggests that the world model can be used to control the network’s behavior.
MineDojo
Voyager | An Open-Ended Embodied Agent with Large Language Models
Voyager is the first LLM-powered lifelong learning agent in Minecraft, capable of self-driven exploration and skill acquisition without human interven...
• Utilizes GPT-4 for exploration and skill development.
• Achieves 3.3x more unique items in Minecraft.
arXiv
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
This article investigates whether language models rely on internal representations or just memorize statistics by applying a GPT variant to Othello....
• Explores internal representations in language models.
• Uses a GPT model for predicting moves in Othello.