@pengrui_han
The human brain is strikingly modular, with distinct networks for language, formal reasoning, social reasoning, and physical reasoning. Is this a fundamental principle of how intelligent systems are built, or an accident of biological evolution? In our latest preprint, we find that a similar modular organization emerges in Large Language Models, another class of intelligent system. Brains and LLMs are shaped by entirely different kinds of optimization (biological evolution vs. gradient descent). That they arrive at the same modular design anyway suggests modularity may be a fundamental property of intelligent systems. 🌐 Web: https://t.co/ZKrnTSSuSf 📄 Paper: https://t.co/ZibBXz3PUy 💻 Code & data: https://t.co/uBo5iOYNjy Using circuit analyses across 46 tasks spanning four cognitive domains, we find: 1️⃣ Tasks that draw on the same network in humans recruit overlapping units in LLMs, while tasks drawing on different networks recruit distinct units. 2️⃣ These units are causally linked to model behavior. Ablating the units critical for one domain impairs performance in that domain (−26% accuracy) but barely touches the others (−2.5%). This project has been in the works for a while :) Huge thanks to my advisors @jacobandreas @ev_fedorenko @devarda_a, and to @Nancy_Kanwisher for valuable conceptual input and feedback throughout. #MIT