"Physics of Language Models" series
updated
Physics of Language Models: Part 1, Context-Free Grammar
Paper
• 2305.13673
• Published • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation
Paper
• 2309.14402
• Published • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
Paper
• 2404.05405
• Published • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Paper
• 2309.14316
• Published • 9
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden
Reasoning Process
Paper
• 2407.20311
• Published • 5
Physics of Language Models: Part 2.2, How to Learn From Mistakes on
Grade-School Math Problems
Paper
• 2408.16293
• Published • 27
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper
• 2512.17351
• Published • 28