CodeMLSLet Code Learn to Think

Recording the full process from idea → structure → implementation → review: LLM engineering, Agent practice, software engineering methods, and continuous learning notes.

Read Latest Start from Columns Browse All Articles

Latest2026-04-2910 min readAgent Architecture Learning

What's Actually Happening When a Model "Thinks"?

What's Actually Happening When a Model "Thinks"? More and more models now have a "thinking mode." Some call it thinking, some call it reasoning, and some thinking modes even come with an effort option that lets you tune the intensity from low to high. After turning it on, you can feel the model "thinking" — responses slow down, and sometimes the results really do get more accurate. But I always had a question: what exactly is it thinking about? What's the difference between this "thinking" and a normal output? More importantly, what's the actual cost of "letting it think a bit more"? And what's the difference between different effort levels?