DEV Community

Nikita Namjoshi for Google AI

Posted on

What is an LLM actually doing when it's "thinking"?

Ever wondered what an LLM is doing when it's "thinking"?

In this episode of Release Notes Explained, we cover the fundamentals of how thinking and reasoning models work including concepts like:

  • Scaling laws
  • Test-time compute
  • Reinforcement learning from verifiable rewards

Hope you enjoy! 🩵

Questions? Leave them down below.

Top comments (1)

Collapse
 
benjamin_nguyen_8ca6ff360 profile image
Benjamin Nguyen

nice!