Ever wondered what an LLM is doing when it's "thinking"?
In this episode of Release Notes Explained, we cover the fundamentals of how thinking and reasoning models work, including concepts like:
- Scaling laws
- Test-time compute
- Reinforcement learning from verifiable rewards
Hope you enjoy! 🩵
Questions? Leave them down below.