DEV Community

# reinforcementlearning

Posts

Solving CartPole Without Gradients: Simulated Annealing

Comments
13 min read
The Cross-Entropy Method: Solving RL Without Gradients

1
Comments
12 min read
Self-Learning AI Agents; Architectures and Challenges

1
Comments 1
3 min read
Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet"

5
Comments
2 min read
Top 15 Reinforcement Learning Questions That Will Appear in Exams

6
Comments
2 min read
Embodied AI Systems: Extending Intelligence Through Learning in the Environment

Comments
2 min read
Policy Gradients: REINFORCE from Scratch with NumPy

Comments
16 min read
Deep Q-Networks: Experience Replay and Target Networks

Comments
18 min read
Q-Learning from Scratch: Navigating the Frozen Lake

Comments
11 min read
Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)

1
Comments
4 min read
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!

Comments
11 min read
A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there

5
Comments
4 min read
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide

1
Comments
52 min read
How I Built a Readable AlphaZero From Scratch — A Deep Dive Into the Code

1
Comments
10 min read
From Pixels to Physicality ☃️: Engineering Olaf with Reinforcement ✨ Learning, Control Systems, and Illusion Design 🤖

2
Comments
8 min read