Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
reinforcementlearning
Follow
Hide
Posts
Left menu
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Solving CartPole Without Gradients: Simulated Annealing
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 23
Solving CartPole Without Gradients: Simulated Annealing
#
reinforcementlearning
#
optimisation
Comments
Add Comment
13 min read
The Cross-Entropy Method: Solving RL Without Gradients
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 21
The Cross-Entropy Method: Solving RL Without Gradients
#
reinforcementlearning
#
optimisation
1
 reaction
Comments
Add Comment
12 min read
Self-Learning AI Agents; Architectures and Challenges
Vishal Uttam Mane
Vishal Uttam Mane
Vishal Uttam Mane
Follow
Apr 21
Self-Learning AI Agents; Architectures and Challenges
#
selflearningai
#
aiagents
#
agentarchitecture
#
reinforcementlearning
1
 reaction
Comments
1
 comment
3 min read
Spilling beans for how i learn for examđ"Reinforcement Learning Cheat Sheet"
Keerthana
Keerthana
Keerthana
Follow
Apr 19
Spilling beans for how i learn for examđ"Reinforcement Learning Cheat Sheet"
#
reinforcementlearning
#
rl
#
ai
#
student
5
 reactions
Comments
Add Comment
2 min read
Top 15 Reinforcement Learning Questions That Will Appear in Exams
Keerthana
Keerthana
Keerthana
Follow
Apr 19
Top 15 Reinforcement Learning Questions That Will Appear in Exams
#
rl
#
reinforcementlearning
#
ai
#
students
6
 reactions
Comments
Add Comment
2 min read
Embodied AI Systems: Extending Intelligence Through Learning in the Environment
shangkyu shin
shangkyu shin
shangkyu shin
Follow
Apr 11
Embodied AI Systems: Extending Intelligence Through Learning in the Environment
#
ai
#
machinelearning
#
reinforcementlearning
#
robotics
Comments
Add Comment
2 min read
Policy Gradients: REINFORCE from Scratch with NumPy
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 8
Policy Gradients: REINFORCE from Scratch with NumPy
#
reinforcementlearning
#
deeplearning
#
optimisation
Comments
Add Comment
16 min read
Deep Q-Networks: Experience Replay and Target Networks
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 6
Deep Q-Networks: Experience Replay and Target Networks
#
reinforcementlearning
#
deeplearning
#
optimisation
Comments
Add Comment
18 min read
Q-Learning from Scratch: Navigating the Frozen Lake
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 4
Q-Learning from Scratch: Navigating the Frozen Lake
#
reinforcementlearning
#
optimisation
Comments
Add Comment
11 min read
Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)
Pranjal Raut
Pranjal Raut
Pranjal Raut
Follow
Mar 30
Why Most Game NPCs Feel Dead (And How Emotion and Memory Fix It)
#
gamedev
#
ai
#
reinforcementlearning
#
machinelearning
1
 reaction
Comments
Add Comment
4 min read
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!
Mariano Gobea Alcoba
Mariano Gobea Alcoba
Mariano Gobea Alcoba
Follow
Mar 30
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models!
#
hamiltonjacobibellman
#
hjb
#
reinforcementlearning
#
rl
Comments
Add Comment
11 min read
A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there
nasuy
nasuy
nasuy
Follow
Apr 1
A free model matched GPT-5.2. No fine-tuning. It rewrote its own skill files until it got there
#
reinforcementlearning
5
 reactions
Comments
Add Comment
4 min read
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide
Abhishek Nair
Abhishek Nair
Abhishek Nair
Follow
Mar 15
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide
#
reinforcementlearning
#
robotics
#
rl
#
sac
1
 reaction
Comments
Add Comment
52 min read
How I Built a Readable AlphaZero From Scratch â A Deep Dive Into the Code
Zhixiang Li
Zhixiang Li
Zhixiang Li
Follow
Mar 1
How I Built a Readable AlphaZero From Scratch â A Deep Dive Into the Code
#
alphazero
#
reinforcementlearning
#
deeplearning
#
python
1
 reaction
Comments
Add Comment
10 min read
From Pixels to Physicality âď¸: Engineering Olaf with Reinforcement ⨠Learning, Control Systems, and Illusion Design đ¤
Hemant
Hemant
Hemant
Follow
Mar 22
From Pixels to Physicality âď¸: Engineering Olaf with Reinforcement ⨠Learning, Control Systems, and Illusion Design đ¤
#
ai
#
machinelearning
#
rpa
#
reinforcementlearning
2
 reactions
Comments
Add Comment
8 min read
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account