DEV Community

# reinforcementlearning

Posts

Fixing an Off-By-One Bug in PufferLib's PPO Implementation

2 min read
Multi armed bandit exercise 2.5 with C#

4 min read
Sutton & Barto Gridworld example in C#

5 min read
HRPO-X v1.0.1: from HRPO paper to production-hardened runnable code

2 min read
When Deep Learning Meets the Devil's Wheel: RL for European Roulette (Part 1)

12 min read
Gravity? Who Needs It: AI-Powered Levitation for Next-Gen Robotics

2 min read
Quantum-Inspired State Sculpting: Revolutionizing Offline Reinforcement Learning by Arvind Sundararajan

2 min read
Quantum-Inspired Encoding: A Leap in Offline Reinforcement Learning

2 min read
Quantum-Inspired Geometry: Boosting Offline Reinforcement Learning with Compact State Representations

2 min read
Quantum-Inspired Shortcuts: Reinforcement Learning on a Budget

2 min read
Quantum-Inspired Encoding: Revolutionizing Reinforcement Learning with Scarce Data

2 min read
Evolving Minds: Building Adaptable AI Through Strategic Response Learning

2 min read
The Art of Deception: How AI is Redefining Strategic Warfare by Arvind Sundararajan

2 min read
Stop Guessing! AI That Explains Its Algorithm Choices is Finally Here

2 min read
AI Autopilot for AI: Dynamically Scaling Neural Nets on Edge Devices

2 min read
Unlocking AI Speed: The Hidden Symmetries in Reinforcement Learning

2 min read
Unlock Peak Performance: Refining MCTS with Action-Aware State Grouping

2 min read
The Introspective AI Revolution: Smarter Learning Through Self-Awareness by Arvind Sundararajan

2 min read
Beyond Black Boxes: Building AI That Explains Itself

2 min read
AI Factories: Balancing Brains for Smarter Production

2 min read
Decision Trees Evolved: Faster, Smarter Reinforcement Learning by Arvind Sundararajan

2 min read
Training a Nematode with Quantum Reinforcement Learning

8 min read
Unlocking AI: The Simplicity Revolution in Reinforcement Learning

2 min read
Web Surfing AI: Teaching Robots to Shop for You

2 min read
Altruistic AI: When Helping Others Helps You Win by Arvind Sundararajan

2 min read