Quick Summary
In this article we will:
- Cover the basic ideas.
- Code up a solver in Python.
- Play with a simple linear system: the double integrator.
- Share all the code: https://github.com/wil...
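
For readers who have not met the double integrator before, here is a minimal sketch of it in plain Python. The state is (position, velocity) and the input is an acceleration held constant over each time step. The function names `step` and `simulate` and the default `dt=0.1` are assumptions made for illustration; the code in the linked repository may be organized differently.

```python
# Double integrator: state is (position, velocity), input is acceleration.
# Names and the default time step are illustrative assumptions, not the article's code.

def step(state, u, dt=0.1):
    """Advance one time step with acceleration u held constant over dt."""
    pos, vel = state
    new_pos = pos + vel * dt + 0.5 * u * dt ** 2
    new_vel = vel + u * dt
    return (new_pos, new_vel)


def simulate(state, inputs, dt=0.1):
    """Apply a sequence of inputs and return every visited state, including the start."""
    trajectory = [state]
    for u in inputs:
        state = step(state, u, dt)
        trajectory.append(state)
    return trajectory


if __name__ == "__main__":
    # Start at rest, accelerate for 10 steps, then brake for 10 steps.
    traj = simulate((0.0, 0.0), [1.0] * 10 + [-1.0] * 10)
    print(traj[-1])
```

Because the update is linear in both the state and the input, it can equally be written as x_next = A x + B u with A = [[1, dt], [0, 1]] and B = [[dt^2 / 2], [dt]], which is the form a linear solver would typically consume.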