Similar Articles

Deep Reinforcement Learning: Pong from Pixels

http://karpathy.github.io/2016/05/31/rl/

Domain: karpathy.github.io Added: 2025-09-01 Status: ✓ Success

karpathy.github.io

This is a long overdue blog post on Reinforcement Learning (RL). RL is hot! You may have noticed that computers can now automatically learn to play ATARI ...

Similar Articles (10 found)

https://pub.towardsai.net/dynamic-programming-in-reinforcement-learning-b94d7b3db22b?source=rss----98111c9905da---4

pub.towardsai.net 2025-08-13

pub.towardsai.net

Dynamic Programming in Reinforcement Learning Our First Approach to Solving Reinforcement Learning Problems! If you’re not familiar with the Bellman e...

🔍 66.2% similar

https://towardsdatascience.com/hands-on-introduction-to-reinforcement-learning-in-python-da07f7aaca88/

towardsdatascience.com 2025-07-12

data-science,machine-learning,tutorial,towardsdatascience.com

Understanding rewards by teaching a robot to navigate a maze One of the biggest barriers to traditional machine learning is that most supervised and u...

https://pyimagesearch.com/2025/08/04/fine-tuning-smolvlm-for-human-alignment-using-direct-preference-optimization/

pyimagesearch.com 2025-08-13

pyimagesearch.com computer-vision opencv +1

Fine Tuning SmolVLM for Human Alignment Using Direct Preference Optimization Preference optimization shines when we want models to m...

🔍 62.7% similar

AlphaGo, in context

https://karpathy.medium.com/alphago-in-context-c47718cb95a5?source=rss-ac9d9a35533e------2

karpathy.medium.com 2025-08-13

karpathy.medium.com blog article +1

Update Oct 18, 2017: AlphaGo Zero was announced. This post refers to the previous version. 95% of it still applies. I had a chance...

https://news.ycombinator.com/item?id=31051540

news.ycombinator.com 2025-07-13

hackernews,tech,news,news.ycombinator.com

First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...

https://news.ycombinator.com/item?id=40845304

news.ycombinator.com 2025-07-12

news,tech,hackernews,news.ycombinator.com

This article doesn't talk much about testing or getting training data. It seems like that part is key. For code that you think you understand, it's be...

🔍 58.9% similar

Yes you should understand backprop

https://karpathy.medium.com/yes-you-should-understand-backprop-e2f06eab496b?source=rss-ac9d9a35533e------2

karpathy.medium.com 2025-08-13

karpathy.medium.com blog article +1

When we offered CS231n (Deep Learning class) at Stanford, we intentionally designed the programming assignments to ...

🔍 58.9% similar

Yes you should understand backprop

https://karpathy.medium.com/yes-you-should-understand-backprop-e2f06eab496b

karpathy.medium.com 2025-11-03

karpathy.medium.com blog article +1

When we offered CS231n (Deep Learning class) at Stanford, we intentionally designed the programming assignments to ...

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

https://news.ycombinator.com/item?id=40297946

news.ycombinator.com 2025-07-13

news.ycombinator.com,hackernews,tech,news

I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...
