Similar Articles

Articles similar to the selected content.

Domain: karpathy.github.io Added: 2025-09-01 Status: βœ“ Success
karpathy.github.io
Deep Reinforcement Learning: Pong from Pixels This is a long overdue blog post on Reinforcement Learning (RL). RL is hot! You may have noticed that computers can now automatically learn to play ATARI ...
Similar Articles (10 found)
πŸ” 70.8% similar
Dynamic Programming in Reinforcement Learning
https://pub.towardsai.net/dynamic-programming-in-reinforcement-learning-b94d7b3db22b?source=rss----98111c9905da---4
Dynamic Programming in Reinforcement Learning Our First Approach to Solving Reinforcement Learning Problems! If you’re not familiar with the Bellman e...
πŸ” View Similar Articles
πŸ” 66.2% similar
Error extracting title
https://towardsdatascience.com/hands-on-introduction-to-reinforcement-learning-in-python-da07f7aaca88/
Understanding rewards by teaching a robot to navigate a maze One of the biggest barriers to traditional machine learning is that most supervised and u...
πŸ” View Similar Articles
πŸ” 62.7% similar
Fine Tuning SmolVLM for Human Alignment Using Direct Preference Optimization
https://pyimagesearch.com/2025/08/04/fine-tuning-smolvlm-for-human-alignment-using-direct-preference-optimization/
Table of Contents Fine Tuning SmolVLM for Human Alignment Using Direct Preference Optimization Preference optimization shines when we want models to m...
πŸ” View Similar Articles
πŸ” 62.7% similar
AlphaGo, in context
https://karpathy.medium.com/alphago-in-context-c47718cb95a5?source=rss-ac9d9a35533e------2
AlphaGo, in context Update Oct 18, 2017: AlphaGo Zero was announced. This post refers to the previous version. 95% of it still applies. I had a chance...
πŸ” View Similar Articles
πŸ” 61.8% similar
The Principles of Deep Learning Theory (arxiv.org)
https://news.ycombinator.com/item?id=31051540
First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...
πŸ” View Similar Articles
πŸ” 59.8% similar
My Python code is a neural network (gabornyeki.com)
https://news.ycombinator.com/item?id=40845304
This article doesn't talk much about testing or getting training data. It seems like that part is key. For code that you think you understand, it's be...
πŸ” View Similar Articles
πŸ” 58.9% similar
Yes you should understand backprop
https://karpathy.medium.com/yes-you-should-understand-backprop-e2f06eab496b?source=rss-ac9d9a35533e------2
Yes you should understand backprop When we offered CS231n (Deep Learning class) at Stanford, we intentionally designed the programming assignments to ...
πŸ” View Similar Articles
πŸ” 58.9% similar
Yes you should understand backprop
https://karpathy.medium.com/yes-you-should-understand-backprop-e2f06eab496b
Yes you should understand backprop When we offered CS231n (Deep Learning class) at Stanford, we intentionally designed the programming assignments to ...
πŸ” View Similar Articles 🟠 HN
πŸ” 58.8% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
πŸ” View Similar Articles 🟠 HN
πŸ” 58.5% similar
TimesFM: Time Series Foundation Model for time-series forecasting (github.com/google-research)
https://news.ycombinator.com/item?id=40297946
I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...
πŸ” View Similar Articles