microgpt
This is a brief guide to my new art project, microgpt: a single file of 200 lines of pure Python, with no dependencies, that trains a GPT and runs inference on it. This file contains the full algorithmic ...