MicroGPT explained interactively
Andrej Karpathy wrote a 200-line Python script that trains and runs a GPT from scratch, with no libraries or dependencies, just pure Python. The script contains the al...
Similar Articles (10 found)
π 77.3% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
π 69.9% similar
GPT-5: Key characteristics, pricing and model card
7th August 2025
Iβve had preview access to the new GPT-5 model family for the past two weeks (see r...
π 66.9% similar
GPT-5.2
11th December 2025
OpenAI reportedly declared a βcode redβ on the 1st of December in response to increasingly credible competition from the li...
π 65.9% similar
microgpt
This is a brief guide to my new art project microgpt, a single file of 200 lines of pure Python with no dependencies that trains and inferenc...
π 65.8% similar
Writing an LLM from scratch, part 28 -- training a base model from scratch on an RTX 3090
Having worked through the main body of Sebastian Raschka's b...
π 65.2% similar
Language Modeling with Limited Data, Infinite Compute
March 2026
NanoGPT Slowrun is an open effort to implement data-efficient learning algorithms; 5....
π 64.6% similar
Same AI, Different Answer: How Tiny Prompts Can Change Everything
Why Does ChatGPT Sometimes Feel Different?
If youβve used AI chatbots like ChatGPT f...
π 64.6% similar
> the generation of 281,128 augmented examples, from which 1,000 were
held out as a benchmark test set.
This model is trained on a custom dataset of 2...
π 63.6% similar
Things we learned about LLMs in 2024
31st December 2024
A lot has happened in the world of Large Language Models over the course of 2024. Hereβs a rev...
π 63.5% similar
Why DeepSeek is cheap at scale but expensive to run locally
Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...