Similar Articles

Domain: growingswe.com Added: 2026-03-01 Status: ✓ Success


MicroGPT explained interactively

Andrej Karpathy wrote a 200-line Python script that trains and runs a GPT from scratch, with no libraries or dependencies, just pure Python. The script contains the al...

Similar Articles (10 found)

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08


Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...


https://simonwillison.net/2025/Aug/7/gpt-5/

simonwillison.net 2025-08-13


GPT-5: Key characteristics, pricing and model card 7th August 2025 I’ve had preview access to the new GPT-5 model family for the past two weeks (see r...


🔍 66.9% similar

GPT-5.2

https://simonwillison.net/2025/Dec/11/gpt-52/#atom-entries

simonwillison.net 2025-12-18


GPT-5.2 11th December 2025 OpenAI reportedly declared a “code red” on the 1st of December in response to increasingly credible competition from the li...


🔍 65.9% similar

microgpt

https://karpathy.github.io/2026/02/12/microgpt/

karpathy.github.io 2026-03-01


microgpt This is a brief guide to my new art project microgpt, a single file of 200 lines of pure Python with no dependencies that trains and inferenc...


https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch

www.gilesthomas.com 2025-12-14


Writing an LLM from scratch, part 28 -- training a base model from scratch on an RTX 3090 Having worked through the main body of Sebastian Raschka's b...


https://qlabs.sh/slowrun

qlabs.sh 2026-03-05


Language Modeling with Limited Data, Infinite Compute March 2026 NanoGPT Slowrun is an open effort to implement data-efficient learning algorithms; 5....


https://lightcapai.medium.com/same-ai-different-answer-how-tiny-prompts-can-change-everything-83e880f9773f

lightcapai.medium.com 2025-08-13

lightcapai.medium.com blog article +1

Same AI, Different Answer: How Tiny Prompts Can Change Everything Why Does ChatGPT Sometimes Feel Different? If you’ve used AI chatbots like ChatGPT f...


https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11


> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...


🔍 63.6% similar

Things we learned about LLMs in 2024

https://simonwillison.net/2024/Dec/31/llms-in-2024/

simonwillison.net 2025-07-12


Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...


https://www.seangoedecke.com/inference-batching-and-deepseek/

www.seangoedecke.com 2025-07-13

deepseek,ai models,throughput,latency,batch size,www.seangoedecke.com

Why DeepSeek is cheap at scale but expensive to run locally Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...
