Similar Articles

Articles similar to the selected content.

Domain: growingswe.com Added: 2026-03-01 Status: βœ“ Success
growingswe.com
MicroGPT explained interactively Andrej Karpathy wrote a 200-line Python script that trains and runs a GPT from scratch, with no libraries or dependencies, just pure Python. The script contains the al...
Similar Articles (10 found)
πŸ” 77.3% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
πŸ” View Similar Articles 🟠 HN
πŸ” 69.9% similar
GPT-5: Key characteristics, pricing and system card
https://simonwillison.net/2025/Aug/7/gpt-5/
GPT-5: Key characteristics, pricing and model card 7th August 2025 I’ve had preview access to the new GPT-5 model family for the past two weeks (see r...
πŸ” View Similar Articles 🟠 HN
πŸ” 66.9% similar
GPT-5.2
https://simonwillison.net/2025/Dec/11/gpt-52/#atom-entries
GPT-5.2 11th December 2025 OpenAI reportedly declared a β€œcode red” on the 1st of December in response to increasingly credible competition from the li...
πŸ” View Similar Articles
πŸ” 65.9% similar
microgpt
https://karpathy.github.io/2026/02/12/microgpt/
microgpt This is a brief guide to my new art project microgpt, a single file of 200 lines of pure Python with no dependencies that trains and inferenc...
πŸ” View Similar Articles
πŸ” 65.8% similar
Writing an LLM from scratch, part 28 -- training a base model from scratch on an RTX 3090
https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch
Writing an LLM from scratch, part 28 -- training a base model from scratch on an RTX 3090 Having worked through the main body of Sebastian Raschka's b...
πŸ” View Similar Articles 🟠 HN
πŸ” 65.2% similar
Language Modeling with Limited Data, Infinite Compute
https://qlabs.sh/slowrun
Language Modeling with Limited Data, Infinite Compute March 2026 NanoGPT Slowrun is an open effort to implement data-efficient learning algorithms; 5....
πŸ” View Similar Articles 🟠 HN
πŸ” 64.6% similar
Same AI, Different Answer: How Tiny Prompts Can Change Everything
https://lightcapai.medium.com/same-ai-different-answer-how-tiny-prompts-can-change-everything-83e880f9773f
Same AI, Different Answer: How Tiny Prompts Can Change Everything Why Does ChatGPT Sometimes Feel Different? If you’ve used AI chatbots like ChatGPT f...
πŸ” View Similar Articles 🟠 HN
πŸ” 64.6% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
πŸ” View Similar Articles
πŸ” 63.6% similar
Error extracting title
https://simonwillison.net/2024/Dec/31/llms-in-2024/
Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...
πŸ” View Similar Articles 🟠 HN
πŸ” 63.5% similar
Why DeepSeek is cheap at scale but expensive to run locally
https://www.seangoedecke.com/inference-batching-and-deepseek/
Why DeepSeek is cheap at scale but expensive to run locally Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...
πŸ” View Similar Articles 🟠 HN