Similar Articles

Deep Neural Nets: 33 years ago and 33 years from now

http://karpathy.github.io/2022/03/14/lecun1989/

Domain: karpathy.github.io Added: 2025-09-01 Status: ✓ Success

karpathy.github.io

Deep Neural Nets: 33 years ago and 33 years from now The Yann LeCun et al. (1989) paper Backpropagation Applied to Handwritten Zip Code Recognition is I believe of some historical significance because...

Similar Articles (10 found)

https://news.ycombinator.com/item?id=31051540

news.ycombinator.com 2025-07-13

hackernews,tech,news,news.ycombinator.com

First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...

🔍 View Similar Articles

🔍 68.7% similar

A Recipe for Training Neural Networks

http://karpathy.github.io/2019/04/25/recipe/

karpathy.github.io 2025-09-01

karpathy.github.io

A Recipe for Training Neural Networks Some few weeks ago I posted a tweet on “the most common neural net mistakes”, listing a few common gotchas relat...

🔍 View Similar Articles 🟠 HN

https://towardsdatascience.com/a-refined-training-recipe-for-fine-grained-visual-classification/

towardsdatascience.com 2025-08-13

towardsdatascience.com

1. The problem: We needed a system that could identify specific car models, not just “this is a BMW,” but which BMW model and year. And it needed to r...

🔍 View Similar Articles

🔍 66.8% similar

A Peek at Trends in Machine Learning

https://karpathy.medium.com/a-peek-at-trends-in-machine-learning-ab8a1085a106?source=rss-ac9d9a35533e------2

karpathy.medium.com 2025-08-13

karpathy.medium.com blog article +1

A Peek at Trends in Machine Learning Have you looked at Google Trends? It’s pretty cool — you enter some keywords and see how Google Searches of that ...

🔍 View Similar Articles

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

🔍 View Similar Articles 🟠 HN

https://liuliu.me/eyes/stretch-iphone-to-its-limit-a-2gib-model-that-can-draw-everything-in-your-pocket/

liuliu.me 2025-07-13

liuliu.me

Every year, we have a new iPhone that claims to be faster and better in every way. And yes, these new computer vision models and new image sensors can...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=40845304

news.ycombinator.com 2025-07-12

news,tech,hackernews,news.ycombinator.com

This article doesn't talk much about testing or getting training data. It seems like that part is key. For code that you think you understand, it's be...

🔍 View Similar Articles

🔍 62.7% similar

Software 2.0

https://karpathy.medium.com/software-2-0-a64152b37c35?source=rss-ac9d9a35533e------2

karpathy.medium.com 2025-08-13

karpathy.medium.com blog article +1

Software 2.0 I sometimes see people refer to neural networks as just “another tool in your machine learning toolbox”. They have some pros and cons, th...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=40297946

news.ycombinator.com 2025-07-13

news.ycombinator.com,hackernews,tech,news

I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles