Similar Articles

My Python code is a neural network (gabornyeki.com)

https://news.ycombinator.com/item?id=40845304

Domain: news.ycombinator.com Added: 2025-07-12 Status: ✓ Success

news,tech,hackernews,news.ycombinator.com

This article doesn't talk much about testing or getting training data. It seems like that part is key. For code that you think you understand, it's because you've informally proven to yourself that it...

Similar Articles (10 found)

https://news.ycombinator.com/item?id=31051540

news.ycombinator.com 2025-07-13

hackernews,tech,news,news.ycombinator.com

First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...

🔍 View Similar Articles

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

🔍 View Similar Articles 🟠 HN

🔍 66.2% similar

A Brief History of GPT Through Papers

https://towardsdatascience.com/a-brief-history-of-gpt-through-papers/

towardsdatascience.com 2025-08-28

towardsdatascience.com

0) Prologue: The Turing test In October 1950, Alan Turing proposed a test. Was it possible to have a conversation with a machine and not be able to te...

🔍 View Similar Articles

http://karpathy.github.io/2022/03/14/lecun1989/

karpathy.github.io 2025-09-01

karpathy.github.io

Deep Neural Nets: 33 years ago and 33 years from now The Yann LeCun et al. (1989) paper Backpropagation Applied to Handwritten Zip Code Recognition is...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=40297946

news.ycombinator.com 2025-07-13

news.ycombinator.com,hackernews,tech,news

I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...

🔍 View Similar Articles

🔍 63.8% similar

The Illustrated Transformer

https://jalammar.github.io/illustrated-transformer/

jalammar.github.io 2025-12-23

jalammar.github.io

The Illustrated Transformer Discussions: Hacker News (65 points, 4 comments), Reddit r/MachineLearning (29 points, 3 comments) Translations: Arabic, C...

🔍 View Similar Articles 🟠 HN

openai.com 2025-07-13

openai.com

Techniques for training large neural networks Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...

🔍 View Similar Articles

🔍 63.2% similar

A Recipe for Training Neural Networks

http://karpathy.github.io/2019/04/25/recipe/

karpathy.github.io 2025-09-01

karpathy.github.io

A Recipe for Training Neural Networks Some few weeks ago I posted a tweet on “the most common neural net mistakes”, listing a few common gotchas relat...

🔍 View Similar Articles 🟠 HN

🔍 62.4% similar

Yes you should understand backprop

https://karpathy.medium.com/yes-you-should-understand-backprop-e2f06eab496b?source=rss-ac9d9a35533e------2

karpathy.medium.com 2025-08-13

karpathy.medium.com blog article +1

Yes you should understand backprop When we offered CS231n (Deep Learning class) at Stanford, we intentionally designed the programming assignments to ...

🔍 View Similar Articles