Similar Articles

https://towardsdatascience.com/a-brief-history-of-gpt-through-papers/

Domain: towardsdatascience.com Added: 2025-08-28 Status: ✓ Success

towardsdatascience.com

0) Prologue: The Turing test In October 1950, Alan Turing proposed a test. Was it possible to have a conversation with a machine and not be able to tell it apart from a human. He called this “the imit...

Similar Articles (10 found)

🔍 75.5% similar

The Illustrated Transformer

https://jalammar.github.io/illustrated-transformer/

jalammar.github.io 2025-12-23

jalammar.github.io

The Illustrated Transformer Discussions: Hacker News (65 points, 4 comments), Reddit r/MachineLearning (29 points, 3 comments) Translations: Arabic, C...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=31051540

news.ycombinator.com 2025-07-13

hackernews,tech,news,news.ycombinator.com

First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=40845304

news.ycombinator.com 2025-07-12

news,tech,hackernews,news.ycombinator.com

This article doesn't talk much about testing or getting training data. It seems like that part is key. For code that you think you understand, it's be...

🔍 View Similar Articles

🔍 65.3% similar

The Bitter Lesson is Misunderstood

https://obviouslywrong.substack.com/p/the-bitter-lesson-is-misunderstood

obviouslywrong.substack.com 2025-09-04

obviouslywrong.substack.com

The Bitter Lesson is Misunderstood Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles

http://karpathy.github.io/2022/03/14/lecun1989/

karpathy.github.io 2025-09-01

karpathy.github.io

Deep Neural Nets: 33 years ago and 33 years from now The Yann LeCun et al. (1989) paper Backpropagation Applied to Handwritten Zip Code Recognition is...

🔍 View Similar Articles 🟠 HN

🔍 61.2% similar

A Peek at Trends in Machine Learning

https://karpathy.medium.com/a-peek-at-trends-in-machine-learning-ab8a1085a106?source=rss-ac9d9a35533e------2

karpathy.medium.com 2025-08-13

karpathy.medium.com blog article +1

A Peek at Trends in Machine Learning Have you looked at Google Trends? It’s pretty cool — you enter some keywords and see how Google Searches of that ...

🔍 View Similar Articles

🔍 61.1% similar

The Q, K, V Matrices

https://arpitbhayani.me/blogs/qkv-matrices/

arpitbhayani.me 2026-02-03

arpitbhayani.me

At the core of the attention mechanism in LLMs are three matrices: Query, Key, and Value. These matrices are how transformers actually pay attention t...

🔍 View Similar Articles 🟠 HN

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

🔍 View Similar Articles 🟠 HN

🔍 59.9% similar

Why AGI Will Not Happen — Tim Dettmers

https://timdettmers.com/2025/12/10/why-agi-will-not-happen/

timdettmers.com 2026-02-03

timdettmers.com

If you are reading this, you probably have strong opinions about AGI, superintelligence, and the future of AI. Maybe you believe we are on the cusp of...

🔍 View Similar Articles 🟠 HN