0) Prologue: The Turing test
In October 1950, Alan Turing proposed a test. Was it possible to have a conversation with a machine and not be able to tell it apart from a human. He called this βthe imit...
Similar Articles (10 found)
π 75.5% similar
The Illustrated Transformer
Discussions:
Hacker News (65 points, 4 comments), Reddit r/MachineLearning (29 points, 3 comments)
Translations: Arabic, C...
π 66.8% similar
First, thanks to the publisher and authors for making this freely available!
I retired recently after using neural networks since the 1980s. I still s...
π 66.2% similar
This article doesn't talk much about testing or getting training data. It seems like that part is key.
For code that you think you understand, it's be...
π 65.3% similar
The Bitter Lesson is Misunderstood
Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...
π 62.6% similar
Comprehensive toy implementations of the 30 foundational papers recommended by Ilya Sutskever
Buy the Google Colab Code for this Repo at Gumroad
This ...
π 62.3% similar
> the generation of 281,128 augmented examples, from which 1,000 were
held out as a benchmark test set.
This model is trained on a custom dataset of 2...
π 61.9% similar
Deep Neural Nets: 33 years ago and 33 years from now
The Yann LeCun et al. (1989) paper Backpropagation Applied to Handwritten Zip Code Recognition is...
π 61.3% similar
How will OpenAI compete?
βJakub and Mark set the research direction for the long run. Then after months of work, something incredible emerges and I ge...
π 61.2% similar
A Peek at Trends in Machine Learning
Have you looked at Google Trends? Itβs pretty cool β you enter some keywords and see how Google Searches of that ...
π 61.1% similar
At the core of the attention mechanism in LLMs are three matrices: Query, Key, and Value. These matrices are how transformers actually pay attention t...