Similar Articles

Articles similar to the selected content.

Domain: gudok.xyz Added: 2025-07-13 Status: βœ“ Success
gudok.xyz
No dataset pitfall Machine learning is not about solving some random problem that looks commercially appealing. It is all about finding a problem for which a good training dataset can be acquired. For...
Similar Articles (10 found)
πŸ” 65.4% similar
The Principles of Deep Learning Theory (arxiv.org)
https://news.ycombinator.com/item?id=31051540
First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...
πŸ” View Similar Articles
πŸ” 64.9% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
πŸ” View Similar Articles
πŸ” 64.8% similar
Are GPUs Worth It for ML? (exafunction.com)
https://news.ycombinator.com/item?id=32641769
For some reason they focus on the inference, which is the computationally cheap part. If you're working on ML (as opposed to deploying someone else's ...
πŸ” View Similar Articles
πŸ” 64.8% similar
Building with Humility
https://www.matroid.com/building-with-humility/
Building with Humility John Goddard | July 31st, 2025 How a product can get it right when machine learning gets it wrong Introduction Silicon Valley i...
πŸ” View Similar Articles
πŸ” 64.8% similar
The End of the Train-Test Split
https://folio.benguzovsky.com/train-test
You are a machine learning engineer at Facebook in Menlo Park. Your task: build the best butt classification model, which decides if there is an expos...
πŸ” View Similar Articles 🟠 HN
πŸ” 63.9% similar
A Recipe for Training Neural Networks
http://karpathy.github.io/2019/04/25/recipe/
A Recipe for Training Neural Networks Some few weeks ago I posted a tweet on β€œthe most common neural net mistakes”, listing a few common gotchas relat...
πŸ” View Similar Articles 🟠 HN
πŸ” 63.2% similar
Software 2.0
https://karpathy.medium.com/software-2-0-a64152b37c35?source=rss-ac9d9a35533e------2
Software 2.0 I sometimes see people refer to neural networks as just β€œanother tool in your machine learning toolbox”. They have some pros and cons, th...
πŸ” View Similar Articles
πŸ” 62.5% similar
TimesFM: Time Series Foundation Model for time-series forecasting (github.com/google-research)
https://news.ycombinator.com/item?id=40297946
I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...
πŸ” View Similar Articles
πŸ” 60.5% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
πŸ” View Similar Articles 🟠 HN
πŸ” 59.5% similar
https://openai.com/index/techniques-for-training-large-neural-networks/
https://openai.com/index/techniques-for-training-large-neural-networks/
Techniques for training large neural networks Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...
πŸ” View Similar Articles