Similar Articles

Articles similar to the selected content.

Domain: news.ycombinator.com Added: 2025-07-13 Status: ✓ Success
hackernews,tech,news,news.ycombinator.com
For some reason they focus on the inference, which is the computationally cheap part. If you're working on ML (as opposed to deploying someone else's ML) then almost all of your workload is training, ...
Similar Articles (10 found)
🔍 74.5% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
🔍 72.5% similar
The Principles of Deep Learning Theory (arxiv.org)
https://news.ycombinator.com/item?id=31051540
First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...
🔍 72.4% similar
TimesFM: Time Series Foundation Model for time-series forecasting (github.com/google-research)
https://news.ycombinator.com/item?id=40297946
I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...
🔍 70.7% similar
Are OpenAI and Anthropic Really Losing Money on Inference?
https://martinalderson.com/posts/are-openai-and-anthropic-really-losing-money-on-inference/
I keep hearing what a cash incinerator AI is, especially around inference. While it seems r...
🔍 69.8% similar
Scaling PostgresML to 1M Requests per Second (postgresml.org)
https://news.ycombinator.com/item?id=33518443
What is a good algorithm-to-purpose map for ML beginners? Looking for something like "Algo X is good for making predictions when your data looks like ...
🔍 69.1% similar
What happens when coding agents stop feeling like dialup?
https://martinalderson.com/posts/what-happens-when-coding-agents-stop-feeling-like-dialup/
It's funny how quickly humans adjust to new technology. Only a few months ago Claude Code an...
🔍 68.8% similar
Techniques for training large neural networks
https://openai.com/index/techniques-for-training-large-neural-networks/
Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...
🔍 68.8% similar
Why DeepSeek is cheap at scale but expensive to run locally
https://www.seangoedecke.com/inference-batching-and-deepseek/
Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...
🔍 67.9% similar
The Bitter Lesson is Misunderstood
https://obviouslywrong.substack.com/p/the-bitter-lesson-is-misunderstood
Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...
🔍 66.3% similar
Ask HN: How can ChatGPT serve 700M users when I can't run one GPT-4 locally?
https://news.ycombinator.com/item?id=44840728
Sam said yesterday that chatgpt handles ~700M weekly users. Meanwhile, I can't even run a single GPT-4-class model locally without insane VRAM or pain...