Similar Articles

Articles similar to the selected content.

Domain: www.seangoedecke.com Added: 2025-07-13 Status: βœ“ Success
deepseek,ai models,throughput,latency,batch size,www.seangoedecke.com
Why DeepSeek is cheap at scale but expensive to run locally Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive to run locally? Why are some AI models slow to re...
Similar Articles (10 found)
πŸ” 71.4% similar
https://openai.com/index/techniques-for-training-large-neural-networks/
https://openai.com/index/techniques-for-training-large-neural-networks/
Techniques for training large neural networks Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...
πŸ” View Similar Articles
πŸ” 68.8% similar
Are GPUs Worth It for ML? (exafunction.com)
https://news.ycombinator.com/item?id=32641769
For some reason they focus on the inference, which is the computationally cheap part. If you're working on ML (as opposed to deploying someone else's ...
πŸ” View Similar Articles
πŸ” 66.3% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
πŸ” View Similar Articles
πŸ” 66.2% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
πŸ” View Similar Articles 🟠 HN
πŸ” 65.9% similar
The Bitter Lesson is Misunderstood
https://obviouslywrong.substack.com/p/the-bitter-lesson-is-misunderstood
The Bitter Lesson is Misunderstood Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...
πŸ” View Similar Articles 🟠 HN
πŸ” 64.8% similar
Ask HN: How can ChatGPT serve 700M users when I can't run one GPT-4 locally?
https://news.ycombinator.com/item?id=44840728
Sam said yesterday that chatgpt handles ~700M weekly users. Meanwhile, I can't even run a single GPT-4-class model locally without insane VRAM or pain...
πŸ” View Similar Articles
πŸ” 64.7% similar
Are OpenAI and Anthropic Really Losing Money on Inference?
https://martinalderson.com/posts/are-openai-and-anthropic-really-losing-money-on-inference/
Are OpenAI and Anthropic Really Losing Money on Inference? I keep hearing what a cash incinerator AI is, especially around inference. While it seems r...
πŸ” View Similar Articles 🟠 HN
πŸ” 63.8% similar
Defeating Nondeterminism in LLM Inference
https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/
Defeating Nondeterminism in LLM Inference Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible ...
πŸ” View Similar Articles 🟠 HN
πŸ” 63.7% similar
Stretch iPhone to its Limit, a 2GiB Model that can Draw Everything in Your Pocket
https://liuliu.me/eyes/stretch-iphone-to-its-limit-a-2gib-model-that-can-draw-everything-in-your-pocket/
Every year, we have a new iPhone that claims to be faster and better in every way. And yes, these new computer vision models and new image sensors can...
πŸ” View Similar Articles 🟠 HN
πŸ” 62.3% similar
Scaling PostgresML to 1M Requests per Second (postgresml.org)
https://news.ycombinator.com/item?id=33518443
What is a good algorithm-to-purpose map for ML beginners? Looking for something like "Algo X is good for making predictions when your data looks like ...
πŸ” View Similar Articles