Similar Articles

Articles similar to the selected content.

Domain: news.ycombinator.com Added: 2025-10-11 Status: βœ“ Success
news.ycombinator.com
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 280k examples then tested on 1k very similar exampl...
Similar Articles (10 found)
πŸ” 76.0% similar
Fine-tune your own Llama 2 to replace GPT-3.5/4
https://news.ycombinator.com/item?id=37484135
There has been a lot of interest on HN in fine-tuning open-source LLMs recently (eg. Anyscale's post at https://news.ycombinator.com/item?id=37090632)...
πŸ” View Similar Articles
πŸ” 75.6% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
πŸ” View Similar Articles 🟠 HN
πŸ” 74.5% similar
Are GPUs Worth It for ML? (exafunction.com)
https://news.ycombinator.com/item?id=32641769
For some reason they focus on the inference, which is the computationally cheap part. If you're working on ML (as opposed to deploying someone else's ...
πŸ” View Similar Articles
πŸ” 74.5% similar
Does the Bitter Lesson Have Limits?
https://www.dbreunig.com/2025/08/01/does-the-bitter-lesson-have-limits.html
Does the Bitter Lesson Have Limits? Recently, β€œthe bitter lesson” is having a moment. Coined in an essay by Rich Sutton, the bitter lesson is that, β€œg...
πŸ” View Similar Articles 🟠 HN
πŸ” 73.0% similar
What happens when coding agents stop feeling like dialup?
https://martinalderson.com/posts/what-happens-when-coding-agents-stop-feeling-like-dialup/
What happens when coding agents stop feeling like dialup? It's funny how quickly humans adjust to new technology. Only a few months ago Claude Code an...
πŸ” View Similar Articles 🟠 HN
πŸ” 72.7% similar
The Bitter Lesson is Misunderstood
https://obviouslywrong.substack.com/p/the-bitter-lesson-is-misunderstood
The Bitter Lesson is Misunderstood Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...
πŸ” View Similar Articles 🟠 HN
πŸ” 72.6% similar
TimesFM: Time Series Foundation Model for time-series forecasting (github.com/google-research)
https://news.ycombinator.com/item?id=40297946
I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...
πŸ” View Similar Articles
πŸ” 72.1% similar
The Principles of Deep Learning Theory (arxiv.org)
https://news.ycombinator.com/item?id=31051540
First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...
πŸ” View Similar Articles
πŸ” 71.3% similar
Error extracting title
https://simonwillison.net/2024/Dec/31/llms-in-2024/
Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...
πŸ” View Similar Articles 🟠 HN
πŸ” 70.9% similar
GPT-5: Key characteristics, pricing and system card
https://simonwillison.net/2025/Aug/7/gpt-5/
GPT-5: Key characteristics, pricing and model card 7th August 2025 I’ve had preview access to the new GPT-5 model family for the past two weeks (see r...
πŸ” View Similar Articles 🟠 HN