> the generation of 281,128 augmented examples, from which 1,000 were
> held out as a benchmark test set.
This model is trained on a custom dataset of 280k examples, then tested on 1k very similar exampl...
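As a concrete illustration of the split described in the quote, here is a minimal Python sketch that holds out a fixed 1,000-example benchmark set from the augmented data. The loader name and seed are hypothetical, not from the original post:

```python
import random

def split_holdout(examples, test_size=1_000, seed=42):
    # Shuffle a copy with a fixed seed so the held-out benchmark
    # set is reproducible, then slice off the test portion.
    rng = random.Random(seed)
    shuffled = examples[:]
    rng.shuffle(shuffled)
    return shuffled[test_size:], shuffled[:test_size]

# Hypothetical usage (load_augmented_examples is a stand-in):
# train, test = split_holdout(load_augmented_examples())
# assert len(test) == 1_000
```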
Similar Articles (10 found)
76.0% similar
There has been a lot of interest on HN in fine-tuning open-source LLMs recently (e.g. Anyscale's post at
https://news.ycombinator.com/item?id=37090632)...
75.6% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
74.5% similar
For some reason they focus on the inference, which is the computationally cheap part. If you're working on ML (as opposed to deploying someone else's ...
74.5% similar
Does the Bitter Lesson Have Limits?
Recently, "the bitter lesson" is having a moment. Coined in an essay by Rich Sutton, the bitter lesson is that, "g...
73.0% similar
What happens when coding agents stop feeling like dialup?
It's funny how quickly humans adjust to new technology. Only a few months ago Claude Code an...
72.7% similar
The Bitter Lesson is Misunderstood
Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...
72.6% similar
I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation.
I've worked on language models since 2018...
72.1% similar
First, thanks to the publisher and authors for making this freely available!
I retired recently after using neural networks since the 1980s. I still s...
71.3% similar
Things we learned about LLMs in 2024
31st December 2024
A lot has happened in the world of Large Language Models over the course of 2024. Here's a rev...
70.9% similar
GPT-5: Key characteristics, pricing and model card
7th August 2025
I've had preview access to the new GPT-5 model family for the past two weeks (see r...
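A note on the percentage scores attached to each item above: they read like embedding similarity scaled to a percentage. A minimal sketch of that interpretation, assuming plain cosine similarity over precomputed embedding vectors (an assumption, not the page's actual pipeline):

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def similarity_percent(query_vec, doc_vec):
    # e.g. a cosine of 0.760 would render as "76.0% similar"
    return f"{cosine_similarity(query_vec, doc_vec) * 100:.1f}% similar"
```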