Similar Articles

Articles similar to the selected content.

Domain: www.gilesthomas.com Added: 2025-12-14 Status: βœ“ Success
www.gilesthomas.com
Writing an LLM from scratch, part 28 -- training a base model from scratch on an RTX 3090 Having worked through the main body of Sebastian Raschka's book "Build a Large Language Model (from Scratch)",...
Similar Articles (10 found)
πŸ” 73.3% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
πŸ” View Similar Articles 🟠 HN
πŸ” 64.1% similar
Introducing Google’s LangExtract tool
https://towardsdatascience.com/introducing-googles-langextract-tool-2/
One announcement that caught my eye in particular occurred at the end of July, when Google released a new text processing and data extraction tool cal...
πŸ” View Similar Articles
πŸ” 63.5% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
πŸ” View Similar Articles
πŸ” 61.7% similar
How We Cut Inference Costs from $46K to $7.5K Fine-Tuning Qwen-Image-Edit
https://ghost.oxen.ai/how-we-cut-inference-costs-from-46k-to-7-5k-fine-tuning-qwen-image-edit/
How We Cut Inference Costs from $46K to $7.5K Fine-Tuning Qwen-Image-Edit Running quality inference at scale is something we think about a lot at Oxen...
πŸ” View Similar Articles
πŸ” 61.3% similar
Show HN: Building a web search engine from scratch with 3B neural embeddings
https://blog.wilsonl.in/search-engine/
Building a web search engine from scratch in two months with 3 billion neural embeddings A while back, I decided to undertake a project to challenge m...
πŸ” View Similar Articles 🟠 HN
πŸ” 61.1% similar
Fine-tune your own Llama 2 to replace GPT-3.5/4
https://news.ycombinator.com/item?id=37484135
There has been a lot of interest on HN in fine-tuning open-source LLMs recently (eg. Anyscale's post at https://news.ycombinator.com/item?id=37090632)...
πŸ” View Similar Articles
πŸ” 59.6% similar
Error extracting title
https://simonwillison.net/2024/Dec/31/llms-in-2024/
Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...
πŸ” View Similar Articles 🟠 HN
πŸ” 59.1% similar
GPT-5: Key characteristics, pricing and system card
https://simonwillison.net/2025/Aug/7/gpt-5/
GPT-5: Key characteristics, pricing and model card 7th August 2025 I’ve had preview access to the new GPT-5 model family for the past two weeks (see r...
πŸ” View Similar Articles 🟠 HN
πŸ” 59.1% similar
Deep Neural Nets: 33 years ago and 33 years from now
http://karpathy.github.io/2022/03/14/lecun1989/
Deep Neural Nets: 33 years ago and 33 years from now The Yann LeCun et al. (1989) paper Backpropagation Applied to Handwritten Zip Code Recognition is...
πŸ” View Similar Articles 🟠 HN
πŸ” 58.3% similar
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult
https://simonwillison.net/2025/Nov/24/claude-opus/#atom-entries
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult 24th November 2025 Anthropic released Claude Opus 4.5 this morning, which they ...
πŸ” View Similar Articles