There has been a lot of interest on HN in fine-tuning open-source LLMs recently (e.g. Anyscale's post at
https://news.ycombinator.com/item?id=37090632). I've been playing around with fine-tuning models...
Similar Articles (10 found)
• 76.0% similar
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set.
This model is trained on a custom dataset of 2...
• 70.4% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
• 68.5% similar
2 Years of ML vs. 1 Month of Prompting
November 7, 2025
Recalls at major automakers cost hundreds of millions of dollars a year. It's a huge issue. To...
• 67.0% similar
Day zero model performance optimization work is a mix of experimentation, bug fixing, and benchmarking guided by intuition and experience. This writeu...
• 65.7% similar
Things we learned about LLMs in 2024
31st December 2024
A lot has happened in the world of Large Language Models over the course of 2024. Here's a rev...
• 65.4% similar
How We Cut Inference Costs from $46K to $7.5K Fine-Tuning Qwen-Image-Edit
Running quality inference at scale is something we think about a lot at Oxen...
• 65.3% similar
Use your own customized open-source Large Language Model
You've built it. Now unleash it.
You already fine-tuned a model (great!). Now it's time to us...
• 64.7% similar
GPT-5: Key characteristics, pricing and model card
7th August 2025
I've had preview access to the new GPT-5 model family for the past two weeks (see r...
• 61.6% similar
Each month, this newsletter is read by 45K+ operators, investors, and tech / product leaders and executives. If you found value in this newslette...
• 61.3% similar
Laptop-Only LLM: Tune Google Gemma 3 in Minutes (Code Inside)
A clean, from-scratch walkthrough (with code) to tune a 270M-para...