Similar Articles

https://simonwillison.net/2025/Nov/22/olmo-3/#atom-entries

Domain: simonwillison.net Added: 2025-12-18 Status: ✓ Success

simonwillison.net

Olmo 3 is a fully open LLM 22nd November 2025 Olmo is the LLM series from Ai2—the Allen institute for AI. Unlike most open weight models these are notable for including the full training data, trainin...

Similar Articles (10 found)

https://simonwillison.net/2025/Nov/24/claude-opus/#atom-entries

simonwillison.net 2025-12-18

simonwillison.net

Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult 24th November 2025 Anthropic released Claude Opus 4.5 this morning, which they ...

🔍 View Similar Articles

🔍 74.7% similar

Error extracting title

https://simonwillison.net/2024/Dec/31/llms-in-2024/

simonwillison.net 2025-07-12

simonwillison.net

Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...

🔍 View Similar Articles 🟠 HN

https://the-decoder.com/qwen3-vl-can-scan-two-hour-videos-and-pinpoint-nearly-every-detail/

the-decoder.com 2025-12-02

the-decoder.com

A few months after launching Qwen3-VL, Alibaba has released a detailed technical report on the open multimodal model. The data shows the system excels...

🔍 View Similar Articles 🟠 HN

🔍 70.0% similar

GPT-5.2

https://simonwillison.net/2025/Dec/11/gpt-52/#atom-entries

simonwillison.net 2025-12-18

simonwillison.net

GPT-5.2 11th December 2025 OpenAI reportedly declared a “code red” on the 1st of December in response to increasingly credible competition from the li...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles

https://simonwillison.net/2025/Aug/7/gpt-5/

simonwillison.net 2025-08-13

simonwillison.net

GPT-5: Key characteristics, pricing and model card 7th August 2025 I’ve had preview access to the new GPT-5 model family for the past two weeks (see r...

🔍 View Similar Articles 🟠 HN

🔍 65.4% similar

Error extracting title

https://abishekmuthian.com/how-i-run-llms-locally/

abishekmuthian.com 2025-07-12

abishekmuthian.com

A HN user asked me0 how I run LLMs locally with some specific questions, I’m documenting it here for everyone. Before I begin I would like to credit t...

🔍 View Similar Articles 🟠 HN

https://darkcoding.net/software/personal-ai-evals-aug-2025/

darkcoding.net 2025-09-01

darkcoding.net

Evaluating LLMs for my personal use case Summary It’s great that AI can win maths Olympiads, but that’s not what I’m doing. I mostly ask basic Rust, P...

🔍 View Similar Articles 🟠 HN

🔍 64.5% similar

OpenAI's Open Source Strategy

https://nextword.substack.com/p/openai-open-source-strategy-gpt-oss

nextword.substack.com 2025-08-28

nextword.substack.com

OpenAI just released two open-weight models—gpt-oss-120b and gpt-oss-20b—after months of anticipation (you can try them here). That means anyone with ...

🔍 View Similar Articles

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

🔍 View Similar Articles 🟠 HN