OpenAI just released two open-weight modelsβgpt-oss-120b and gpt-oss-20bβafter months of anticipation (you can try them here).
That means anyone with a MacBook Pro can run a O3 (mini)-level model loca...
Similar Articles (10 found)
π 70.3% similar
Each month, this newsletter is read by over 45K+ operators, investors, and tech / product leaders and executives. If you found value in this newslette...
π 66.5% similar
Day zero model performance optimization work is a mix of experimentation, bug fixing, and benchmarking guided by intuition and experience. This writeu...
π 64.0% similar
Things we learned about LLMs in 2024
31st December 2024
A lot has happened in the world of Large Language Models over the course of 2024. Hereβs a rev...
π 62.5% similar
A few months after launching Qwen3-VL, Alibaba has released a detailed technical report on the open multimodal model. The data shows the system excels...
π 61.1% similar
For some reason they focus on the inference, which is the computationally cheap part. If you're working on ML (as opposed to deploying someone else's ...
π 60.5% similar
GPT-5: Key characteristics, pricing and model card
7th August 2025
Iβve had preview access to the new GPT-5 model family for the past two weeks (see r...
π 60.4% similar
There has been a lot of interest on HN in fine-tuning open-source LLMs recently (eg. Anyscale's post at
https://news.ycombinator.com/item?id=37090632)...
π 57.8% similar
Evaluating LLMs for my personal use case
Summary
Itβs great that AI can win maths Olympiads, but thatβs not what Iβm doing. I mostly ask basic Rust, P...
π 57.4% similar
> the generation of 281,128 augmented examples, from which 1,000 were
held out as a benchmark test set.
This model is trained on a custom dataset of 2...
π 56.8% similar
Same AI, Different Answer: How Tiny Prompts Can Change Everything
Why Does ChatGPT Sometimes Feel Different?
If youβve used AI chatbots like ChatGPT f...