Similar Articles

Articles similar to the selected content.

Domain: www.baseten.co Added: 2025-08-06 Status: ✓ Success
www.baseten.co,model performance optimization,bug fixing,nvidia gpus,experimentation,benchmarking
Day zero model performance optimization work is a mix of experimentation, bug fixing, and benchmarking guided by intuition and experience. This writeup outlines the process we followed to achieve SOTA...
Similar Articles (10 found)
πŸ” 67.0% similar
Fine-tune your own Llama 2 to replace GPT-3.5/4
https://news.ycombinator.com/item?id=37484135
There has been a lot of interest on HN in fine-tuning open-source LLMs recently (e.g. Anyscale's post at https://news.ycombinator.com/item?id=37090632)...
πŸ” View Similar Articles
πŸ” 67.0% similar
GPT-5: Strategic Implications
https://nextword.substack.com/p/gpt-5-strategic-implications
Each month, this newsletter is read by over 45K+ operators, investors, and tech / product leaders and executives. If you found value in this newslette...
πŸ” View Similar Articles
πŸ” 66.5% similar
Ask HN: How can ChatGPT serve 700M users when I can't run one GPT-4 locally?
https://news.ycombinator.com/item?id=44840728
Sam said yesterday that chatgpt handles ~700M weekly users. Meanwhile, I can't even run a single GPT-4-class model locally without insane VRAM or pain...
πŸ” View Similar Articles
πŸ” 66.5% similar
OpenAI's Open Source Strategy
https://nextword.substack.com/p/openai-open-source-strategy-gpt-oss
OpenAI just released two open-weight models, gpt-oss-120b and gpt-oss-20b, after months of anticipation (you can try them here). That means anyone with ...
πŸ” View Similar Articles
πŸ” 62.2% similar
So you wanna build a local RAG?
https://blog.yakkomajuri.com/blog/local-rag
When we launched Skald, we wanted it to not only be self-hostable, but also for one to be able to run it without sending any data to third-parties. Wi...
πŸ” View Similar Articles 🟠 HN
πŸ” 62.0% similar
Why DeepSeek is cheap at scale but expensive to run locally
https://www.seangoedecke.com/inference-batching-and-deepseek/
Why DeepSeek is cheap at scale but expensive to run locally Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...
πŸ” View Similar Articles 🟠 HN
πŸ” 62.0% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
πŸ” View Similar Articles
πŸ” 61.9% similar
What happens when coding agents stop feeling like dialup?
https://martinalderson.com/posts/what-happens-when-coding-agents-stop-feeling-like-dialup/
What happens when coding agents stop feeling like dialup? It's funny how quickly humans adjust to new technology. Only a few months ago Claude Code an...
πŸ” View Similar Articles 🟠 HN
πŸ” 61.5% similar
Techniques for training large neural networks
https://openai.com/index/techniques-for-training-large-neural-networks/
Techniques for training large neural networks Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...
πŸ” View Similar Articles
πŸ” 60.9% similar
Things we learned about LLMs in 2024
https://simonwillison.net/2024/Dec/31/llms-in-2024/
Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...
πŸ” View Similar Articles 🟠 HN