Similar Articles

Defeating Nondeterminism in LLM Inference

https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

Domain: thinkingmachines.ai Added: 2025-09-10 Status: ✓ Success

thinkingmachines.ai

Defeating Nondeterminism in LLM Inference Reproducibility is a bedrock of scientific progress. However, it’s remarkably difficult to get reproducible results out of large language models. For example,...

Similar Articles (10 found)

https://www.seangoedecke.com/inference-batching-and-deepseek/

www.seangoedecke.com 2025-07-13

deepseek,ai models,throughput,latency,batch size,www.seangoedecke.com

Why DeepSeek is cheap at scale but expensive to run locally Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...

🔍 View Similar Articles 🟠 HN

🔍 62.5% similar

LLM Engineer's Almanac - Workloads

https://modal.com/llm-almanac/workloads

modal.com 2026-02-03

modal.com

The three types of LLM workloads and how to serve them We hold this truth to be self-evident: not all workloads are created equal. But for large langu...

🔍 View Similar Articles 🟠 HN

openai.com 2025-07-13

openai.com

Techniques for training large neural networks Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=31051540

news.ycombinator.com 2025-07-13

hackernews,tech,news,news.ycombinator.com

First, thanks to the publisher and authors for making this freely available! I retired recently after using neural networks since the 1980s. I still s...

🔍 View Similar Articles

https://neutree.ai/blog/nano-vllm-part-1

neutree.ai 2026-02-03

neutree.ai

Understanding LLM Inference Engines: Inside Nano-vLLM (Part 1) Architecture, Scheduling, and the Path from Prompt to Token When deploying large langua...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=40297946

news.ycombinator.com 2025-07-13

news.ycombinator.com,hackernews,tech,news

I'm curious why we seem convinced that this is a task that is possible or something worthy of investigation. I've worked on language models since 2018...

🔍 View Similar Articles

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles

🔍 57.8% similar

Why AGI Will Not Happen — Tim Dettmers

https://timdettmers.com/2025/12/10/why-agi-will-not-happen/

timdettmers.com 2026-02-03

timdettmers.com

If you are reading this, you probably have strong opinions about AGI, superintelligence, and the future of AI. Maybe you believe we are on the cusp of...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=40845304

news.ycombinator.com 2025-07-12

news,tech,hackernews,news.ycombinator.com

This article doesn't talk much about testing or getting training data. It seems like that part is key. For code that you think you understand, it's be...

🔍 View Similar Articles