Similar Articles

https://vickiboykis.com/2025/09/01/how-big-are-our-embeddings-now-and-why/

Domain: vickiboykis.com Added: 2025-09-10 Status: ✓ Success

vickiboykis.com

How big are our embeddings now and why? #embeddings #openai #anthropic #huggingface #dimensionality A few years ago, I wrote a paper on embeddings. At the time, I wrote that 200-300 dimension embeddin...

Similar Articles (10 found)

🔍 63.9% similar

The Illustrated Word2vec

https://jalammar.github.io/illustrated-word2vec/

jalammar.github.io 2025-12-14

jalammar.github.io

The Illustrated Word2vec Discussions: Hacker News (347 points, 37 comments), Reddit r/MachineLearning (151 points, 19 comments) Translations: Chinese ...

🔍 View Similar Articles 🟠 HN

🔍 62.0% similar

Error extracting title

https://simonwillison.net/2024/Dec/31/llms-in-2024/

simonwillison.net 2025-07-12

simonwillison.net

Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...

🔍 View Similar Articles 🟠 HN

https://www.seangoedecke.com/inference-batching-and-deepseek/

www.seangoedecke.com 2025-07-13

deepseek,ai models,throughput,latency,batch size,www.seangoedecke.com

Why DeepSeek is cheap at scale but expensive to run locally Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...

🔍 View Similar Articles 🟠 HN

https://blog.wilsonl.in/search-engine/

blog.wilsonl.in 2025-08-13

blog.wilsonl.in

Building a web search engine from scratch in two months with 3 billion neural embeddings A while back, I decided to undertake a project to challenge m...

🔍 View Similar Articles 🟠 HN

https://towardsdatascience.com/finetune-your-topic-modeling-workflow-with-bertopic/

towardsdatascience.com 2025-08-13

towardsdatascience.com

Topic modeling remains a critical tool in the AI and NLP toolbox. While large language models (LLMs) handle text exceptionally well, extracting high-l...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles

openai.com 2025-07-13

openai.com

Techniques for training large neural networks Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...

🔍 View Similar Articles

https://pyimagesearch.com/2025/06/23/smolvlm-to-smolvlm2-compact-models-for-multi-image-vqa/

pyimagesearch.com 2025-08-13

pyimagesearch.com computer-vision opencv +1

Table of Contents - SmolVLM to SmolVLM2: Compact Models for Multi-Image VQA - SmolVLM 1: A Compact Yet Capable Vision-Language Model - What Is SmolVLM...

🔍 View Similar Articles

https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch

www.gilesthomas.com 2025-12-14

www.gilesthomas.com

Writing an LLM from scratch, part 28 -- training a base model from scratch on an RTX 3090 Having worked through the main body of Sebastian Raschka's b...

🔍 View Similar Articles 🟠 HN

🔍 57.3% similar

The Bitter Lesson is Misunderstood

https://obviouslywrong.substack.com/p/the-bitter-lesson-is-misunderstood

obviouslywrong.substack.com 2025-09-04

obviouslywrong.substack.com

The Bitter Lesson is Misunderstood Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...

🔍 View Similar Articles 🟠 HN