Similar Articles

Articles similar to the selected content.

Domain: vickiboykis.com Added: 2025-09-10 Status: βœ“ Success
vickiboykis.com
How big are our embeddings now and why? #embeddings #openai #anthropic #huggingface #dimensionality A few years ago, I wrote a paper on embeddings. At the time, I wrote that 200-300 dimension embeddin...
Similar Articles (10 found)
πŸ” 62.0% similar
Error extracting title
https://simonwillison.net/2024/Dec/31/llms-in-2024/
Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...
πŸ” View Similar Articles 🟠 HN
πŸ” 60.5% similar
Why DeepSeek is cheap at scale but expensive to run locally
https://www.seangoedecke.com/inference-batching-and-deepseek/
Why DeepSeek is cheap at scale but expensive to run locally Why is DeepSeek-V3 supposedly fast and cheap to serve at scale, but too slow and expensive...
πŸ” View Similar Articles 🟠 HN
πŸ” 58.2% similar
Show HN: Building a web search engine from scratch with 3B neural embeddings
https://blog.wilsonl.in/search-engine/
Building a web search engine from scratch in two months with 3 billion neural embeddings A while back, I decided to undertake a project to challenge m...
πŸ” View Similar Articles 🟠 HN
πŸ” 58.2% similar
Fine-Tune Your Topic Modeling Workflow with BERTopic
https://towardsdatascience.com/finetune-your-topic-modeling-workflow-with-bertopic/
Topic modeling remains a critical tool in the AI and NLP toolbox. While large language models (LLMs) handle text exceptionally well, extracting high-l...
πŸ” View Similar Articles
πŸ” 58.1% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
πŸ” View Similar Articles
πŸ” 57.4% similar
https://openai.com/index/techniques-for-training-large-neural-networks/
https://openai.com/index/techniques-for-training-large-neural-networks/
Techniques for training large neural networks Large neural networks are at the core of many recent advances in AI, but training them is a difficult en...
πŸ” View Similar Articles
πŸ” 57.4% similar
SmolVLM to SmolVLM2: Compact Models for Multi-Image VQA
https://pyimagesearch.com/2025/06/23/smolvlm-to-smolvlm2-compact-models-for-multi-image-vqa/
Table of Contents - SmolVLM to SmolVLM2: Compact Models for Multi-Image VQA - SmolVLM 1: A Compact Yet Capable Vision-Language Model - What Is SmolVLM...
πŸ” View Similar Articles
πŸ” 57.3% similar
The Bitter Lesson is Misunderstood
https://obviouslywrong.substack.com/p/the-bitter-lesson-is-misunderstood
The Bitter Lesson is Misunderstood Together, the Bitter Lesson and Scaling Laws reveal that the god of Compute we worship is yoked to an even greater ...
πŸ” View Similar Articles 🟠 HN
πŸ” 57.1% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm
Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
πŸ” View Similar Articles 🟠 HN
πŸ” 56.5% similar
Vector Databases: A Technical Primer [pdf] (digitaloceanspaces.com)
https://news.ycombinator.com/item?id=38971221
Thanks for writing this one Simon, I read it some time ago and I just wanted to say thanks and recommend it to folks browsing the comments, it's reall...
πŸ” View Similar Articles