Similar Articles

word2vec-style vector arithmetic on docs embeddings§

https://technicalwriting.dev/embeddings/arithmetic/index.html

Domain: technicalwriting.dev Added: 2025-11-03 Status: ✓ Success

technicalwriting.dev

word2vec-style vector arithmetic on docs embeddings§ 2025 October 29 word2vec popularized the idea of representing words as vectors where semantically similar words are positioned close to each other ...

Similar Articles (10 found)

🔍 61.6% similar

The Illustrated Word2vec

https://jalammar.github.io/illustrated-word2vec/

jalammar.github.io 2025-12-14

jalammar.github.io

The Illustrated Word2vec Discussions: Hacker News (347 points, 37 comments), Reddit r/MachineLearning (151 points, 19 comments) Translations: Chinese ...

🔍 View Similar Articles 🟠 HN

https://towardsdatascience.com/finetune-your-topic-modeling-workflow-with-bertopic/

towardsdatascience.com 2025-08-13

towardsdatascience.com

Topic modeling remains a critical tool in the AI and NLP toolbox. While large language models (LLMs) handle text exceptionally well, extracting high-l...

🔍 View Similar Articles

🔍 58.8% similar

Introducing Google’s LangExtract tool

https://towardsdatascience.com/introducing-googles-langextract-tool-2/

towardsdatascience.com 2025-08-13

towardsdatascience.com

One announcement that caught my eye in particular occurred at the end of July, when Google released a new text processing and data extraction tool cal...

🔍 View Similar Articles

https://p.migdal.pl/blog/2025/01/dont-use-cosine-similarity/

p.migdal.pl 2026-02-14

p.migdal.pl

Don't use cosine similarity carelessly 14 Jan 2025 | by Piotr Migdał Midas turned everything he touched into gold. Data scientists turn everything int...

🔍 View Similar Articles 🟠 HN

www.technologyreview.com 2025-10-31

www.technologyreview.com

King – Man + Woman = Queen: The Marvelous Mathematics of Computational Linguistics Computational linguistics has dramatically changed the way research...

🔍 View Similar Articles

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

🔍 View Similar Articles 🟠 HN

https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch

www.gilesthomas.com 2025-12-14

www.gilesthomas.com

Writing an LLM from scratch, part 28 -- training a base model from scratch on an RTX 3090 Having worked through the main body of Sebastian Raschka's b...

🔍 View Similar Articles 🟠 HN

🔍 55.5% similar

So you wanna build a local RAG?

https://blog.yakkomajuri.com/blog/local-rag

blog.yakkomajuri.com 2025-11-28

blog.yakkomajuri.com

When we launched Skald, we wanted it to not only be self-hostable, but also for one to be able to run it without sending any data to third-parties. Wi...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles

https://news.ycombinator.com/item?id=38971221

news.ycombinator.com 2025-07-13

news.ycombinator.com,hackernews,tech,news

Thanks for writing this one Simon, I read it some time ago and I just wanted to say thanks and recommend it to folks browsing the comments, it's reall...

🔍 View Similar Articles