Don't use cosine similarity carelessly
14 Jan 2025 | by Piotr MigdaΕ
Midas turned everything he touched into gold. Data scientists turn everything into vectors.
We do it for a reason β as gold is the ...
Similar Articles (10 found)
π 71.2% similar
Note: All figures and formulas in the following sections have been created by the author of this article.
Mathematical Intuition
The cosine similarity...
π 65.1% similar
Pinecone is a vector database that makes it easy to add similarity search to any application. Try it free, and continue reading to learn what makes si...
π 64.6% similar
How to Use FAISS to Build Your First Similarity Search
At Loopio, we use Facebook AI Similarity Search (FAISS) to efficiently search for similar text....
π 62.8% similar
Thanks for writing this one Simon, I read it some time ago and I just wanted to say thanks and recommend it to folks browsing the comments, it's reall...
π 62.0% similar
The Illustrated Word2vec
Discussions:
Hacker News (347 points, 37 comments), Reddit r/MachineLearning (151 points, 19 comments)
Translations: Chinese ...
π 58.0% similar
word2vec-style vector arithmetic on docs embeddingsΒ§
2025 October 29
word2vec popularized the idea of representing words as vectors where semantically...
π 57.8% similar
I think image-encoder from CLIP (even smallest variant ViT B/32) is good enough to capture a lot of semantic information to allow natural language que...
π 57.0% similar
Hamming Distance for Hybrid Search in SQLite
This article shows how I implemented semantic search in SQLite using binary embeddings and Hamming distan...
π 56.4% similar
King β Man + Woman = Queen: The Marvelous Mathematics of Computational Linguistics
Computational linguistics has dramatically changed the way research...
π 56.2% similar
The Modern Data Toolbox: Combining LLMs, ML, and Statistics for Greater Impact
Co-written with
Matching the Tool to the Task
A Quick Recap
In our prev...