Similar Articles

Articles similar to the selected content.

Domain: blog.wilsonl.in Added: 2025-08-13 Status: βœ“ Success
blog.wilsonl.in
Building a web search engine from scratch in two months with 3 billion neural embeddings A while back, I decided to undertake a project to challenge myself: build a web search engine from scratch. Asi...
Similar Articles (10 found)
πŸ” 62.7% similar
Designing Multimodal AI Search Engines for Smarter Online Retail
https://pub.towardsai.net/designing-multimodal-ai-search-engines-for-smarter-online-retail-43bafa996238?source=rss----98111c9905da---4
Designing Multimodal AI Search Engines for Smarter Online Retail How AI redefines product discovery, search, and growth in E-Commerce β€œWow, that shirt...
πŸ” View Similar Articles
πŸ” 62.6% similar
Vector Databases: A Technical Primer [pdf] (digitaloceanspaces.com)
https://news.ycombinator.com/item?id=38971221
Thanks for writing this one Simon, I read it some time ago and I just wanted to say thanks and recommend it to folks browsing the comments, it's reall...
πŸ” View Similar Articles
πŸ” 60.1% similar
Introducing Google’s LangExtract tool
https://towardsdatascience.com/introducing-googles-langextract-tool-2/
One announcement that caught my eye in particular occurred at the end of July, when Google released a new text processing and data extraction tool cal...
πŸ” View Similar Articles
πŸ” 59.7% similar
Extract-0: A specialized language model for document information extraction
https://news.ycombinator.com/item?id=45427634
> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...
πŸ” View Similar Articles
πŸ” 59.6% similar
Indexing iCloud Photos with AI Using LLaVA and Pgvector (medium.com/mustafaakin)
https://news.ycombinator.com/item?id=39067615
I think image-encoder from CLIP (even smallest variant ViT B/32) is good enough to capture a lot of semantic information to allow natural language que...
πŸ” View Similar Articles
πŸ” 59.5% similar
The Case Against pgvector | Alex Jacobs
https://alex-jacobs.com/posts/the-case-against-pgvector/
Everyone Loves pgvector (in theory) If you’ve spent any time in the vector search space over the past year, you’ve probably read blog posts explaining...
πŸ” View Similar Articles 🟠 HN
πŸ” 59.1% similar
So you wanna build a local RAG?
https://blog.yakkomajuri.com/blog/local-rag
When we launched Skald, we wanted it to not only be self-hostable, but also for one to be able to run it without sending any data to third-parties. Wi...
πŸ” View Similar Articles 🟠 HN
πŸ” 58.9% similar
Building a Simple Search Engine That Actually Works
https://karboosx.net/post/4eZxhBon/building-a-simple-search-engine-that-actually-works
Why Build Your Own? Look, I know what you're thinking. "Why not just use Elasticsearch?" or "What about Algolia?" Those are valid options, but they co...
πŸ” View Similar Articles 🟠 HN
πŸ” 58.4% similar
Running SmolVLM Locally in Your Browser with Transformers.js - PyImageSearch
https://pyimagesearch.com/2025/10/20/running-smolvlm-locally-in-your-browser-with-transformers-js/
Table of Contents - Running SmolVLM Locally in Your Browser with Transformers.js - Introduction - SmolVLM: A Small But Capable Vision-Language Model -...
πŸ” View Similar Articles
πŸ” 58.2% similar
How big are our embeddings now and why?
https://vickiboykis.com/2025/09/01/how-big-are-our-embeddings-now-and-why/
How big are our embeddings now and why? #embeddings #openai #anthropic #huggingface #dimensionality A few years ago, I wrote a paper on embeddings. At...
πŸ” View Similar Articles 🟠 HN