Similar Articles

Fine-Tune Your Topic Modeling Workflow with BERTopic

https://towardsdatascience.com/finetune-your-topic-modeling-workflow-with-bertopic/

Domain: towardsdatascience.com Added: 2025-08-13 Status: ✓ Success

towardsdatascience.com

Topic modeling remains a critical tool in the AI and NLP toolbox. While large language models (LLMs) handle text exceptionally well, extracting high-level topics from massive datasets still requires d...

Similar Articles (10 found)

https://www.gilesthomas.com/2025/10/llm-from-scratch-22-finally-training-our-llm

www.gilesthomas.com 2025-11-08

www.gilesthomas.com

Writing an LLM from scratch, part 22 -- finally training our LLM! This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...

🔍 View Similar Articles 🟠 HN

https://technicalwriting.dev/embeddings/arithmetic/index.html

technicalwriting.dev 2025-11-03

technicalwriting.dev

word2vec-style vector arithmetic on docs embeddings§ 2025 October 29 word2vec popularized the idea of representing words as vectors where semantically...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=45427634

news.ycombinator.com 2025-10-11

news.ycombinator.com

> the generation of 281,128 augmented examples, from which 1,000 were held out as a benchmark test set. This model is trained on a custom dataset of 2...

🔍 View Similar Articles

🔍 58.5% similar

Error extracting title

https://www.pinecone.io/learn/series/image-search/clip/?_hsenc=p2ANqtz-_MZUbziNKCoB2HdM3hBzmaHEesRF9TFZ-S2FkjdJPtOZ2z4GVwso8C-LuBAx8f1Ac7N3G2rnc19e3xHqfVE4zty3DNoQ&_hsmi=251366668&utm_content=251366668&utm_medium=email&utm_source=hs_automation

www.pinecone.io 2025-07-12

www.pinecone.io

Multi-modal ML with OpenAI's CLIP Language models (LMs) can not rely on language alone. That is the idea behind the “Experience Grounds Language” pape...

🔍 View Similar Articles

🔍 58.4% similar

MicroGPT explained interactively

https://growingswe.com/blog/microgpt

growingswe.com 2026-03-01

growingswe.com

MicroGPT explained interactively Andrej Karpathy wrote a 200-line Python script that trains and runs a GPT from scratch, with no libraries or dependen...

🔍 View Similar Articles 🟠 HN

🔍 58.2% similar

How big are our embeddings now and why?

https://vickiboykis.com/2025/09/01/how-big-are-our-embeddings-now-and-why/

vickiboykis.com 2025-09-10

vickiboykis.com

How big are our embeddings now and why? #embeddings #openai #anthropic #huggingface #dimensionality A few years ago, I wrote a paper on embeddings. At...

🔍 View Similar Articles 🟠 HN

🔍 57.4% similar

2 Years of ML vs. 1 Month of Prompting

https://www.levs.fyi/blog/2-years-of-ml-vs-1-month-of-prompting/

www.levs.fyi 2025-11-15

www.levs.fyi

2 Years of ML vs. 1 Month of Prompting November 7, 2025 Recalls at major automakers cost hundreds of millions of dollars a year. It’s a huge issue. To...

🔍 View Similar Articles 🟠 HN

🔍 56.7% similar

The Illustrated Word2vec

https://jalammar.github.io/illustrated-word2vec/

jalammar.github.io 2025-12-14

jalammar.github.io

The Illustrated Word2vec Discussions: Hacker News (347 points, 37 comments), Reddit r/MachineLearning (151 points, 19 comments) Translations: Chinese ...

🔍 View Similar Articles 🟠 HN

https://news.ycombinator.com/item?id=37484135

news.ycombinator.com 2025-07-13

open-source,gpt-4,tech,hackernews,llms,news,news.ycombinator.com,machine learning

There has been a lot of interest on HN in fine-tuning open-source LLMs recently (eg. Anyscale's post at https://news.ycombinator.com/item?id=37090632)...

🔍 View Similar Articles

🔍 56.0% similar

Error extracting title

https://simonwillison.net/2024/Dec/31/llms-in-2024/

simonwillison.net 2025-07-12

simonwillison.net

Things we learned about LLMs in 2024 31st December 2024 A lot has happened in the world of Large Language Models over the course of 2024. Here’s a rev...

🔍 View Similar Articles 🟠 HN