January 2003
(This article was given as a talk at the 2003 Spam Conference.
It describes the work I've done to improve the performance of
the algorithm described in A Plan for Spam,
and what I plan to...
Similar Articles (10 found)
π 84.6% similar
August 2002
(This article describes the spam-filtering techniques
used in the spamproof web-based mail reader we
built to exercise Arc. An
improved al...
π 71.9% similar
August 2003
We may be able to improve the accuracy of Bayesian spam filters
by having them follow links to see what's
waiting at the other end. Richar...
π 61.1% similar
Writing an LLM from scratch, part 22 -- finally training our LLM!
This post wraps up my notes on chapter 5 of Sebastian Raschka's book "Build a Large ...
π 54.2% similar
| |
February 2009
Hacker News was two years
old last week. Initially it was supposed to be a side projectβan
application to sharpen Arc on, and a plac...
π 53.4% similar
> the generation of 281,128 augmented examples, from which 1,000 were
held out as a benchmark test set.
This model is trained on a custom dataset of 2...
π 53.2% similar
2 Years of ML vs. 1 Month of Prompting
November 7, 2025
Recalls at major automakers cost hundreds of millions of dollars a year. Itβs a huge issue. To...
π 52.7% similar
Building Confidence: A Case Study in How to Create Confidence Scores for GenAI Applications
TL;DR Getting a response from GenAI is quick and straightf...
π 52.6% similar
Agree with much of thisβparticularly that these systems are uncannily good at inferring how to 'play along' with the user and extreme caution is there...
π 52.6% similar
For probabilities, use Fermi numbers, not words
Probability words
Words have fuzzy meanings (Figure 1), but that alone doesnβt mean words are useless....
π 52.4% similar
The Pattern-Seeking Fallacy
What do these have in common?
- βThis pitcher has retired 5 of the last 7 batters.β
- βWe tried 10 AdWords variants and co...