Two Spam Filters 10 Times As Accurate As Humans
Posted by
timothy
on Mon Feb 23, 2004 08:13 PM
from the dev/null-is-getting-fatter dept.
from the dev/null-is-getting-fatter dept.
Nuclear Elephant writes "The authors of two spam filters, CRM114 and DSPAM, announced recently
that their filters have achieved accuracy rates ten times better than a human is capable of. Based on a study by Bill Yerazunis of CRM114, the average human is only 99.84% accurate. Both filters are reporting to have reached accuracy levels between 99.983% and 99.984% (1 misclassification in 6250 messages) using completely different approaches (CRM114 touts Markovan, while DSPAM implements a Dolby-type noise reduction algorithm called Dobly). If you're looking for a way to rid spam from your inbox, roll on over to one of these authors' websites."
This discussion has been archived.
No new comments can be posted.
Two Spam Filters 10 Times As Accurate As Humans
|
Log In/Create an Account
| Top
| 487 comments
(Spill at 50!) | Index Only
| Search Discussion
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
(1)
|
2
Outclassed... (Score:5, Funny)
(http://www.wikipedia.com/)
Re:Help setting this up (Score:4, Informative)
(http://michonline.com/ | Last Journal: Tuesday February 17 2004, @09:16PM)
Not sure exactly why you need a pop3 proxy involved, just use Fetchmail to deliver locally, run things through procmail.
Set your local mailserver (sendmail/qmail/postfix/exim/whatever) to use your ISP's SMTP server as a smarthost, and it'll send everything it doesn't recognize as local off to them to handle.
It _can't_ know which pr0n I think is spam vs good (Score:5, Funny)
How would it know if I consider brunettes non-spam but blondes spam? I did opt-in for one of those email categories, but not the other.
Re:Help setting this up (Score:5, Insightful)
(http://austinskatenotes.org/ | Last Journal: Sunday September 30, @12:27AM)
Huh? Aren't humans 100%? (Score:5, Insightful)
(http://elitemrp.net/)
Re:Huh? Aren't humans 100%? (Score:5, Informative)
(http://markjensen.net/)
I haven't been 100% accurate.
I received an email from my sister-in-law from her work, and the address looked suspicious (one of those weird-looking "letter and number" jumbles.
I deleted it. It happens.
*slams head against wall* (Score:5, Funny)
(http://www.drgw.net/~nnthayer)
Yeah, so did I. The subject line was "I want you so bad."
I deleted it. Turned out the message was genuine. I'll never forgive myself...
Re:*slams head against wall* (Score:5, Funny)
(Last Journal: Thursday October 02 2003, @03:54PM)
Don't worry (Score:4, Funny)
(http://it.slashdot.org/~sik0fewl/)
Don't worry, I can forward you the one she sent me. Sounds like the same email.
Re:*slams head against wall* (Score:5, Funny)
Re:Huh? Aren't humans 100%? (Score:5, Interesting)
(http://trypticon.org/)
That actually makes humans much more accurate. We can eliminate many of the messages just by looking at the subject.
The further question is, if humans aren't as accurate as the computer, how are they measuring the accuracy at all? That is, how do they know that the 1 in 6250 messages is wrong, if a human, known to be inaccurate, was testing for accuracy?
Re:Huh? Aren't humans 100%? (Score:5, Interesting)
(https://customer.lylix.net/aff.php?aff=006)
I believe that humans can be 100% accurate (or thereabouts) if they read the *ENTIRE* message, however that's exactly the point - if you have to read an entire message to tell that it's spam, the spam has succeeded.
Their number probably concerns how people can tell without reading the entire message whether or not the message is spam. My brother accidentally deleted a few messages I had sent to him, however if he had read them fully he would have known they were legit.
Cheers,
Justin
Re:Huh? Aren't humans 100%? (Score:5, Insightful)
(http://trypticon.org/)
Re:Huh? Aren't humans 100%? (Score:5, Informative)
When these factors are considered, I think it's quite possible to write software that in the long run has a higher success rate than a human who has better things to do than filter his mail all day.
Re:Huh? Aren't humans 100%? (Score:4, Interesting)
(http://trypticon.org/)
Re:Huh? Aren't humans 100%? (Score:5, Funny)
(Last Journal: Sunday October 31 2004, @08:12PM)
Now I have to think again about putting humans to decorticate sunflower seeds, it's cheper than all those machines.
Re:Huh? Aren't humans 100%? (Score:5, Insightful)
Fortunately, soon we will all be able to use the superhuman spam-detection capabilities of these filters to save us from ourselves. Imagine all of those pesky e-mails from your 'friends' getting caught by your spam filter before they even impinge upon your consciousness.
It'd be a wonderful world.
Re:Huh? Aren't humans 100%? (Score:5, Insightful)
Kinda makes you wonder how they can know the filters are right though.
(please don't reply telling me how)
Re:Huh? Aren't humans 100%? (Score:5, Funny)
(http://trypticon.org/)
Re:Huh? Aren't humans 100%? (Score:4, Insightful)
(http://slashdot.org/)
Re:Huh? Aren't humans 100%? (Score:5, Informative)
Good question! We're working on this problem, among other things, at the PSAM [pdx.edu] project. We have a project to produce high-quality benchmark corpora for spam filter testing. Watch that space for ongoing work, or e-mail us an offer to pitch in and help---we could use it!
Re:Huh? Aren't humans 100%? (Score:5, Insightful)
Not comparable. The job of a junk mail filter is to drop things I don't want to read. It is trying ot match my evaluation, not to match a semi-objective criterion like red or blue.
If I read 1000 messages and say which I wish I hadn't read, then I am 100% accurate by definition.
Of course, if they are really talking about a pure spam filter -- ie one which identifies unsolicited commercial email -- then they can be more accurate than me, but at an uninteresting, perhaps even counter-productive, task:
I may get unsilicited commercial email I do want to read one day. Almost happened once (I had inadvertantly signed up for it, so it was not really unsolicited, and I didn't actually buy the piece of kit they had on special offer that week, but was tempted). I also get stuff I don't want which isn't spam (notably email from virus infected machines).
The referenced study seems to be a very sloppy job from this POV. They don't define what their criterion of sucess is, and to the extent they put in a hand waving attempt it is clearly nonsense:
`Unsolicited' does not imply `not desired'. If they don't tease those two apart, they can't get interesting results for real world applications. Eg, someone mailing my work address with a commercial proposition may well be a very welcome unsolicited commercial email.Re:Huh? Aren't humans 100%? (Score:5, Insightful)
Re:Huh? Aren't humans 100%? (Score:5, Insightful)
Re:Huh? Aren't humans 100%? (Score:4, Funny)
(http://www.seasonalwallpapers.com/)
I work with some people who use their computer every single day. Have had an email address for years, who still buys what they read in an email. Photoshop for $50...sure! Herbal viagra...why not?
Well, she always has a big smile on her face, maybe there's something to this spam thing.
Re:Huh? Aren't humans 100%? (Score:5, Funny)
You mean you've never noticed this before? Idiots are some of the happiest people I know.
Re:Huh? Aren't humans 100%? (Score:5, Funny)
(http://rixstep.com/)
Uh, sure they do. Popups - that's like those porn storms, isn't it? Some people say it only happens with IE and Windows, but I talked to my service provider and they told me 'just pull the power plug out of the wall when that happens'.
Easily fixed.
Re:Huh? Aren't humans 100%? (Score:5, Funny)
(http://www.slashdot.org/~Marvin_OScribbley | Last Journal: Thursday July 21 2005, @04:38PM)
Ok, now the screen dimmed a little and I heard the hard drive spin down, but the pop ups are still a comin! Oh, and something about "battery level at 98%" or something.
Re:Huh? Aren't humans 100%? (Score:5, Insightful)
Re:Huh? Aren't humans 100%? (Score:5, Insightful)
With 10 messages (after automatic spam detection) humans are 100% accurate.
With 1,000 messages, (before automatic spam detection)
humans are less than 100% accurate.
The experiment was done on 5849 messages.
Remember; one thing computers are good at is doing boring things repeatedly.
Re:Huh? Aren't humans 100%? (Score:5, Interesting)
(http://www.littleblur.com/ | Last Journal: Saturday November 24, @01:52PM)
If you see a strange name in your inbox with an odd title, that might be a Nigerian businessman, or it might be your long lost Nigerian brother.
I recently tried to order a t-shirt from this guy for a band he used to be in. I found his band because we have the same (semi-uncommon) name. So, he got an email From: himself. I had to send him two emails because he deleted the first one assuming it was spam.
I ordered some RAM for my dad a while back. He gets 200 spam emails a day (email addy in resume & web page), and he deleted the confirmation email from the RAM vendor. The RAM never shipped, and it took us a week to figure out that there was a problem.
People make mistakes all the time. Why is this an unexpected result? People are jackasses. This should be obvious.
Re:Huh? Aren't humans 100%? (Score:4, Funny)
(http://www.pogox.com/ | Last Journal: Wednesday June 04 2003, @02:33PM)
Or more likely, megacorp fires it's mail administrators for being incompetent and goes on about it's business.
Re:Huh? Aren't humans 100%? (Score:4, Funny)
(http://nerds.palmdrive.net/)
How could you possibly know? You deleted it!!
Re:Huh? Aren't humans 100%? (Score:5, Interesting)
And if the study posted about is accruate, of those 1% that are left, you will (if you're a perfectly average person) accidentally delete 0.16% of good messages. Surely you've deleted a valid message by accident before? I do it regularily, deleting 25 spam messages with a single good one embedded in it when I just woke up before I had my coffee is not a good thing ;)
At the very least, if you were given the same data as these tests, that would be true. Consider if you *didn't* use popfile - how many spams would you be deleting every day, and how many good messages would be accidentally deleted? I know that if I had to manually delete the two or three hundred spams interspersed with good messages, my false-positive rate (the percentage of good mail I accidentally deleted) would skyrocket.
So just be glad you've got popfile. Not only do you not have to go through as much spam, but you're also more accurate while going through the little you must.
Re:Huh? Aren't humans 100%? (Score:4, Interesting)
Before I used a spam filter, I once missed a very important message whose subject line was something to the effect of "URGENT - DON't REBOOT THIS MORNING." That was a bad one to miss.
Of course humans make mistakes, and it is entirely possible for an automated or semi-automated system to be more accurate than a human alone.
IM Spam (Score:5, Interesting)
wait, WTF? (Score:5, Insightful)
Re:wait, WTF? (Score:4, Interesting)
(http://slashdot.org/~HeelToe/)
This represents 8 days worth of spam for me. Yes, ~800 per day.
My address has been valid for 10 years. Why should I change it? Bogofilter is currently letting 2-3 per day into my inbox. I generally check for false-positives, but as the training has progressed, I am finding none anymore.
I plan to implement a single-shot, one try notification sender. I.e., if the mail gets classified as spam: lookup the mx record for the envelope return address, if it's nonexistent, lookup the a record. Make a connection and try to deliver a message indicating their message (include subject reference) was identified as spam, include a way for them to reliably get a message through to me. If any of the smtp exchange or address lookup fails, just forget it, they're probably not real anyway.
Not the best idea (Score:5, Insightful)
(Last Journal: Saturday August 28 2004, @12:14AM)
What you're planning has already been done, it's called TMDA, and it's not such a good idea. You're going to send out 800 "challenge" emails per day - have you given any thought to how many of those will be genuine addresses, but have nothing to do with the spam you receive because they just happen to be the joe-job victim? These kind of challenge/response systems may slighlty alleviate your own suffering through spam, but at a cost to all those unfortunate enough to have had their email addresses faked. And if the sheer impoliteness of such net behaviour doesn't put you off, note that you're using up more of your own bandwidth to send out such challenges
If any of the smtp exchange or address lookup fails, just forget it, they're probably not real anyway
It would make a lot more sense to make these kind of checks when you're receiving the email in the first place. Reject at the SMTP level - you never accept and process the spam in the first place
2+2=3 (Score:3)
(http://www.chessthecat.com/)
Am I crazy or is that nowhere near "10 times better"?
Number of significant digits... (Score:5, Informative)
(http://www.xemu.org/)
New proggie=99.984
So the human misses
Re:2+2=3 (Score:5, Insightful)
You have just unlocked the secret of virtually every news report that says "ten times more likely."
To get cancer. To have a heart attack. To suffer from the heartbreak of psoriasis. Whatever.
Yes, these numbers indicate "10 times better," and if you were to ask the reporter how likely am I to avoid cancer in both situations, these are the sorts of numbers he would show you.
Eat health food and your chance of having a heart attack is 99.984%. Eat too many donuts and your chance of having a heart attack is 99.983%, 10 times worse!
Always, always, always ask to see the raw numbers so that you know what "10 times worse" means.
Then ask if the numbers were collected by phone survey. If they were, throw them all away and have donut and a cup of coffee.
KFG
can it be used with SA? (Score:5, Interesting)
(http://pitchforkmedia.com/ | Last Journal: Tuesday March 23 2004, @09:08PM)
CB
Re:can it be used with SA? -yes (Score:5, Informative)
(http://web.archive.o.../liberty/quotesb.htm)
http://bugzilla.spamassassin.org/show_bug.cgi?i
Who is sending that one? (Score:5, Funny)
(http://goat.cx/ | Last Journal: Wednesday August 18 2004, @02:34PM)
Forgive me if I don't feel any pity that some moron's email gets filtered to the junk bin because I couldn't discern it from spam.
To get this new spam filter... (Score:5, Funny)
Better (Score:5, Interesting)
less thought for me... (Score:3, Funny)