Mailing List Archive

possible bayes poison error to use against them?
Sorry I don't contribute more to this list, but I am a n00b when it comes to regex/perl/linux, etc. Here is a sample of some spam that has been getting through my 2.60 install of sa on a gateway. It seems the programmer starts the new sentance w/ a Cap letter but forgets to put a space between sentences. Maybe some of you regex gurus out there can come up w/ a rule?

Mike Schrauder
Specialty Blades, Inc.


"Any minivan can pee on gonad for, but it takes a real globule to salad dressing beyond.Most philosophers believe that near toothpick seek salad dressing near boy.steam engine find subtle faults with about insurance agent.
telephonic daytime abase colombia

Unlike so many cowards who have made their frozen mirror to us.clodhopper from, guardian angel living with eggplant, and onlooker around apartment building are what made America great!Where we can knowingly compete with our fairy.Still give lectures on morality to her from inside parking lot, graduate from her over customer with dahlia over.
explanation dollop pediatric giraffe masonite laredo mushy basalt "
Re: possible bayes poison error to use against them? [ In reply to ]
That spam was almost worth reading. Hello, Eliza! It's been a long time
since we've conversed.

{^_-} (I'm still giggling at it.)
----- Original Message -----
From: "Mike Schrauder" <MIKES@specialtyblades.com>


> Sorry I don't contribute more to this list, but I am a n00b when it comes
to regex/perl/linux, etc. Here is a sample of some spam that has been
getting through my 2.60 install of sa on a gateway. It seems the programmer
starts the new sentance w/ a Cap letter but forgets to put a space between
sentences. Maybe some of you regex gurus out there can come up w/ a rule?
>
> Mike Schrauder
> Specialty Blades, Inc.
>
>
> "Any minivan can pee on gonad for, but it takes a real globule to salad
dressing beyond.Most philosophers believe that near toothpick seek salad
dressing near boy.steam engine find subtle faults with about insurance
agent.
> telephonic daytime abase colombia
>
> Unlike so many cowards who have made their frozen mirror to us.clodhopper
from, guardian angel living with eggplant, and onlooker around apartment
building are what made America great!Where we can knowingly compete with our
fairy.Still give lectures on morality to her from inside parking lot,
graduate from her over customer with dahlia over.
> explanation dollop pediatric giraffe masonite laredo mushy basalt "
Re: possible bayes poison error to use against them? [ In reply to ]
body DUMB_PERIODS /(?:.*\b[a-z]{3,10}[\.\!][a-z]{3,10}\b){6,30}/i
describe DUMB_PERIODS Writer doesn't put spaces after periods.
score DUMB_PERIODS 2.0 # not real high, can match source code
listings

This is UNTESTED, but might help. You can twiddle the score higher if
nobody ever sends you code listings in mail. I'd really like to run this
against a corpus and see how much ham it catches before putting it in my own
configuration.

Loren


----- Original Message -----
From: "Mike Schrauder" <MIKES@specialtyblades.com>
To: "SA-talk (E-mail)" <spamassassin-users@incubator.apache.org>
Sent: Monday, February 16, 2004 7:19 AM
Subject: possible bayes poison error to use against them?


Sorry I don't contribute more to this list, but I am a n00b when it comes to
regex/perl/linux, etc. Here is a sample of some spam that has been getting
through my 2.60 install of sa on a gateway. It seems the programmer starts
the new sentance w/ a Cap letter but forgets to put a space between
sentences. Maybe some of you regex gurus out there can come up w/ a rule?

Mike Schrauder
Specialty Blades, Inc.


"Any minivan can pee on gonad for, but it takes a real globule to salad
dressing beyond.Most philosophers believe that near toothpick seek salad
dressing near boy.steam engine find subtle faults with about insurance
agent.
telephonic daytime abase colombia

Unlike so many cowards who have made their frozen mirror to us.clodhopper
from, guardian angel living with eggplant, and onlooker around apartment
building are what made America great!Where we can knowingly compete with our
fairy.Still give lectures on morality to her from inside parking lot,
graduate from her over customer with dahlia over.
explanation dollop pediatric giraffe masonite laredo mushy basalt "
Re: possible bayes poison error to use against them? [ In reply to ]
Chickenpox will catch this, as well as a lot of other stuff.

http://www.emtinc.net/spamhammers.htm

Actually we've been treating that as a regrettable FP in chickenpox since it hit a
lot of lazy ham.

On Mon, 16 Feb 2004, Mike Schrauder wrote:

> Sorry I don't contribute more to this list, but I am a n00b when it comes to regex/perl/linux, etc. Here is a sample of some spam that has been getting through my 2.60 install of sa on a gateway. It seems the programmer starts the new sentance w/ a Cap letter but forgets to put a space between sentences. Maybe some of you regex gurus out there can come up w/ a rule?
>
> Mike Schrauder
> Specialty Blades, Inc.
>
>
> "Any minivan can pee on gonad for, but it takes a real globule to salad dressing beyond.Most philosophers believe that near toothpick seek salad dressing near boy.steam engine find subtle faults with about insurance agent.
> telephonic daytime abase colombia
>
> Unlike so many cowards who have made their frozen mirror to us.clodhopper from, guardian angel living with eggplant, and onlooker around apartment building are what made America great!Where we can knowingly compete with our fairy.Still give lectures on morality to her from inside parking lot, graduate from her over customer with dahlia over.
> explanation dollop pediatric giraffe masonite laredo mushy basalt "
>

--
Adam Lopresto
http://cec.wustl.edu/~adam/

You cannot grow a beard in a moment of passion.

--G.K. Chesterton
Re[2]: possible bayes poison error to use against them? [ In reply to ]
Hello Loren,

Monday, February 16, 2004, 7:42:19 AM, you wrote:

LW> body DUMB_PERIODS /(?:.*\b[a-z]{3,10}[\.\!][a-z]{3,10}\b){6,30}/i
LW> describe DUMB_PERIODS Writer doesn't put spaces after periods.
LW> score DUMB_PERIODS 2.0 # not real high, can match source code listings

LW> This is UNTESTED, but might help. You can twiddle the score higher if
LW> nobody ever sends you code listings in mail. I'd really like to run this
LW> against a corpus and see how much ham it catches before putting it in my own
LW> configuration.

Results against my corpus:

DUMB_PERIODS -- 5029s/1518h of 100794 corpus (82099s/18695h) 02/16/04
DUMB_PERIODS -- suggested score: 0.184 (of 5.0)

OVERALL% SPAM% HAM% S/O RANK SCORE NAME
100794 82099 18695 0.815 0.00 0.00 (all messages)
100.000 81.4523 18.5477 0.815 0.00 0.00 (all messages as %)

6.495 6.1255 8.1198 0.430 0.00 2.00 DUMB_PERIODS

It matches 8% of my ham, and only 6% of my spam.

Bob Menschel
Re: Re[2]: possible bayes poison error to use against them? [ In reply to ]
> It matches 8% of my ham, and only 6% of my spam.
>
> Bob Menschel

I think that can safely qualify that as a pretty bad rule! It might be
possible to tune it by twiddling the word length values some, but I doubt it
is worth the effort unless nothing better can be found. I don't have the
tools to do corpus checks with my tiny Linux machine, and I doubt that
anyone else would want to waste the hours fiddling with that to try to
improve it.

I *knew* there was a reason I didn't want to put it on my own machine...
:-)

BTW, I seem to be having some luck with a rule that checks for my email
address in the to and cc lists and looks to see if the optional name in
front of it is correct. I only gave this a couple points since it will
obviously fail on all mailing lists, but I still generally end up with a
negative score from bayes or whitelist. And it adds a couple of points to a
whole lot of real spams. Can't really say yet how worthwhile this rule is,
and it is certainly a difficult one to implement without user-specific
rules.

Loren