[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <200610031932.13125.m.kozlowski@tuxland.pl>
Date: Tue, 3 Oct 2006 19:32:13 +0200
From: Mariusz Kozlowski <m.kozlowski@...land.pl>
To: Alan Cox <alan@...rguk.ukuu.org.uk>
Cc: linux-kernel <linux-kernel@...r.kernel.org>
Subject: Re: Spam, bogofilter, etc
> every good spammer reruns their message through spamassassin adding random
> text till they get a good score *then* they spew it out.
That's a flaw in the whole idea of having pre-defined (by human) separate
rules catching misc obvious (to us) spam indicators. If you had a filter that
you just feed with raw data from many sources and that does pattern
recognition and learns on its own, there (probably) would be no way to go
around it. At least it wouldn't be easy. In fact i.e. when ANN is used as
classifier, the rules created after training are hidden and have no obvious
represantation to us so one would have no idea what to change to get the
desired filter output.
Mariusz
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists