[Pan-devel] Scoring articles by ration of bytes/lines

From: Konrad Karl
Subject: [Pan-devel] Scoring articles by ration of bytes/lines
Date: Thu, 1 Feb 2007 00:13:19 +0100
User-agent: Mutt/


I want to be able to score/filter/delete articles where
the ratio of article_bytes / article_lines is below a certain
threshold.

Many sporged postings could be easily identified this way. With
the old Pan 1.x I used a simple Perl filter program
to delete articles with too low a ratio, and this simple
approach worked surprisingly well. The algorithm might require some
tweaking, e.g. if the number of lines is < 10, then don't apply the
ratio rule, etc.
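
To make the rule concrete, here is a rough sketch in Python (not the
original Perl filter, and not anything from the Pan sources). The
function name, the parameter names and the default threshold of 20
bytes per line are made up for illustration; only the "fewer than 10
lines" exemption comes from the description above.

```python
def is_low_ratio(article_bytes: int, article_lines: int,
                 min_ratio: float = 20.0, min_lines: int = 10) -> bool:
    """Return True if the article's bytes-per-line ratio is suspiciously low.

    min_ratio is a hypothetical threshold and would need tuning per group.
    min_lines exempts short articles, for which the ratio is not meaningful.
    """
    # Short (or empty) articles are never flagged; this also avoids
    # dividing by zero when the line count is 0.
    if article_lines <= 0 or article_lines < min_lines:
        return False
    return article_bytes / article_lines < min_ratio


if __name__ == "__main__":
    # (bytes, lines) pairs standing in for article header values.
    for size, lines in [(300, 50), (3000, 50), (20, 5)]:
        print(size, lines, is_low_ratio(size, lines))
```

An article flagged this way could then be given a negative score or
deleted, depending on how aggressive the filter should be.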

Now I have started looking into the latest sources, but I am
afraid it will take considerable time until I understand
what's going on.

What do you think?

