[Teammetrics-discuss] Observations from current run of `archiveparser.py`.

Andreas Tille andreas at an3as.eu
Sat Feb 4 22:35:12 UTC 2012


On Sun, Feb 05, 2012 at 02:48:57AM +0530, Sukhbir Singh wrote:
> To add to the above: either ways, we won't be able to catch all spam
> messages. So we should not let clean messages be wasted because of our
> spam fighting effort. And it's not wasting also: we will be populating
> listspam, it's just that I want to populate listarchives also.

As I said, data-duplication makes no sense.  Just use an additional
bolean column (spam) and set it to true id we suspect spam.
 
Kind regards

    Andreas. 

-- 
http://fam-tille.de



More information about the Teammetrics-discuss mailing list