[Teammetrics-discuss] Report II

Sukhbir Singh sukhbir.in at gmail.com
Wed Jun 22 16:04:06 UTC 2011


Hi,

It's good to see you Scott!

>> Well, I think Sukhbir was meaning top X posters and just took over my
>> X=10.  If it is about SPAM prevention:  Ignoring posters with less than
>> 5 postings might do the job.  They are not really relevant (except in
>> quite spares settled and new teams) and spammers tend to use different
>> names so will not post more than five times.

That is true.

As far as spam is concerned, after identifying the clear patterns that
there are to spam in the Alioth lists, we should be able to cut down a
substantial chunk of it. I intend to make sample runs to compare how
much spam we _can_ actually filter.

> Cutting out the bottom (less than 5 posts) would be good for the
> mailing lists. I'd prefer chopping off the bottom than limiting to
> just the top.

Seems a good strategy. Let me finish implementing the spam 'filter'
(it should be done by tomorrow) and then we will proceed. Also, once
we have the information in the database, trimming down is trivial :-)

There was the issue of encodings also but that I have taken care of it
and it should be resolved in tonight's commit.



More information about the Teammetrics-discuss mailing list