[Teammetrics-discuss] Fixed bugs in NNTPstat.py

Andreas Tille andreas at an3as.eu
Thu Aug 4 06:17:41 UTC 2011


On Thu, Aug 04, 2011 at 02:07:34AM +0530, Sukhbir Singh wrote:
>     git pull
> 
> There were some bugs in NNTPstat.py that were causing problems. This
> was because I forgot that the liststat code exists after even one team
> is parsed.

This is what I expected to be the reason.
 
> Anyways, I have changed some other things also, most notably you will
> see a more detailed log output as the messages are being downloaded.
> This is the only thing I could come up with, so feel free to comment.

The log is fine.

> I recommend you run the code with two small teams, just to be sure:
> `debian-accessibility` and `debian-blends` (a total of ~6000
> articles). It seems to work for me now but as usual, I leave the
> testing to you :) I think all is well now.

teammetrics=# SELECT project, count(*) from listarchives where domain='lists.debian.org' group by  project;
       project        | count 
----------------------+-------
 debian-amd64         |     5
 debian-curiosa       |  3908
 debian-boot          |   393
 debian-ctte          |     4
 debian-arm           |  9666
 debian-derivatives   |   546
 debian-blends        |  2930
 debian-accessibility |  3177
 debian-desktop       |  2690

Looks good except for amd64 and ctte.  While I think the mboxes have
a reasonable size to reflect the expected number of postings:

/var/cache/teammetrics$ ls -l debian-amd64-2-27592.mbox debian-ctte-1-2608.mbox 
-rw-rw-r-- 1 tille tille 50053394  3. Aug 13:10 debian-amd64-2-27592.mbox
-rw-rw-r-- 1 tille tille  6224948  3. Aug 19:44 debian-ctte-1-2608.mbox

something is wrong with the import.  Could you please have a look into
these two specific lists. 

> Just in case if we run into any more privacy issues (!), we can have
> this code as backup and that is why it should work perfectly.

That's correct.

Kind regards

       Andreas.

-- 
http://fam-tille.de



More information about the Teammetrics-discuss mailing list