[Teammetrics-discuss] How does filling up the database work?

Sukhbir Singh sukhbir.in at gmail.com
Tue Aug 9 20:02:39 UTC 2011


Hi,

    git pull

> Hmmm, this algorithm does not really need a MD5sum - just the name of
> the parsed mbox would be sufficient, right?  You are just not

Removed the checksum and implemented a simple list.

>> > BTW, steps 3. and 4. should be exchanged.  If 4. might fail for some
>> > reason you should not set the "not for download" flag in lists.hash.

Fixed and exchanged.

Also if a list is specified that does not exist:

2011-08-10 01:12:04,154 ERROR: Unable to fetch mailing list archive
HTTP Error 404: Not Found

So don't worry about it as it's handled much before downloading :)

>  run if
>   (script_run_date) != 1st day of Month    OR
>   mbox_for_this_month (which is ignored) exists

Implemented.

> Well, once the mboxes are deleted as we agreed upon that boils down to
> the stored mbox names in lists.hash or did I missed something.

To finalize this 'clearing' properly, here is what all we need to do
if we need to start from ZERO:

1. Clear lists.hash AND/ OR nntplists.hash
2. Clear the database.

So no mbox (whether gzipped or unzipped) is saved to the disk, they
are parsed and removed. The two things above are the only things you
need to clear. No more disk space problems. Yayay!

> Nooooooooooooooooooo!  :-)

;)

> A primary key is a primary key, right.  We will probably do this with a
> simple script - but this is no reason to not implement proper database
> logic which is to set proper constraints where these do make sense.

Not done, either I can add a primary key or include this in the
script. Your call!



More information about the Teammetrics-discuss mailing list