[Pkg-shadow-devel] How does commit mail work in SVN and CVS

Martin Quinson martin.quinson@loria.fr
Sun, 15 May 2005 22:49:20 +0200


--6WlEvdN9Dv0WHSBl
Content-Type: multipart/mixed; boundary="aT9PWwzfKXlsBJM1"
Content-Disposition: inline


--aT9PWwzfKXlsBJM1
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Sun, May 15, 2005 at 09:22:55AM +0200, Christian Perrier wrote:
> For another project, I'd like to setup a automated commit mail system
> go get commit mails when someone commits in the project's CVS.
>=20
> Nicolas, how does this work for CVS?

Copy the attached script on the server running the cvs, in the CVSROOT
directory of your project and chmod +x it. Then add=20

 DEFAULT          $CVSROOT/CVSROOT/syncmail -q -S '[mon cvs] ' -f ens-lyon.=
fr -u %{sVv} Martin.Quinson@loria.fr

to the CVSROOT/loginfo of a working directory and commit it.

> And, while we're at it, how does it work for SVN?

Edit /svn/pkg-shadow/hooks/post-commit to fit your needs, make sure this is
executable (chmod +x) and you're set.

HTH, Mt.

--aT9PWwzfKXlsBJM1
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename=syncmail
Content-Transfer-Encoding: quoted-printable

#! /usr/bin/python

# Copyright (c) 2002, 2003, Barry Warsaw, Fred Drake, and contributors
# All rights reserved.
# See the accompanying LICENSE file for details.

# NOTE: SourceForge currently runs Python 2.2.3, so we need to remain
# compatible with the Python 2.2 line.

"""Complicated notification for CVS checkins.

This script is used to provide email notifications of changes to the CVS
repository.  These email changes will include context diffs of the changes.
Really big diffs will be trimmed.

This script is run from a CVS loginfo file (see $CVSROOT/CVSROOT/loginfo). =
 To
set this up, create a loginfo entry that looks something like this:

    mymodule /path/to/this/script %%s some-email-addr@your.domain

In this example, whenever a checkin that matches `mymodule' is made, this
script is invoked, which will generate the diff containing email, and send =
it
to some-email-addr@your.domain.

    Note: This module used to also do repository synchronizations via
    rsync-over-ssh, but since the repository has been moved to SourceForge,
    this is no longer necessary.  The syncing functionality has been ripped
    out in the 3.0, which simplifies it considerably.  Access the 2.x versi=
ons
    to refer to this functionality.  Because of this, the script is misname=
d.

It no longer makes sense to run this script from the command line.  Doing so
will only print out this usage information.

Usage:

    %(PROGRAM)s [options] <%%S> email-addr [email-addr ...]

Where options are:

    --cvsroot=3D<path>
        Use <path> as the environment variable CVSROOT.  Otherwise this
        variable must exist in the environment.

    --context=3D#
    -C #
        Include # lines of context around lines that differ (default: 2).

    -c
        Produce a context diff (default).

    -m hostname
    --mailhost hostname
        The hostname of an available SMTP server.  The default is
        'localhost'.

    -u
        Produce a unified diff (smaller).

    -S TEXT
    --subject-prefix=3DTEXT
        Prepend TEXT to the email subject line.

    -R ADDR
    --reply-to=3DADDR
      Add a "Reply-To: ADDR" header to the email message.

    --quiet / -q
        Don't print as much status to stdout.

    --fromhost=3Dhostname
    -f hostname
        The hostname that email messages appear to be coming from.  The Fro=
m:
        header of the outgoing message will look like user@hostname.  By
        default, hostname is the machine's fully qualified domain name.

    --help / -h
        Print this text.

The rest of the command line arguments are:

    <%%S>
        CVS %%s loginfo expansion.  When invoked by CVS, this will be a sin=
gle
        string containing the directory the checkin is being made in, relat=
ive
        to $CVSROOT, followed by the list of files that are changing.  If t=
he
        %%s in the loginfo file is %%{sVv}, context diffs for each of the
        modified files are included in any email messages that are generate=
d.

    email-addrs
        At least one email address.
"""
__version__ =3D '1.3'

import os
import sys
import re
import time
import getopt
import smtplib
import pwd
import socket

=66rom cStringIO import StringIO

# Which SMTP server to do we connect to?
MAILHOST =3D 'localhost'
MAILPORT =3D 25

# Diff trimming stuff
DIFF_HEAD_LINES =3D 20
DIFF_TAIL_LINES =3D 20
DIFF_TRUNCATE_IF_LARGER =3D 1000

COMMASPACE =3D ', '

PROGRAM =3D sys.argv[0]

BINARY_EXPLANATION_LINES =3D [
    "(This appears to be a binary file; contents omitted.)\n"
    ]

NOVERSION =3D "Couldn't generate diff; no version number found for file: %s"
BACKSLASH =3D "Couldn't generate diff: backslash in filespec's filename: %s"


=0C
def usage(code, msg=3D''):
    print __doc__ % globals()
    if msg:
        print msg
    sys.exit(code)


=0C
def calculate_diff(entry, contextlines):
    file =3D entry.name
    oldrev =3D entry.revision
    newrev =3D entry.new_revision

    # Make sure we can find a CVS version number
    if oldrev is None and newrev is None:
        return NOVERSION % file

    if file.find("'") <> -1:
        # Those crazy users put single-quotes in their file names!  Now we
        # have to escape everything that is meaningful inside double-quotes.
        filestr =3D filestr.replace('\\', '\\\\')
        filestr =3D filestr.replace('`', '\`')
        filestr =3D filestr.replace('"', '\"')
        filestr =3D filestr.replace('$', '\$')
        # and quote it with double-quotes.
        filestr =3D '"' + filestr + '"'
    else:
        # quote it with single-quotes.
        filestr =3D "'" + file + "'"
    if oldrev is None:
        # File is being added.
        try:
            if os.path.exists(file):
                fp =3D open(file)
            else:
                update_cmd =3D "cvs -fn update -r %s -p %s" % (newrev, file=
str)
                fp =3D os.popen(update_cmd)
            lines =3D fp.readlines()
            fp.close()
            # Is this a binary file?  Let's look at the first few
            # lines to figure it out:
            for line in lines[:5]:
                for c in line.rstrip():
                    if c.isspace():
                        continue
                    if c < ' ' or c > chr(127):
                        lines =3D BINARY_EXPLANATION_LINES[:]
                        break
            lines.insert(0, '--- NEW FILE: %s ---\n' % file)
        except IOError, e:
            lines =3D ['***** Error reading new file: ',
                     str(e), '\n***** file: ', file, ' cwd: ', os.getcwd()]
    elif newrev is None:
        lines =3D ['--- %s DELETED ---\n' % file]
    else:
        # File has been changed.
        # This /has/ to happen in the background, otherwise we'll run into =
CVS
        # lock contention.  What a crock.
        if contextlines > 0:
            difftype =3D "-C " + str(contextlines)
        else:
            difftype =3D "-u"
        diffcmd =3D "/usr/bin/cvs -f diff -kk %s --minimal -r %s -r %s %s" \
                  % (difftype, oldrev, newrev, filestr)
        fp =3D os.popen(diffcmd)
        lines =3D fp.readlines()
        # ignore the error code, it always seems to be 1 :(
        fp.close()
    if len(lines) > DIFF_TRUNCATE_IF_LARGER:
        removedlines =3D len(lines) - DIFF_HEAD_LINES - DIFF_TAIL_LINES
        del lines[DIFF_HEAD_LINES:-DIFF_TAIL_LINES]
        lines.insert(DIFF_HEAD_LINES,
                     '[...%d lines suppressed...]\n' % removedlines)
    return ''.join(lines)


=0C
rfc822_specials_re =3D re.compile(r'[\(\)\<\>\@\,\;\:\\\"\.\[\]]')

def quotename(name):
    if name and rfc822_specials_re.search(name):
        return '"%s"' % name.replace('"', '\\"')
    else:
        return name


=0C
def blast_mail(subject, people, entries, contextlines, fromhost, replyto):
    # cannot wait for child process or that will cause parent to retain cvs
    # lock for too long.  Urg!
    if not os.fork():
        # in the child
        # give up the lock you cvs thang!
        time.sleep(2)
        # Create the smtp connection to the localhost
        conn =3D smtplib.SMTP()
        conn.connect(MAILHOST, MAILPORT)
        pwinfo =3D pwd.getpwuid(os.getuid())
        user =3D pwinfo[0]
        name =3D pwinfo[4].split(',')[0]
        domain =3D fromhost or socket.getfqdn()
        address =3D '%s@%s' % (user, domain)
        s =3D StringIO()
        sys.stdout =3D s
        datestamp =3D time.strftime('%a, %d %b %Y %H:%M:%S +0000',
                                  time.gmtime(time.time()))
        try:
            vars =3D {'address' : address,
                    'name'    : quotename(name),
                    'people'  : COMMASPACE.join(people),
                    'subject' : subject,
                    'version' : __version__,
                    'date'    : datestamp,
                    }
            print '''\
=46rom: %(name)s <%(address)s>
To: %(people)s''' % vars
            if replyto:
                print 'Reply-To: %s' % replyto
            print '''\
Subject: %(subject)s
Date: %(date)s
X-Mailer: Python syncmail %(version)s <http://sf.net/projects/cvs-syncmail>
''' % vars
            s.write(sys.stdin.read())
            # append the diffs if available
            print
            for entry in entries:
                print calculate_diff(entry, contextlines)
        finally:
            sys.stdout =3D sys.__stdout__
        resp =3D conn.sendmail(address, people, s.getvalue())
        conn.close()
        os._exit(0)


=0C
class CVSEntry:
    def __init__(self, name, revision, timestamp, conflict, options, tagdat=
e):
        self.name =3D name
        self.revision =3D revision
        self.timestamp =3D timestamp
        self.conflict =3D conflict
        self.options =3D options
        self.tagdate =3D tagdate

def get_entry(prefix, mapping, line, filename):
    line =3D line.strip()
    parts =3D line.split("/")
    _, name, revision, timestamp, options, tagdate =3D parts
    key =3D namekey(prefix, name)
    try:
        entry =3D mapping[key]
    except KeyError:
        if revision =3D=3D "0":
            revision =3D None
        if timestamp.find("+") !=3D -1:
            timestamp, conflict =3D tuple(timestamp.split("+"))
        else:
            conflict =3D None
        entry =3D CVSEntry(key, revision, timestamp, conflict,
                         options, tagdate)
        mapping[key] =3D entry
    return entry

def namekey(prefix, name):
    if prefix:
        return os.path.join(prefix, name)
    else:
        return name

def load_change_info(prefix=3DNone):
    if prefix is not None:
        entries_fn =3D os.path.join(prefix, "CVS", "Entries")
    else:
        entries_fn =3D os.path.join("CVS", "Entries")
    entries_log_fn =3D entries_fn + ".Log"
    mapping =3D {}
    f =3D open(entries_fn)
    while 1:
        line =3D f.readline()
        if not line:
            break
##        if line.strip() =3D=3D "D":
##            continue
        # we could recurse down subdirs, except the Entries.Log files
        # we need haven't been written to the subdirs yet, so it
        # doesn't do us any good
##        if line[0] =3D=3D "D":
##            name =3D line.split("/")[1]
##            dirname =3D namekey(prefix, name)
##            if os.path.isdir(dirname):
##                m =3D load_change_info(dirname)
##                mapping.update(m)
        if line[0] =3D=3D "/":
            # normal file
            get_entry(prefix, mapping, line, entries_fn)
        # else: bogus Entries line
    f.close()
    if os.path.isfile(entries_log_fn):
        f =3D open(entries_log_fn)
        while 1:
            line =3D f.readline()
            if not line:
                break
            if line[1:2] !=3D ' ':
                # really old version of CVS
                break
            entry =3D get_entry(prefix, mapping, line[2:], entries_log_fn)
            parts =3D line.split("/")[1:]
            if line[0] =3D=3D "A":
                # adding a file
                entry.new_revision =3D parts[1]
            elif line[0] =3D=3D "R":
                # removing a file
                entry.new_revision =3D None
        f.close()
    for entry in mapping.values():
        if not hasattr(entry, "new_revision"):
            print 'confused about file', entry.name, '-- ignoring'
            del mapping[entry.name]
    return mapping

def load_branch_name():
    tag_fn =3D os.path.join("CVS", "Tag")
    if os.path.isfile(tag_fn):
        f =3D open(tag_fn)
        line =3D f.readline().strip()
        f.close()
        if line[:1] =3D=3D "T":
            return line[1:]
    return None

# scan args for options
def main():
    # XXX Should really move all the options to an object, just to
    # avoid threading so many positional args through everything.
    try:
        opts, args =3D getopt.getopt(
            sys.argv[1:], 'hC:cuS:R:qf:m:',
            ['fromhost=3D', 'context=3D', 'cvsroot=3D', 'mailhost=3D',
             'subject-prefix=3D', 'reply-to=3D',
             'help', 'quiet'])
    except getopt.error, msg:
        usage(1, msg)

    # parse the options
    contextlines =3D 2
    verbose =3D 1
    subject_prefix =3D ""
    replyto =3D None
    fromhost =3D None
    for opt, arg in opts:
        if opt in ('-h', '--help'):
            usage(0)
        elif opt =3D=3D '--cvsroot':
            os.environ['CVSROOT'] =3D arg
        elif opt in ('-C', '--context'):
            contextlines =3D int(arg)
        elif opt =3D=3D '-c':
            if contextlines <=3D 0:
                contextlines =3D 2
        elif opt =3D=3D '-u':
            contextlines =3D 0
        elif opt in ('-S', '--subject-prefix'):
            subject_prefix =3D arg
        elif opt in ('-R', '--reply-to'):
            replyto =3D arg
        elif opt in ('-q', '--quiet'):
            verbose =3D 0
        elif opt in ('-f', '--fromhost'):
            fromhost =3D arg
        elif opt in ('-m', '--mailhost'):
            global MAILHOST
            MAILHOST =3D arg

    # What follows is the specification containing the files that were
    # modified.  The argument actually must be split, with the first compon=
ent
    # containing the directory the checkin is being made in, relative to
    # $CVSROOT, followed by the list of files that are changing.
    if not args:
        usage(1, 'No CVS module specified')
    subject =3D subject_prefix + args[0]
    specs =3D args[0].split()
    del args[0]

    # The remaining args should be the email addresses
    if not args:
        usage(1, 'No recipients specified')

    # Now do the mail command
    people =3D args

    if specs[-3:] =3D=3D ['-', 'Imported', 'sources']:
        print 'Not sending email for imported sources.'
        return

    branch =3D load_branch_name()
    changes =3D load_change_info()

    if verbose:
        print 'Mailing %s...' % COMMASPACE.join(people)
        print 'Generating notification message...'
    blast_mail(subject, people, changes.values(),
               contextlines, fromhost, replyto)
    if verbose:
        print 'Generating notification message... done.'


=0C
if __name__ =3D=3D '__main__':
    main()
    sys.exit(0)

--aT9PWwzfKXlsBJM1--

--6WlEvdN9Dv0WHSBl
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: Digital signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.5 (GNU/Linux)

iD8DBQFCh7XQIiC/MeFF8zQRArCTAJ42g6vYOpW2EfmkRUOhQ8nUMbkFLwCaA0A7
/GoSm0ItrdUA28PX9K6gka8=
=rtzb
-----END PGP SIGNATURE-----

--6WlEvdN9Dv0WHSBl--