[Shootout-list] Rule 30

Thu, 19 May 2005 17:15:20 -0700 (PDT)

--- Jon Harrop <jon@ffconsultancy.com> wrote:
> On Thursday 19 May 2005 21:32, Brent Fulgham wrote:
> > Can anyone suggest a regular expression problem
> > that involves:
> >
> > 1)  A large input file so that we don't have to
> >     have programs iterate over the same data multiple
> >     times.
> 
> Perhaps it would be better to randomly generate the data?
> 
> > 2)  Involves useful regular expression features such
> >     as capture?
> >
> > I think we might need a couple of tests:
> >
> > 1.  Find the elements in some big string.
> > 2.  Revise some input document in some fashion.
> >
> > Ideas?
> 
> Here are some vague ideas:
> 
> 1. finding satellites in DNA sequences.
> 
> 2. extracting identifiers from a program.
> 
> Perhaps (2) could be the basis of a code obfuscator/shrinker?

We already have large randomly generated DNA sequences

Here's a source
Elementary Sequence Analysis
http://helix.biology.mcmaster.ca/chpt1.pdf

Unfortunately there aren't any FMR-1 triplets in the randomly generated data.

__________________________________ 
Yahoo! Mail Mobile 
Take Yahoo! Mail with you! Check email on your mobile phone. 
http://mobile.yahoo.com/learn/mail