[Shootout-list] Rule 30
Isaac Gouy
igouy2@yahoo.com
Thu, 19 May 2005 17:15:20 -0700 (PDT)
--- Jon Harrop <jon@ffconsultancy.com> wrote:
> On Thursday 19 May 2005 21:32, Brent Fulgham wrote:
> > Can anyone suggest a regular expression problem
> > that involves:
> >
> > 1) A large input file so that we don't have to
> > have programs iterate over the same data multiple
> > times.
>
> Perhaps it would be better to randomly generate the data?
>
> > 2) Involves useful regular expression features such
> > as capture?
> >
> > I think we might need a couple of tests:
> >
> > 1. Find the elements in some big string.
> > 2. Revise some input document in some fashion.
> >
> > Ideas?
>
> Here are some vague ideas:
>
> 1. finding satellites in DNA sequences.
>
> 2. extracting identifiers from a program.
>
> Perhaps (2) could be the basis of a code obfuscator/shrinker?
We already have large randomly generated DNA sequences
Here's a source
Elementary Sequence Analysis
http://helix.biology.mcmaster.ca/chpt1.pdf
Unfortunately there aren't any FMR-1 triplets in the randomly generated data.
__________________________________
Yahoo! Mail Mobile
Take Yahoo! Mail with you! Check email on your mobile phone.
http://mobile.yahoo.com/learn/mail