[Shootout-list] Rule 30

Jon Harrop jon@ffconsultancy.com
Fri, 20 May 2005 00:59:16 +0100


On Thursday 19 May 2005 21:32, Brent Fulgham wrote:
> Can anyone suggest a regular expression problem
> that involves:
>
> 1)  A large input file so that we don't have to
>     have programs iterate over the same data multiple
>     times.

Perhaps it would be better to randomly generate the data?

> 2)  Involves useful regular expression features such
>     as capture?
>
> I think we might need a couple of tests:
>
> 1.  Find the elements in some big string.
> 2.  Revise some input document in some fashion.
>
> Ideas?

Here are some vague ideas:

1. finding satellites in DNA sequences.

2. extracting identifiers from a program.

Perhaps (2) could be the basis of a code obfuscator/shrinker?

-- 
Dr Jon D Harrop, Flying Frog Consultancy Ltd.
Objective CAML for Scientists
http://www.ffconsultancy.com/products/ocaml_for_scientists