[pymvpa] NSplitter?

Per B. Sederberg persed at princeton.edu
Sat Mar 28 00:57:46 UTC 2009


Hi folks:

I was helping a colleague perform a multivariate analysis of EEG data
today and we ran into a gap in the available splitters.  Given that
it's EEG, there's no real need for chunks, like in fMRI runs, and we
didn't want to run an NFoldSplitter-based cross-validation (CV)
because it would, quite possibly, take forever.  Instead, we simply
wanted to run a CV by splitting the data into 10 chunks (though we
could try different numbers of splits).

I realize we could have set up custom dataset chunks ourselves or
created a CustomSplitter, but what do folks think of extending the
HalfSplitter into an NSplitter, where you specify how many pieces you
want to split your data into?  Obviously, N=2 would be identical to
the HalfSplitter, but this would make it really easy to split your
data into an arbitrary number of equal-sized chunks.
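
To make the idea concrete, here is a minimal pure-Python sketch of an
N-way split.  The names n_way_chunks and n_way_splits are hypothetical
and are not part of PyMVPA; the sketch also assumes that samples are
assigned to chunks in contiguous blocks, and that "equal-sized" means
sizes differing by at most one when N does not divide the sample count.

```python
def n_way_chunks(n_samples, n_chunks):
    """Label each sample with a chunk id in 0..n_chunks-1.

    Hypothetical helper: chunks are contiguous blocks whose sizes
    differ by at most one sample.
    """
    if not 1 <= n_chunks <= n_samples:
        raise ValueError("need 1 <= n_chunks <= n_samples")
    return [i * n_chunks // n_samples for i in range(n_samples)]


def n_way_splits(n_samples, n_chunks):
    """Yield (train_indices, test_indices), holding out one chunk per fold."""
    chunks = n_way_chunks(n_samples, n_chunks)
    for held_out in range(n_chunks):
        train = [i for i, c in enumerate(chunks) if c != held_out]
        test = [i for i, c in enumerate(chunks) if c == held_out]
        yield train, test


if __name__ == "__main__":
    # 100 samples split into 10 folds: 90 for training, 10 for testing.
    for fold, (train, test) in enumerate(n_way_splits(100, 10)):
        print(fold, len(train), len(test))
```

With n_chunks=2 this reduces to a split into two halves, which is the
behaviour one would expect from the HalfSplitter case.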

Do you think this is worthwhile, or is there something that already
does this?  If you think I should add it in, are you happy with the
name NSplitter or do you have a better name?  I'm open to all
suggestions and comments.

Thanks,
Per


