[pymvpa] NSplitter?

Yaroslav Halchenko debian at onerussian.com
Sat Mar 28 03:13:55 UTC 2009


> I was helping a colleague perform a multivariate analysis of EEG data
> today and we ran into a missing splitter issue.  Given that it's EEG
> there's no real need for chunks, like in fMRI runs, and we didn't want
> to run an NFoldSplitter-based cross validation (CV) because it would,
> quite possibly, take forever.  Instead, we simply wanted to run a CV
> by splitting the data into 10 chunks (though we could try different
> numbers of splits).

just look at our Frontiers paper code ;) 

I guess you just want to use

coarsenChunks(10)  with NFoldSplitter ;)

or may be look at the docstring of Splitter at such parameters as
nperlabel, count and strategy. Not sure if it is applicable directly yet
to some splitter to generate your usecase, but I guess we could alter 
NoneSplitter to spit out 'the other' part as a testing part


> I realize we could have set up custom dataset chunks ourselves or
> created a CustomSplitter, but what do folks think of extending the
> HalfSplitter into an NSplitter, where you specify how many pieces you
> want to split your data into. 

hm... is that wording wrong or me is slow? are you saying that you want
to generate splits which consistn more than of 2 parts? like 1 for training,
2nd for testing and parameter selection and 3rd one for resultant
cross-validation?

>  Obviously, providing N=2 would be
> identical to the HalfSplitter.  But this would make it really easy to
> split your data into arbitrary numbers of equal-sized chunks.

yeah... I guess those parameters described above is what you are looking
for... I am just not sure now if we have that "TheOtherSplitter" which toseses
what wasn't selected into a testing part... may be it is already there ;)

or did I misunderstood smth?

P.S. how is pyepl release coming? ;)

-- 
Yaroslav Halchenko
Research Assistant, Psychology Department, Rutgers-Newark
Student  Ph.D. @ CS Dept. NJIT
Office: (973) 353-1412 | FWD: 82823 | Fax: (973) 353-1171
        101 Warren Str, Smith Hall, Rm 4-105, Newark NJ 07102
WWW:     http://www.linkedin.com/in/yarik        



More information about the Pkg-ExpPsy-PyMVPA mailing list