[Pkg-exppsy-maintainers] Q: Crossvalidation feature selection

Yaroslav Halchenko debian at onerussian.com
Sun Jan 6 15:55:39 UTC 2008


Yesterday, git push --mirror did us a disservice and removed some
branches (e.g. {mh,per}/master) from our 'central' git repository. Michael
has already pushed his back into the repository; Per, could you push yours?

On Sat, 29 Dec 2007, Per B. Sederberg wrote:

> Hellooo Yarik!

> On Dec 24, 2007 4:08 PM, Yaroslav Halchenko <debian at onerussian.com> wrote:
> > Hi Per,

> > And Merry Christmass!


> And a Merry X-masses to you, too!  You have sent me a great present :)

> I've prepared my datasets and am ready to try all this out.  I've
> updated to the latest version of pymvpa and I'll let you know how it
> goes.

> Latros,
> P

> > > > In any case (Michael will correct me if I am wrong), until now we did
> > > > not have a Classifier that does feature selection, i.e. you had to
> > > > loop through the splits manually and run RFE FeatureSelection (using
> > > > some SensitivityAnalyzer such as OnewayAnova, or LinearSVMWeights if
> > > > you use SVM) on each split yourself.
> > > OK, I can certainly do that loop and keep track of the results myself.
> > >  I just thought some version may already be there (and I see it may be
> > > soon :))
> > oki doki -- it is accomplished to some degree ;-)
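
For reference, the manual loop discussed above could look roughly like the
sketch below. It is plain Python and not the PyMVPA API: every function name
is made up for illustration, the mean-difference score only stands in for a
real SensitivityAnalyzer, the nearest-mean rule only stands in for a real
classifier, and labels are assumed to be 0/1. The point is just that the
features are chosen on the training part of each split, never on the
held-out part.

```python
# Plain-Python sketch, NOT the PyMVPA API: every name here is hypothetical.

def mean_diff_scores(samples, labels):
    """Per-feature |mean(class 1) - mean(class 0)|, a stand-in for a
    sensitivity analyzer such as a one-way ANOVA. Assumes 0/1 labels."""
    scores = []
    for j in range(len(samples[0])):
        a = [s[j] for s, l in zip(samples, labels) if l == 0]
        b = [s[j] for s, l in zip(samples, labels) if l == 1]
        scores.append(abs(sum(b) / len(b) - sum(a) / len(a)))
    return scores


def select_top(scores, n_keep):
    """Indices of the n_keep highest-scoring features, in ascending order."""
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:n_keep])


def take(row, keep):
    """Restrict one sample to the kept feature indices."""
    return [row[j] for j in keep]


def nearest_mean_predict(train_x, train_y, test_x):
    """Toy classifier: assign each test sample to the closest class mean."""
    means = {}
    for label in set(train_y):
        rows = [x for x, y in zip(train_x, train_y) if y == label]
        means[label] = [sum(col) / len(rows) for col in zip(*rows)]

    def dist(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))

    return [min(means, key=lambda l: dist(x, means[l])) for x in test_x]


def crossvalidate_with_selection(samples, labels, chunks, n_keep):
    """Leave-one-chunk-out loop; selection uses the training part only."""
    accuracies = []
    for held_out in sorted(set(chunks)):
        train = [i for i, c in enumerate(chunks) if c != held_out]
        test = [i for i, c in enumerate(chunks) if c == held_out]
        train_x = [samples[i] for i in train]
        train_y = [labels[i] for i in train]
        keep = select_top(mean_diff_scores(train_x, train_y), n_keep)
        predicted = nearest_mean_predict(
            [take(x, keep) for x in train_x], train_y,
            [take(samples[i], keep) for i in test])
        hits = sum(p == labels[i] for p, i in zip(predicted, test))
        accuracies.append(hits / len(test))
    return accuracies


# Tiny made-up dataset: feature 0 separates the classes, the rest is noise.
samples = [[0.0, 3.0, 1.0], [1.0, 3.1, 1.0],
           [0.1, 1.0, 2.0], [0.9, 0.9, 2.0],
           [0.2, 2.0, 0.5], [1.1, 2.0, 0.6]]
labels = [0, 1, 0, 1, 0, 1]
chunks = [0, 0, 1, 1, 2, 2]
accuracies = crossvalidate_with_selection(samples, labels, chunks, n_keep=1)
```
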

> > I really hate the naming of the classifiers we have now, so a bit of
> > refactoring will be needed to make them sane. If you have any
> > suggestions, please don't hesitate to share.

> > OK -- we have a few new classifiers and sensitivity-based selections.
> > First let me describe the FeatureSelections:

> > SensitivityBasedFeatureSelection -- the basic one, like a 1-step RFE,
> >    i.e. it just removes some features based on the results of some
> >    SensitivityAnalyzer.

> > FeatureSelectionPipeline -- applies a list of FeatureSelection
> >   algorithms in sequence (e.g. first remove the 50% of features that
> >   are 'silent', i.e. least informative, according to ANOVA, and then
> >   do SVM-sensitivity based RFE on the rest)
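
To make the two selections above concrete, here is a small sketch of the
idea in plain Python. It does not use the PyMVPA classes or their
signatures: make_step, run_pipeline and mean_diff_scores are hypothetical
names, the score is a crude stand-in for a real SensitivityAnalyzer, and
labels are assumed to be 0/1. A "step" drops the lowest-scoring fraction of
features once; the pipeline chains steps and reports the survivors as
indices into the original feature set.

```python
# Plain-Python sketch, NOT the PyMVPA API: all names are hypothetical.

def mean_diff_scores(samples, labels):
    """Stand-in sensitivity: |mean over class 1 - mean over class 0|
    for each feature. Assumes 0/1 labels."""
    scores = []
    for j in range(len(samples[0])):
        a = [s[j] for s, l in zip(samples, labels) if l == 0]
        b = [s[j] for s, l in zip(samples, labels) if l == 1]
        scores.append(abs(sum(b) / len(b) - sum(a) / len(a)))
    return scores


def make_step(score_fn, fraction_to_drop):
    """One-step selection: drop the lowest-scoring fraction of features."""
    def step(samples, labels):
        scores = score_fn(samples, labels)
        n_drop = int(len(scores) * fraction_to_drop)
        ranked = sorted(range(len(scores)), key=lambda j: scores[j])
        return sorted(ranked[n_drop:])   # indices relative to the input
    return step


def run_pipeline(steps, samples, labels):
    """Apply steps in order; return indices into the ORIGINAL features."""
    kept = list(range(len(samples[0])))
    for step in steps:
        reduced = [[row[j] for j in kept] for row in samples]
        local = step(reduced, labels)    # indices into the reduced data
        kept = [kept[j] for j in local]  # map back to original indices
    return kept


# Made-up data: feature 0 is most informative, then feature 2.
samples = [[0.0, 0.0, 0.0, 0.0],
           [0.0, 0.2, 0.0, 1.0],
           [4.0, 0.1, 2.0, 0.6],
           [4.0, 0.1, 2.0, 0.6]]
labels = [0, 0, 1, 1]

# The same scorer is reused for both steps only to keep the sketch short;
# different scorers could be passed to make_step instead.
drop_half = make_step(mean_diff_scores, 0.5)
after_one_step = drop_half(samples, labels)
after_pipeline = run_pipeline([drop_half, drop_half], samples, labels)
```
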

> > Now we come to the classifier you wanted:

> > FeatureSelectionClassifier -- given a base classifier and a
> >   FeatureSelection (like one of those above), creates a classifier
> >   which first does feature selection, then trains and predicts using
> >   only the selected features.
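
The wrapping idea can be sketched in a few lines of plain Python. Again,
this is not the PyMVPA implementation or its interface: SelectingClassifier,
NearestMean and keep_best_feature are hypothetical names, and the train() /
predict() method names are an assumption made for the example. The wrapper
remembers which features were selected at training time and applies the
same reduction before every prediction.

```python
# Plain-Python sketch, NOT the PyMVPA API: all names are hypothetical.

class NearestMean:
    """Toy base classifier: predict the label of the closest class mean."""

    def train(self, samples, labels):
        self.means = {}
        for label in set(labels):
            rows = [s for s, l in zip(samples, labels) if l == label]
            self.means[label] = [sum(col) / len(rows) for col in zip(*rows)]

    def predict(self, samples):
        def dist(u, v):
            return sum((a - b) ** 2 for a, b in zip(u, v))
        return [min(self.means, key=lambda l: dist(s, self.means[l]))
                for s in samples]


def keep_best_feature(samples, labels):
    """Stand-in feature selection: keep the single feature whose class
    means differ the most. Assumes 0/1 labels."""
    scores = []
    for j in range(len(samples[0])):
        a = [s[j] for s, l in zip(samples, labels) if l == 0]
        b = [s[j] for s, l in zip(samples, labels) if l == 1]
        scores.append(abs(sum(b) / len(b) - sum(a) / len(a)))
    return [max(range(len(scores)), key=lambda j: scores[j])]


class SelectingClassifier:
    """Wrap a classifier so it trains and predicts on selected features."""

    def __init__(self, base, select):
        self.base = base      # any object with train() and predict()
        self.select = select  # callable: (samples, labels) -> indices
        self.kept = None      # filled in by train()

    def _reduce(self, samples):
        return [[row[j] for j in self.kept] for row in samples]

    def train(self, samples, labels):
        # Selection sees only the training data handed to train().
        self.kept = self.select(samples, labels)
        self.base.train(self._reduce(samples), labels)

    def predict(self, samples):
        return self.base.predict(self._reduce(samples))


# Made-up data: feature 0 carries the signal, feature 1 is large noise.
train_x = [[0.0, 10.0], [0.2, -10.0], [1.0, -10.0], [1.2, 10.0]]
train_y = [0, 0, 1, 1]

clf = SelectingClassifier(NearestMean(), keep_best_feature)
clf.train(train_x, train_y)
predictions = clf.predict([[0.1, 50.0], [1.0, -50.0]])
```

Because the wrapper exposes the same train()/predict() pair as the base
classifier, it can be dropped into a cross-validation loop unchanged, and
the selection is then redone on each training split.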


> > I've done just basic testing, so it might still have bugs. Also, I
> > really want to discuss this with Michael and maybe come up with a
> > better naming convention.  Maybe simply shorten the names of the
> > classes: e.g. every Classifier-derived class gets a Clf suffix, every
> > FeatureSelection gets FS, etc.

> > > I'm working on an analysis right now that could easily be generalized
> > > into an example once this classifier is in there.  Let me know when
> > > you want me to try it out.
> > please do ;-)




> > --
> > Yaroslav Halchenko
> > Research Assistant, Psychology Department, Rutgers-Newark
> > Student  Ph.D. @ CS Dept. NJIT
> > Office: (973) 353-5440x263 | FWD: 82823 | Fax: (973) 353-1171
> >         101 Warren Str, Smith Hall, Rm 4-105, Newark NJ 07102
> > WWW:     http://www.linkedin.com/in/yarik







