Wrapper Feature Set Evaluation

Categories: Feature selection

Exposed methods:

Feature selection
Input:
Output:
Input format:	Weka's ARFF format
Output format:	Weka's ARFF format
User-specified parameters:	Minimum support
Reporting information:	Frequent free trees (SMARTs) with occurrence maps, border elements

Description:

The wrapper approach depends on the classifier that should be used with the resulting attribute subset.
Wrapper methods evaluate subsets by running the classifier on the training data, using only the attributes of
the subset. The better the classifier performs, usually based on cross-validation, the better is the selected
attribute set. One normally uses the classification-accuracy as the score for the subset. Though this technique
has a long history in pattern recognition, [JOH94] introduced the term wrapper that is now commonly used.

Background (publication date, popularity/level of familiarity, rationale of approach, further comments)
Standard feature selection method. Leads to superior results compared to Filter
methods. Slow. Resulting feature set is specific to the QSAR model that is used by the
wrapper.

Class-blind/class-sensitive feature selection
Class-sensitive feature selection

Filter/wrapper/hybrid approach
Wrapper

Type of Descriptor:

Interfaces:

Priority: Low

Development status:

Homepage:

Dependencies:
External components: WEKA

Technical details

Data: No

Software: Yes

Programming language(s): Java

Operating system(s): Linux, Win, Mac OS

Input format: Weka's ARFF format

Output format: Weka's ARFF format

License: GPL

References

References:
[JOH94] George H. John, Ron Kohavi and Karl Pfleger, Irrelevant Features and the Subset Selection Problem. ICML, 1994

Sections

Wrapper Feature Set Evaluation

Technical details

References

Document Actions