microsoftml.count_select: Feature selection based on counts
Usage
microsoftml.count_select(cols: [list, str], count: int = 1, **kargs)
Description
Selects the features for which the count of non-default values is greater than or equal to a threshold.
Details
When using the count mode in feature selection transform, a feature is
selected if the number of examples have at least the specified count
examples of non-default values in the feature. The count mode feature
selection transform is very useful when applied together with a categorical
hash transform (see also, categorical_hash
. The count feature
selection can remove those features generated by hash transform that have no
data in the examples.
Arguments
cols
Specifies character string or list of the names of the variables to select.
count
The threshold for count based feature selection. A feature is
selected if and only if at least count
examples have non-default
value in the feature. The default value is 1.
kargs
Additional arguments sent to compute engine.
Returns
An object defining the transform.