sklearn-学习：Dimensionality reduction(降维)-（feature selection）特征选择

最新推荐文章于 2023-12-30 14:01:49 发布

VIP文章 marui1982

最新推荐文章于 2023-12-30 14:01:49 发布

阅读量6.4k

点赞数

本文链接：https://blog.csdn.net/newmarui/article/details/52119869

版权

本文主要对对应文档的内容进行简化（以代码示例为主）及汉化

对应文档位置：http://scikit-learn.org/stable/modules/feature_selection.html#feature-selection

1.13. Feature selection

feature selection 作用：增加分类器的score ，提升分类器在高纬数据集上的表现

1.13.1. Removing features with low variance

from sklearn.feature_selection import VarianceThreshold
X = [[0, 0, 1], [0, 1, 0], [1, 0, 0], [0, 1, 1], [0, 1, 0], [0, 1, 1]]
sel = VarianceThreshold(threshold=(.8 * (1 - .8)))
sel.fit_transform(X)
array([[0, 1],
       [1, 0],
       [0, 0],
       [1, 1],
       [1, 0],
       [1, 1]])

说明：

VarianceThreshold

默认值：去除差异值为0（或者为相同值的变量）

VarianceThreshold(threshold=(.8 * (1 - .8))) ，例子中假设为bool型变量（取值为0,1），其参数threshold的值为方差值；                            对于伯努利分布，其方差为p(1-p)=0.8*(1-0.8)

1.13.2. Univariate feature selection(单变量特征选择)

最低0.47元/天解锁文章

marui1982

关注

0
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
sklearn-学习：Dimensionality reduction(降维)-（feature selection）特征选择

本文主要对对应文档的内容进行简化（以代码示例为主）及汉化对应文档位置：http://scikit-learn.org/stable/modules/feature_selection.html#feature-selection1.13. Feature selectionfeature selection 作用：增加分类器的score ，提升分类器在高纬数据集上的表现
复制链接

扫一扫