from sklearn.datasets import make_classification生成随机类别的数据

小李飞刀李寻欢

于 2020-05-31 17:44:24 发布

阅读量1.3k

点赞数 1

分类专栏： python 文章标签：数据生成

本文链接：https://blog.csdn.net/SPESEG/article/details/106458308

版权

python 专栏收录该内容

209 篇文章 69 订阅 ¥9.90 ¥99.00

订阅专栏

超级会员免费看

本文介绍如何利用sklearn.datasets的make_classification函数生成随机分类数据。通过查看帮助文档并理解参数，我们可以创建具有指定特征和类别的训练数据。在深度学习中，此类数据对于视频推荐和语音、图像、视频处理的QQ交流群有重要价值。

摘要由CSDN通过智能技术生成

先看help结果，及返回的结果，就是训练用的数据

make_classification(n_samples=100, n_features=20, n_informative=2, 
n_redundant=2, n_repeated=0, n_classes=2, n_clusters_per_class=2, weights=None, 
flip_y=0.01, class_sep=1.0, hypercube=True, shift=0.0, scale=1.0, shuffle=True, random_state=None)


    X : array of shape [n_samples, n_features]
        The generated samples.
    
    y : array of shape [n_samples]
        The integer labels for class membership of each sample.

y是0~n_classes-1单个数字的集合，分别对应X的sample

X几个特征就有几列，知道三个参数就可以干活了。

>>> X, y = make_classification(n_samples=100, n_features=5, n_clas

了解本专栏

超级会员免费看

小李飞刀李寻欢

关注

1
点赞
踩
1

收藏

觉得还不错? 一键收藏
打赏
5
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录