make_classification

Latest recommended article published 2024-06-16 22:42:56

学渣研究僧3


Article link: https://blog.csdn.net/weixin_44539090/article/details/115187116


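This post shows how to use the ADASYN (Adaptive Synthetic Sampling) algorithm in Python to handle imbalanced data in classification tasks. It generates an imbalanced dataset with `make_classification`, then uses ADASYN to oversample the minority class so that the class proportions are roughly balanced before a model is trained.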

Summary generated automatically by CSDN.

from collections import Counter
from sklearn.datasets import make_classification
from imblearn.over_sampling import ADASYN

# Build an imbalanced two-class dataset: class 0 gets about 10% of the samples.
X, y = make_classification(n_classes=2, class_sep=2,
                           weights=[0.1, 0.9],
                           n_informative=3,
                           n_redundant=1,
                           flip_y=0,
                           n_features=20,
                           n_clusters_per_class=1,
                           n_samples=1000,
                           random_state=10)

print(X)
print(y)
print('Original dataset shape %s' % Counter(y))

# Oversample the minority class with ADASYN; the result is roughly balanced.
ada = ADASYN(random_state=42)
X_res, y_res = ada.fit_resample(X, y)
print('Resampled dataset shape %s' % Counter(y_res))

make_classification parameter notes
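As a rough illustration of what the arguments in the call above control, the sketch below regenerates the dataset with different `weights` and `flip_y` values and prints the resulting shape and class counts. The helper name `class_counts` and the specific values tried are assumptions made for this example; they are not part of the original post.

```python
from collections import Counter

from sklearn.datasets import make_classification


def class_counts(weights, flip_y=0.0, n_samples=1000, seed=10):
    """Hypothetical helper: return (X.shape, per-class sample counts)."""
    X, y = make_classification(
        n_samples=n_samples,       # total number of rows
        n_features=20,             # total number of columns
        n_informative=3,           # features that actually carry class signal
        n_redundant=1,             # linear combinations of informative features
        n_classes=2,
        n_clusters_per_class=1,
        weights=weights,           # approximate proportion of each class
        class_sep=2,               # larger values push the classes further apart
        flip_y=flip_y,             # fraction of labels assigned at random (noise)
        random_state=seed,
    )
    return X.shape, Counter(y.tolist())


# Compare the imbalanced setting from the post with a balanced one
# and with a noisy-label variant.
for weights, flip_y in [([0.1, 0.9], 0.0), ([0.5, 0.5], 0.0), ([0.1, 0.9], 0.1)]:
    shape, counts = class_counts(weights, flip_y)
    print(weights, flip_y, shape, dict(counts))
```

The remaining 16 of the 20 features are neither informative nor redundant, so they are filled with random noise. With `flip_y` above zero, some labels are reassigned at random, so the class counts can drift away from the requested `weights`.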