sklearn分类模型汇总（二）

最新推荐文章于 2024-07-08 16:41:31 发布

煌澄艾

最新推荐文章于 2024-07-08 16:41:31 发布

阅读量552

点赞数 22

分类专栏：数据分析文章标签： sklearn 分类 python

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/qq_42266272/article/details/136406924

版权

数据分析专栏收录该内容

4 篇文章 0 订阅

订阅专栏

sklearn分类模型汇总（二）

文章目录

sklearn分类模型汇总（二）

Naive Bayes

naive_bayes.CategoricalNB函数参数：

alpha：float, default=1.0，加性 (Laplace/Lidstone) 平滑参数（设置 alpha=0 且force_alpha=True，不进行平滑）。
force_alpha：bool, default=True，如果为 False 并且 alpha 小于 1e-10，则将 alpha 设置为 1e-10。如果为 True，则 alpha 将保持不变。如果 alpha 太接近 0，这可能会导致数值错误。
fit_prior：bool, default=True，是否学习类先验概率。如果为 false，则将使用统一先验。
class_prior：array-like of shape (n_classes,), default=None，类别的先验概率。如果指定，则不会根据数据调整先验。
min_categories：int or array-like of shape (n_features,),每个特征的最少类别数。

naive_bayes.GaussianNB函数参数：

priors：array-like of shape (n_classes,), default=None，类别的先验概率。如果指定，则不会根据数据调整先验。
var_smoothing：float, default=1e-9，为了计算稳定性而添加到方差中的所有特征的最大方差的部分。

k-nearest neighbors algorithm

neighbors.KNeighborsClassifier函数参数：

n_neighbors：int, default=5。
weights：{‘uniform’, ‘distance’}, callable or None, default=’uniform’。
weights：{‘uniform’, ‘distance’}, callable or None, default=’uniform’。
leaf_size：int, default=30，叶子大小传递给 BallTree 或 KDTree。这会影响构建和查询的速度以及存储树所需的内存。最佳值取决于问题的性质。
p：float, default=2，闵可夫斯基度量的功效参数。当 p = 1 时，这相当于使用 manhattan_distance (l1)，而 p = 2 时则使用 euclidean_distance (l2)。对于任意 p，使用 minkowski_distance (l_p)。该参数预计为正。
metric：str or callable,default=’minkowski’，用于距离计算的度量。默认值为“minkowski”，当 p = 2 时，这会产生标准欧几里得距离。
metric_params：dict, default=None，度量函数的附加关键字参数。
n_jobs：int, default=None。
neighbors.RadiusNeighborsClassifier函数参数：
radius：float, default=1.0，radius_neighbors 查询默认使用的参数空间范围。
weights：{‘uniform’, ‘distance’}, callable or None, default=’uniform’。
weights：{‘uniform’, ‘distance’}, callable or None, default=’uniform’。
leaf_size：int, default=30，叶子大小传递给 BallTree 或 KDTree。这会影响构建和查询的速度以及存储树所需的内存。最佳值取决于问题的性质。
p：float, default=2，闵可夫斯基度量的功效参数。当 p = 1 时，这相当于使用 manhattan_distance (l1)，而 p = 2 时则使用 euclidean_distance (l2)。对于任意 p，使用 minkowski_distance (l_p)。该参数预计为正。
metric：str or callable,default=’minkowski’，用于距离计算的度量。默认值为“minkowski”，当 p = 2 时，这会产生标准欧几里得距离。
outlier_label：{manual label, ‘most_frequent’}, default=None，离群样本的标签（给定半径内没有邻居的样本）。
metric_params：dict, default=None，度量函数的附加关键字参数。
n_jobs：int, default=None。

Linear Models

linear_model.LogisticRegression函数参数：

penalty：{‘l1’, ‘l2’, ‘elasticnet’, None}, default=’l2’。
dual：bool, default=False，对偶（约束）或原始公式。对偶公式仅适用于使用liblinear求解器的 l2 惩罚。当 n_samples > n_features 时，首选 Dual=False。
tol：float, default=1e-4。
C：float, default=1.0，正则化强度的倒数；必须是正浮动。与支持向量机一样，较小的值指定更强的正则化。
fit_intercept：bool, default=True，指定是否应将常数（也称为偏差或截距）添加到决策函数中。
intercept_scaling：float, default=1，仅当使用求解器“liblinear”且 self.fit_intercept 设置为 True 时才有用。在这种情况下，x 变为 [x, self.intercept_scaling]，即，将常数值等于截距缩放的“合成”特征附加到实例向量。截距变为intercept_scaling * synthetic_feature_weight。
class_weight：dict or ‘balanced’, default=None。
random_state：int, RandomState instance, default=None。
solver：{‘lbfgs’, ‘liblinear’, ‘newton-cg’, ‘newton-cholesky’, ‘sag’, ‘saga’}, default=’lbfgs’，在优化问题中使用的算法。

linear_model.PassiveAggressiveClassifier函数参数：

C：float, default=1.0，正则化强度的倒数；必须是正浮动。与支持向量机一样，较小的值指定更强的正则化。
fit_intercept：bool, default=True，指定是否应将常数（也称为偏差或截距）添加到决策函数中。
max_iter：int, default=1000。
tol：float, default=1e-3。
early_stopping：bool, default=False，当验证分数没有提高时是否使用提前停止来终止训练。如果设置为 True，它将自动留出训练数据的分层部分作为验证，并在 n_iter_no_change 连续 epoch 的验证分数未提高至少 tol 时终止训练。
validation_fraction：float, default=0.1，留作早期停止验证集的训练数据比例。必须介于 0 和 1 之间。仅在 Early_stopping 为 True 时使用。
n_iter_no_change：int, default=5，在提前停止之前等待没有改进的迭代次数。
shuffle：bool, default=True
verbose：int, default=0
loss：str, default=”hinge”，要使用的损失函数：hinge：相当于参考论文中的PA-I。 squared_hinge：相当于参考论文中的PA-II。
n_jobs：int or None, default=None
random_state：int, RandomState instance, default=None
warm_start：bool, default=False，当设置为 True 时，重用先前调用的解来拟合作为初始化，否则，只需擦除先前的解。
average：bool or int, default=False，设置为 True 时，计算平均 SGD 权重并将结果存储在 coef_ 属性中。如果设置为大于 1 的整数，则一旦看到的样本总数达到平均值，就会开始求平均值。因此，average=10 将在查看 10 个样本后开始求平均值。

linear_model.Perceptron函数参数：

penalty：{‘l1’, ‘l2’, ‘elasticnet’, None}, default=’l2’。
alpha：float, default=0.0001，如果使用正则化，则与正则化项相乘的常数。
l1_ratio：float, default=0.15，Elastic Net混合参数，0 <= l1_ratio <= 1。l1_ratio=0对应L2惩罚，l1_ratio=1对应L1。仅当penalty='elasticnet’时使用。
fit_intercept：bool, default=True，指定是否应将常数（也称为偏差或截距）添加到决策函数中。
max_iter：int, default=1000。
tol：float, default=1e-3。
shuffle：bool, default=True
verbose：int, default=0
eta0：float, default=1，更新所乘的常数。
n_jobs：int or None, default=None
random_state：int, RandomState instance, default=None
early_stopping：bool, default=False，当验证分数没有提高时是否使用提前停止来终止训练。如果设置为 True，它将自动留出训练数据的分层部分作为验证，并在 n_iter_no_change 连续 epoch 的验证分数未提高至少 tol 时终止训练。
validation_fraction：float, default=0.1，留作早期停止验证集的训练数据比例。必须介于 0 和 1 之间。仅在 Early_stopping 为 True 时使用。
n_iter_no_change：int, default=5，在提前停止之前等待没有改进的迭代次数。
class_weight：dict, {class_label: weight} or “balanced” or None, default=None
warm_start：bool, default=False，当设置为 True 时，重用先前调用的解来拟合作为初始化，否则，只需擦除先前的解。

linear_model.RidgeClassifier函数参数：

alpha：float, default=1.0，如果使用正则化，则与正则化项相乘的常数。
fit_intercept：bool, default=True，是否计算该模型的截距。如果设置为 false，则在计算中不会使用截距（例如，数据预计已居中）。
copy_X：bool, default=True，如果为 True，则 X 将被复制；否则，它可能会被覆盖。
max_iter：int, default=None
tol：float, default=1e-4，解的精度 (coef_) 由 tol 确定，它为每个求解器指定不同的收敛标准。
class_weight：dict, {class_label: weight} or “balanced” or None, default=None
solver：{‘lbfgs’, ‘liblinear’, ‘newton-cg’, ‘newton-cholesky’, ‘sag’, ‘saga’}, default=’lbfgs’，在优化问题中使用的算法。
positive：bool, default=False，设置为 True 时，强制系数为正。在这种情况下，仅支持“lbfgs”求解器。
random_state：int, RandomState instance, default=None

linear_model.SGDClassifier函数参数：

loss：{‘hinge’, ‘log_loss’, ‘modified_huber’, ‘squared_hinge’, ‘perceptron’, ‘squared_error’, ‘huber’, ‘epsilon_insensitive’, ‘squared_epsilon_insensitive’}, default=’hinge’
penalty：{‘l2’, ‘l1’, ‘elasticnet’, None}, default=’l2’
alpha：float, default=0.0001，如果使用正则化，则与正则化项相乘的常数。
l1_ratio：float, default=0.15，Elastic Net混合参数，0 <= l1_ratio <= 1。l1_ratio=0对应L2惩罚，l1_ratio=1对应L1。仅当penalty='elasticnet’时使用。
fit_intercept：bool, default=True，是否计算该模型的截距。如果设置为 false，则在计算中不会使用截距（例如，数据预计已居中）。
max_iter：int, default=1000
tol：float, default=1e-3，停止标准。如果不是 None，则当 n_iter_no_change 连续 epoch 的 (loss > best_loss - tol) 时，训练将停止。根据 Early_stopping 参数，根据训练损失或验证损失检查收敛性。值必须在 [0.0, inf) 范围内。
shuffle：bool, default=True
verbose：int, default=0
epsilon：float, default=0.1，对 epsilon 不敏感的损失函数中的 Epsilon；仅当损失为“huber”、“epsilon_insensitive”或“squared_epsilon_insensitive”时。对于“huber”，确定阈值，在该阈值下，获得准确预测变得不那么重要。对于 epsilon 不敏感，如果当前预测和正确标签之间的任何差异小于此阈值，则忽略它们。值必须在 [0.0, inf) 范围内。
n_jobs：int, default=None
random_state：int, RandomState instance, default=None
learning_rate：str, {‘constant’，‘optimal’，‘invscaling’，‘adaptive’}，default=’optimal’。
eta0：float, default=0.0，“恒定”、“逆缩放”或“自适应”计划的初始学习率。默认值为 0.0，因为默认计划“最佳”不使用 eta0。值必须在 [0.0, inf) 范围内。
power_t：float, default=0.5，逆缩放学习率的指数。值必须在 (-inf, inf) 范围内。
early_stopping：bool, default=False，当验证分数没有提高时是否使用提前停止来终止训练。如果设置为 True，它将自动留出训练数据的分层部分作为验证，并在 n_iter_no_change 连续 epoch 的验证分数未提高至少 tol 时终止训练。
validation_fraction：float, default=0.1，留作早期停止验证集的训练数据比例。必须介于 0 和 1 之间。仅在 Early_stopping 为 True 时使用。
n_iter_no_change：int, default=5，在提前停止之前等待没有改进的迭代次数。
class_weight：dict, {class_label: weight} or “balanced” or None, default=None
warm_start：bool, default=False，当设置为 True 时，重用先前调用的解来拟合作为初始化，否则，只需擦除先前的解。
average：bool or int, default=False，当设置为 True 时，计算所有更新的平均 SGD 权重并将结果存储在 coef_ 属性中。如果设置为大于 1 的整数，则一旦看到的样本总数达到平均值，就会开始求平均值。因此，average=10 将在查看 10 个样本后开始求平均值。整数值必须在 [1, n_samples] 范围内。

linear_model.SGDOneClassSVM函数参数：

nu：float, default=0.5，One Class SVM 的 nu 参数：训练误差分数的上限和支持向量分数的下限。应在区间 (0, 1] 内。默认为 0.5。
fit_intercept：bool, default=True，是否计算该模型的截距。如果设置为 false，则在计算中不会使用截距（例如，数据预计已居中）。
max_iter：int, default=1000
tol：float, default=1e-3，停止标准。如果不是 None，则当 n_iter_no_change 连续 epoch 的 (loss > best_loss - tol) 时，训练将停止。根据 Early_stopping 参数，根据训练损失或验证损失检查收敛性。值必须在 [0.0, inf) 范围内。
shuffle：bool, default=True
verbose：int, default=0
random_state：int, RandomState instance, default=None
learning_rate：str, {‘constant’，‘optimal’，‘invscaling’，‘adaptive’}，default=’optimal’。
eta0：float, default=0.0，“恒定”、“逆缩放”或“自适应”计划的初始学习率。默认值为 0.0，因为默认计划“最佳”不使用 eta0。值必须在 [0.0, inf) 范围内。
power_t：float, default=0.5，逆缩放学习率的指数。值必须在 (-inf, inf) 范围内。
warm_start：bool, default=False，当设置为 True 时，重用先前调用的解来拟合作为初始化，否则，只需擦除先前的解。
average：bool or int, default=False，当设置为 True 时，计算所有更新的平均 SGD 权重并将结果存储在 coef_ 属性中。如果设置为大于 1 的整数，则一旦看到的样本总数达到平均值，就会开始求平均值。因此，average=10 将在查看 10 个样本后开始求平均值。整数值必须在 [1, n_samples] 范围内。

关注

22
点赞
踩
10

收藏

觉得还不错? 一键收藏
打赏
1
评论
sklearn分类模型汇总（二）

sklearn分类模型汇总（二）文章目录sklearn分类模型汇总（二）Naive Bayesk-nearest neighbors algorithmLinear ModelsNaive Bayesnaive_bayes.CategoricalNB函数参数：alpha：float, default=1.0，加性 (Laplace/Lidstone) 平滑参数（设置 alpha=0 且force_alpha=True，不进行平滑）。force_alpha：bool, default=True，如
复制链接

扫一扫

专栏目录

煌澄艾 CSDN认证博客专家 CSDN认证企业博客

码龄6年

38: 原创

24万+: 周排名

2万+: 总排名

4万+: 访问

: 等级

1167: 积分

624: 粉丝

738: 获赞

23: 评论

628: 收藏

私信

关注

热门文章

分类专栏

论文 27篇
数据分析 4篇

最新评论

【论文阅读】Faster Neural Networks Straight from JPEG
笙丶: 你好，我最近也在做和这篇文章有关的研究，在工作复现时存在部分问题，可以交流一下么
【论文阅读】Activity Recognition using Cell Phone Accelerometers
CSDN-Ada助手: 你好，CSDN 开始提供 #论文阅读# 的列表服务了。请看：https://blog.csdn.net/nav/advanced-technology/paper-reading?utm_source=csdn_ai_ada_blog_reply 。如果你有更多需求，请来这里 https://gitcode.net/csdn/csdn-tags/-/issues/34?utm_source=csdn_ai_ada_blog_reply 给我们提。
【论文阅读】iTransformer: Inverted Transformers Are Effective for Time Series Forecasting
ruisiqi7: 具体来说，由于序列的时间顺序隐式存储在前馈网络的神经元排列中，因此这里不再需要 vanilla Transformer 中的位置编码嵌入信息。这句话不懂，那transformer中都有全连接层，是不是其他transformer都不需要位置编码？
【论文阅读】xLSTM: Extended Long Short-Term Memory
CSDN-Ada助手: 你好，CSDN 开始提供 #论文阅读# 的列表服务了。请看：https://blog.csdn.net/nav/advanced-technology/paper-reading?utm_source=csdn_ai_ada_blog_reply 。如果你有更多需求，请来这里 https://gitcode.net/csdn/csdn-tags/-/issues/34?utm_source=csdn_ai_ada_blog_reply 给我们提。
【论文阅读】TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis
CSDN-Ada助手: 你好，CSDN 开始提供 #论文阅读# 的列表服务了。请看：https://blog.csdn.net/nav/advanced-technology/paper-reading?utm_source=csdn_ai_ada_blog_reply 。如果你有更多需求，请来这里 https://gitcode.net/csdn/csdn-tags/-/issues/34?utm_source=csdn_ai_ada_blog_reply 给我们提。

大家在看

最新文章

目录

评论 1

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

煌澄艾 你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。