过拟合解决方法python,如何解决Python sklearn随机森林中的过拟合问题？

最新推荐文章于 2024-01-31 16:59:12 发布

心月同光

最新推荐文章于 2024-01-31 16:59:12 发布

阅读量605

点赞数 1

文章标签：过拟合解决方法python

博客讨论了使用Python sklearn的RandomForestClassifier遇到的过拟合问题。通过交叉验证显示训练集准确率高而测试集准确率低。作者提到可能的解决方案包括增加`n_estimators`数量，减少`max_features`，限制`max_depth`，以及设置`min_samples_leaf`大于1。建议采用科学的方法调整参数，如使用训练集、开发集和测试集，并逐一更改参数或利用gridsearch进行参数搜索。

摘要由CSDN通过智能技术生成

I am using RandomForestClassifier implemented in python sklearn package to build a binary classification model. The below is the results of cross validations:

Fold 1 : Train: 164 Test: 40

Train Accuracy: 0.914634146341

Test Accuracy: 0.55

Fold 2 : Train: 163 Test: 41

Train Accuracy: 0.871165644172

Test Accuracy: 0.707317073171

Fold 3 : Train: 163 Test: 41

Train Accuracy: 0.889570552147

Test Accuracy: 0.585365853659

Fold 4 : Train: 163 Test: 41

Train Accuracy: 0.871165644172

Test Accuracy: 0.756097560976

Fold 5 : Train: 163 Test: 41

Train Accuracy: 0.883435582822

Test Accuracy: 0.512195121951

I am using "Price" feature to predict "quality" which is a ordinal value. In each cross validation, there are 163 training examples and 41 test examples.

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

心月同光

关注关注

1
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
过拟合解决方法python,如何解决Python sklearn随机森林中的过拟合问题？

I am using RandomForestClassifier implemented in python sklearn package to build a binary classification model. The below is the results of cross validations:Fold 1 : Train: 164 Test: 40Train Accurac...
复制链接

扫一扫