Algorithm: Random Forest, ensemble model

随机森林作为集成模型的一种,在分类问题中表现出高效性,尤其适用于图像识别等黑箱问题。在工程领域,由于其可解释性,我们能快速定位问题。随机森林通过bagging策略构建,使用多个决策树进行最终预测,适用于回归问题。为了防止过拟合,关键在于训练样本的随机化和特征选择的随机性。超参数如n_estimators、criterion、max_depth等影响模型性能。
摘要由CSDN通过智能技术生成

Ensemble Model

For clasisfication problem the ensemble model is very effective. Such as  the situation of the Image recognition via deep learning.(black box)

For a grade system, we use the GBDT or XGBoost etc.

In engineering field, the Interpretable is very important,since we could determine the problem once we meet an issue.

 

How to build an ensemble model? Bagging and Boosting

Bagging: Random forest

Boosting: GBDT, XGBoost

We calculate the average value from all of the predictions from the models

We use the variance / standard deviation to evaluate the stability of the model

 

from the example above,we know that the model will become more stable

 

Random Forest

Bagging is a framwork for ensemble model

The random forest using multiple decision trees for the final predictions

It also can be used for regression problem(mean value)

Build the random forest

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值