袋装决策树_袋装树是每个数据科学家需要的机器学习算法

最新推荐文章于 2022-11-16 20:52:06 发布

VIP文章张_伟_杰

最新推荐文章于 2022-11-16 20:52:06 发布

阅读量4.5k

点赞数 4

文章标签：算法机器学习 python 人工智能 java

原文链接：https://towardsdatascience.com/bagged-trees-a-machine-learning-algorithm-every-data-scientist-needs-d8417ec2e0d9

版权

袋装决策树

袋装树木介绍 (Introduction to Bagged Trees)

Without diving into the specifics just yet, it’s important that you have some foundation understanding of decision trees.

尚未深入研究细节，对决策树有一定基础了解就很重要。

From the evaluation approach of each algorithm to the algorithms themselves, there are many similarities.

从每种算法的评估方法到算法本身，都有很多相似之处。

If you aren’t already familiar with decision trees I’d recommend a quick refresher here.

如果您还不熟悉决策树，我建议在这里快速复习。

With that said, get ready to become a bagged tree expert! Bagged trees are famous for improving the predictive capability of a single decision tree and an incredibly useful algorithm for your machine learning tool belt.

话虽如此，准备成为袋装树专家！袋装树以提高单个决策树的预测能力和对您的机器学习工具带非常有用的算法而闻名。

什么是袋装树？什么使它们如此有效？ (What are Bagged Trees & What Makes Them So Effective?)

为什么要使用袋装树木 (Why use bagged trees)

The main idea between bagged trees is that rather than depending on a single decision tree, you are depending on many many decision trees, which allows you to leverage the insight of many models.

套袋树之间的主要思想是，您不依赖于单个决策树，而是依赖于许多决策树，这使您可以利用许多模型的洞察力。

偏差偏差的权衡 (Bias-variance trade-off)

When considering the performance of a model, we often consider what’s known as the bias-variance trade-off of our output. Variance has to do with how our model handles small errors and how much that potentially throws off our model and bias results in under-fitting. The model effectively makes incorrect assumptions around the relationships between variables.

在考虑模型的性能时，我们经常考虑所谓的输出偏差-偏差权衡。方差与我们的模型如何处理小错误以及与模型的潜在偏离和导致拟合不足的偏差有关。该模型有效地围绕变量之间的关系做出了错误的假设。

You could say the issue with variation is while your model may be directionally correct, it’s not very accurate, while if your model is very biased, while there could be low variation; it could be directionally incorrect entirely.

您可以说变化的问题在于，模型可能在方向上是正确的，但不是很准确；而如果模型有很大偏差，那么变化可能就很小。它可能完全是方向错误的。

The biggest issue with a decision tree, in general, is that they have high variance. The issue this presents is that any minor change to the data can result in major changes to the model and future predictions.

通常，决策树的最大问题是它们的差异很大。这带来的问题是，数据的任何细微变化都可能导致模型和未来预测的重大变化。

The reason this comes into play here is that one of the ben

最低0.47元/天解锁文章

张_伟_杰

关注

4
点赞
踩
12

收藏

觉得还不错? 一键收藏
0
评论
袋装决策树_袋装树是每个数据科学家需要的机器学习算法

袋装决策树袋装树木介绍 (Introduction to Bagged Trees)Without diving into the specifics just yet, it’s important that you have some foundation understanding of decision trees. 尚未深入研究细节，对决策树有一定基础了解就很重要。 From th...
复制链接

扫一扫