xgboost算法_Hessian矩阵在XGBoost算法的应用小结

最新推荐文章于 2021-11-08 20:26:16 发布

weixin_39732506

最新推荐文章于 2021-11-08 20:26:16 发布

阅读量285

点赞数

文章标签： xgboost算法 xgboost算法原理

前言

Hessian矩阵最常见的应用是牛顿法最优化算法，其主要思想是搜寻一阶导数为0的函数极值点，本文深入浅出的总结了Hessian矩阵在XGboost算法中的两种应用，即权重分位点算法和样本权重和算法。

Hessian矩阵的定义
样本权重和算法
权重分位点算法
总结

1.Hessian矩阵的定义

Hessian矩阵(Hessian matrix或Hessian)是一个自变量为向量的实值函数的二阶导偏导数组成的方块矩阵，此实值函数如下：

自变量为向量：

因此，Hessian矩阵H(f)，定义为：

2. 最小叶子节点样本权重和算法

官方文档对XGBoost算法参数最小叶子节点样本权重和 (min_child_weitht) 的定义：

minimum sum of instance weight (hessian) needed in a child. If the tree partition step results in a leaf node with the sum of instance weight less than min_child_weight, then the building process will give up further partitioning. In linear regression mode, this simply corresponds to minimum number of instances needed to be in each node. The larger, the more conservative the algorithm will be.

翻译：min_child_weight定义为最小叶子节点样本权重(hessian)和，如果树分裂步骤产生的叶子节点上所有样本权重和小于min_child_weight，就停止分裂。在线性回归的模型中，最小样本个数反映了最小叶子节点样本权重和。min_child_weight越大，模型越保守，即降低了模型的复杂度，避免过拟合。

下面讨论树模型和线性模型用hessian表示节点样本权重的含义。

还记得hessian矩阵的定义吗? hessian是函数值对变量的二阶导，xgboost算法中用损失函数对预测值的二阶导表示hessian，如下：