python aic准则_如何在Python中为线性回归模型计算AIC?

在Python中,为了比较线性模型的复杂性,通常需要计算AIC(Akaike Information Criterion)。使用sklearn库的LinearRegression会遇到错误,因为该库不支持统计推断。解决方法是改用statsmodels库的OLS(Ordinary Least Squares)模型,它包含了AIC属性。要确保模型包含截距项,可以使用add_constant函数添加单位向量到X矩阵。
摘要由CSDN通过智能技术生成

I want to compute AIC for linear models to compare their complexity. I did it as follows:

regr = linear_model.LinearRegression()

regr.fit(X, y)

aic_intercept_slope = aic(y, regr.coef_[0] * X.as_matrix() + regr.intercept_, k=1)

def aic(y, y_pred, k):

resid = y - y_pred.ravel()

sse = sum(resid ** 2)

AIC = 2*k - 2*np.log(sse)

return AIC

But I receive a divide by zero encountered in log error.

解决方案

sklearn's LinearRegression is good for prediction but pretty barebones as you've discovered. (It's often said that sklearn stays away from all things statistical inference.)

statsmodels.regression.linear_model.OLS has a property attribute AIC and a number of other pre-canned attributes.

However, note that you'll need to manually add a unit vector to your X m

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值