scikit-learn学习：elastic net

最新推荐文章于 2024-05-20 18:02:54 发布

乱世流星01

最新推荐文章于 2024-05-20 18:02:54 发布

阅读量7.8k

点赞数

分类专栏：数据挖掘，机器学习文章标签： python 数据挖掘机器学习 elastic net

本文链接：https://blog.csdn.net/u014662865/article/details/55251790

版权

数据挖掘，机器学习专栏收录该内容

10 篇文章 2 订阅

订阅专栏

elastic net是结合了lasso和ridge regression的模型，其计算公式如下：

根据官网介绍：elastic net在具有多个特征，并且特征之间具有一定关联的数据中比较有用。

以下为训练误差和测试误差程序：

import numpy as np
from sklearn import linear_model

###############################################################################
# Generate sample data
n_samples_train, n_samples_test, n_features = 75, 150, 500
np.random.seed(0)
coef = np.random.randn(n_features)
coef[50:] = 0.0  # only the top 10 features are impacting the model
X = np.random.randn(n_samples_train + n_samples_test, n_features)
y = np.dot(X, coef)

# Split train and test data
X_train, X_test = X[:n_samples_train], X[n_samples_train:]
y_train, y_test = y[:n_samples_train], y[n_samples_train:]

###############################################################################
# Compute train and test errors
alphas = np.logspace(-5, 1, 60)
enet = linear_model.ElasticNet(l1_ratio=0.7)
train_errors = list()
test_errors = list()
for alpha in alphas:
    enet.set_params(alpha=alpha)
    enet.fit(X_train, y_train)
    train_errors.append(enet.score(X_train, y_train))
    test_errors.append(enet.score(X_test, y_test))

i_alpha_optim = np.argmax(test_errors)
alpha_optim = alphas[i_alpha_optim]
print("Optimal regularization parameter : %s" % alpha_optim)

# Estimate the coef_ on full data with optimal regularization parameter
enet.set_params(alpha=alpha_optim)
coef_ = enet.fit(X, y).coef_

###############################################################################
# Plot results functions

import matplotlib.pyplot as plt
plt.subplot(2, 1, 1)
plt.semilogx(alphas, train_errors, label='Train')
plt.semilogx(alphas, test_errors, label='Test')
plt.vlines(alpha_optim, plt.ylim()[0], np.max(test_errors), color='k',
           linewidth=3, label='Optimum on test')
plt.legend(loc='lower left')
plt.ylim([0, 1.2])
plt.xlabel('Regularization parameter')
plt.ylabel('Performance')

# Show estimated coef_ vs true coef
plt.subplot(2, 1, 2)
plt.plot(coef, label='True coef')
plt.plot(coef_, label='Estimated coef')
plt.legend()
plt.subplots_adjust(0.09, 0.04, 0.94, 0.94, 0.26, 0.26)
plt.show()

实验结果：

Optimal regularization parameter : 0.000335292414925

elastic net的大部分函数也会与之前的大体相似，所以这里仅仅介绍一些比较经常用的到的或者特殊的参数或函数：

参数：

l1_ratio:在0到1之间，代表在l1惩罚和l2惩罚之间，如果l1_ratio=1，则为lasso，是调节模型性能的一个重要指标。

eps:Length of the path. eps=1e-3 means that alpha_min / alpha_max = 1e-3

n_alphas:正则项alpha的个数

alphas：alpha值的列表

返回值：

alphas：返回模型中的alphas值。

coefs：返回模型系数。shape=（n_feature,n_alphas）

函数：

score（X,y,sample_weight）:

评价模型性能的标准，值越接近1，模型效果越好。

如有错误欢迎批评指正。

乱世流星01

关注

0
点赞
踩
6

收藏

觉得还不错? 一键收藏
1
评论
scikit-learn学习：elastic net

elastic net是结合了lasso和ridge regression的模型，其计算公式如下：根据官网介绍：elastic net在具有多个特征，并且特征之间具有一定关联的数据中比较有用。以下为训练误差和测试误差程序：import numpy as npfrom sklearn import linear_model##########################
复制链接

扫一扫