scikit-learning 多项式回归应用房价预测

pynash123

于 2019-04-14 12:27:01 发布

阅读量1k

点赞数

分类专栏：算法机器学习 python

本文链接：https://blog.csdn.net/pynash123/article/details/89294935

版权

本文使用scikit-learn库的load_boston数据集，通过一到三阶多项式回归模型进行房价预测。结果显示，一阶多项式存在欠拟合现象，二阶多项式模型拟合效果较好，而三阶多项式模型由于特征个数超过样本数量导致过拟合。

摘要由CSDN通过智能技术生成

将sklearn.datasets中的load_boston房价数据用多项式回归进行训练，并画出学习曲线

import matplotlib.pyplot as plt
import numpy as np
from sklearn.datasets import load_boston
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures
from sklearn.pipeline import Pipeline
from common.utils import plot_learning_curve
from sklearn.model_selection import ShuffleSplit

def get_bosten_houst_price_train_test_data():
    bosten_houst_price_data = load_boston()
    x = bosten_houst_price_data.data
    y = bosten_houst_price_data.target
    print(bosten_houst_price_data.feature_names)
    #print(x)
    print(x.shape)
    x_train