Section I: Brief Introduction on ValidatingCurves
Validation curves are a useful tool for improving the performance of a model by addressing issues such as overfitting or underfitting. Validation curves are related to learning curves, but instead of plotting the training and test accuracies as function of the sample size, the values of the model parameters, i.e., the inverse regulation parameter C in logistic regression, are gradually varied, and then the trend of accuracies for both dataset versus regulation strength are plotted.
Personal Views
验证曲线可以用来可视化分析,参数的优化调整对训练/测试数据集的性能影响。
FROM
Sebastian Raschka, Vahid Mirjalili. Python机器学习第二版. 南京:东南大学出版社,2018.
Section II: Code Bundle and Analyses
代码
from sklearn import datasets
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
import numpy as np
from sklearn.model_selection import validation_curve
import matplotlib.pyplot as plt
import warnings
warnings.filterwarnings("ignore")
plt.rcParams['figure.dpi']=200
plt.