泽野 - CSDN Blog

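[Original] Python Data Representation and Feature Engineering 1

Data Representation and Feature Engineering. So far, the data has been a two-dimensional array of floating-point numbers in which each column is a continuous feature. In practice, most real-world features are categorical features, also called discrete features, and they are usually not numeric. We also know that scaling the data matters; likewise, augmenting the data with extra features, such as interaction terms between features, can help a lot. Feature engineering (feature eng...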

2020-02-06 15:52:57 981 1
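
The excerpt above mentions augmenting data with interaction terms. A minimal sketch of that idea in plain Python follows; the helper name `add_interactions` and the toy matrix `X` are assumptions made for illustration and do not come from the original post.

```python
from itertools import combinations


def add_interactions(rows):
    """Return a copy of rows with all pairwise feature products appended."""
    augmented = []
    for row in rows:
        # One interaction term per unordered pair of features in the row.
        products = [a * b for a, b in combinations(row, 2)]
        augmented.append(list(row) + products)
    return augmented


# Hypothetical toy data: 2 samples, 3 continuous features each.
X = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
X_aug = add_interactions(X)
print(X_aug)
```

Each row keeps its original three features and gains three product features, so a linear model trained on `X_aug` can pick up pairwise effects that it could not express on `X` alone.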

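[Original] Python Unsupervised Learning 3

Clustering. Clustering partitions the data into groups called clusters. A clustering algorithm assigns each data point a number that indicates which cluster the point belongs to. 1. K-means clustering. It tries to find cluster centers: each data point is assigned to its nearest cluster center, and each cluster center is then set to the mean of the points assigned to it. mglearn.plots.plot_kmeans_algorithm() plt.show() Initialization comes first. Then the data points are assigned and the centers recomputed; in total it iterated 3...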

2020-02-05 19:59:06 1145
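
The two alternating steps described in the K-means excerpt above (assign each point to the nearest center, then move each center to the mean of its points) can be sketched in plain Python. This is an illustrative sketch, not the scikit-learn implementation; the function `kmeans`, the fixed initial centers, and the toy points are assumptions.

```python
def kmeans(points, centers, n_iter=10):
    """Plain K-means on 2-D points, starting from the given centers."""
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(n_iter):
        # Assignment step: index of the nearest center for every point.
        labels = [
            min(
                range(len(centers)),
                key=lambda k: (p[0] - centers[k][0]) ** 2 + (p[1] - centers[k][1]) ** 2,
            )
            for p in points
        ]
        # Update step: each center becomes the mean of its assigned points.
        new_centers = []
        for k in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centers.append(
                    (
                        sum(m[0] for m in members) / len(members),
                        sum(m[1] for m in members) / len(members),
                    )
                )
            else:
                new_centers.append(centers[k])  # leave an empty cluster's center in place
        if new_centers == centers:
            break  # centers stopped moving, so the assignment is stable
        centers = new_centers
    return labels, centers


# Hypothetical toy data: two well-separated groups of three points.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
labels, centers = kmeans(points, centers=[(0.0, 0.0), (10.0, 10.0)])
print(labels, centers)
```

Real implementations usually choose the initial centers randomly or with a seeding heuristic; fixed centers are used here only to keep the sketch deterministic.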

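[Original] Python Unsupervised Learning 2

Dimensionality Reduction, Feature Extraction, and Manifold Learning. The most common algorithm for these data-transformation goals is principal component analysis. Two other algorithms are also covered: non-negative matrix factorization (NMF), used for feature extraction, and t-SNE, used for visualization in two-dimensional scatter plots. 1. Principal component analysis. Principal component analysis (PCA) is a method that rotates the dataset so that the rotated features are statistically uncorrelated. The rotation is usually followed by selecting a subset of the new features according to how important they are for explaining the data. mglearn....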

2020-02-04 21:22:46 1030
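
The claim in the PCA excerpt above, that the rotated features are statistically uncorrelated, can be checked by hand in two dimensions, where the rotation angle has a closed form. The sketch below is a 2-D special case written in plain Python, not a general PCA; the function `pca_rotate_2d` and the toy points are assumptions.

```python
import math


def pca_rotate_2d(points):
    """Center 2-D points and rotate them onto their principal axes."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    centered = [(x - mean_x, y - mean_y) for x, y in points]
    # Entries of the 2x2 covariance matrix of the centered data.
    sxx = sum(x * x for x, _ in centered) / n
    syy = sum(y * y for _, y in centered) / n
    sxy = sum(x * y for x, y in centered) / n
    # Angle of the first principal axis: tan(2*theta) = 2*sxy / (sxx - syy).
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    c, s = math.cos(theta), math.sin(theta)
    rotated = [(c * x + s * y, -s * x + c * y) for x, y in centered]
    return rotated, theta


# Hypothetical toy data lying roughly along the diagonal.
points = [(1.0, 1.0), (2.0, 2.1), (3.0, 2.9), (4.0, 4.2), (5.0, 4.8)]
rotated, theta = pca_rotate_2d(points)

n = len(rotated)
cov_uv = sum(u * v for u, v in rotated) / n  # covariance of the two new features
var_u = sum(u * u for u, _ in rotated) / n   # variance along the first component
var_v = sum(v * v for _, v in rotated) / n   # variance along the second component
print(cov_uv, var_u, var_v)
```

After the rotation the covariance between the two new features is zero up to floating-point error, and the first feature carries most of the variance, which is why keeping only the first component loses little information on data like this.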

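[Original] Python Unsupervised Learning 1

Unsupervised Learning and Preprocessing. 1. Types of unsupervised learning. This chapter looks at two types of unsupervised learning: transformations of the dataset and clustering. An unsupervised transformation of a dataset is an algorithm that creates a new representation of the data, for example by extracting its features or reducing its dimensionality. Another application is finding the parts or components that "make up" the data. In contrast, a clustering algorithm partitions the data into distinct groups. 2. Challenges in unsupervised learning. The main challenge is evaluating whether the algorithm...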

2020-02-02 11:22:21 619

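[Original] Python Supervised Learning_Uncertainty Estimates from Classifiers

Uncertainty Estimates from Classifiers. Another useful feature of scikit-learn is that classifiers can provide uncertainty estimates for their predictions. We care not only about which class a point is predicted to be in, but also about how confident that prediction is, which is a serious matter in medical applications. scikit-learn classifiers offer two methods for obtaining uncertainty estimates: decision_function and predict_proba. from sklearn.model_selection import t...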

2020-01-31 23:50:04 428
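
To make the difference between a signed decision score and a class probability concrete, the sketch below maps scores to probabilities with the logistic sigmoid, which is how the two quantities relate for a binary logistic-regression-style model. It is plain Python and does not call scikit-learn; the scores are made-up values, and other classifiers may relate `decision_function` and `predict_proba` differently.

```python
import math


def sigmoid(score):
    """Map a real-valued decision score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-score))


# Hypothetical decision scores for three samples (positive favors class 1).
scores = [-2.0, 0.0, 3.0]

# Probability of class 1 for each sample under the sigmoid assumption.
probs = [sigmoid(s) for s in scores]

# The predicted class depends only on the sign of the score.
preds = [1 if s > 0 else 0 for s in scores]

for s, p, c in zip(scores, probs, preds):
    print(f"score={s:+.1f}  P(class 1)={p:.3f}  predicted class={c}")
```

A score far from zero corresponds to a probability near 0 or 1, while a score of exactly zero sits on the decision boundary with probability 0.5.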

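[Original] Python Supervised Learning_Neural Networks (Deep Learning)

Neural Networks (Deep Learning). Deep learning algorithms are often carefully tailored to a specific use case. Here we discuss the simple multilayer perceptron (MLP) for classification and regression; MLPs are also known as feed-forward neural networks, or simply neural networks. 1. The neural network model. An MLP can be viewed as a generalization of linear models. display(mglearn.plots.plot_logistic_regression_graph()) Input features...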

2020-01-31 23:18:29 1128 1
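
One way to read the statement that an MLP generalizes linear models: it computes weighted sums twice, with a nonlinearity in between. The forward pass below is a minimal sketch with one hidden layer and a tanh activation; the function `mlp_forward` and all weights are made-up values for illustration, not trained parameters.

```python
import math


def mlp_forward(x, W1, b1, w2, b2):
    """One-hidden-layer MLP for regression: weighted sums -> tanh -> weighted sum."""
    # Hidden layer: one weighted sum of the inputs per hidden unit, then tanh.
    hidden = [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(W1, b1)
    ]
    # Output layer: a plain weighted sum of the hidden activations.
    return sum(w * h for w, h in zip(w2, hidden)) + b2


# Hypothetical parameters: 2 inputs, 2 hidden units, 1 output.
x = [1.0, 2.0]
W1 = [[1.0, 0.0], [0.0, 1.0]]
b1 = [0.0, 0.0]
w2 = [1.0, 1.0]
b2 = 0.5

output = mlp_forward(x, W1, b1, w2, b2)
print(output)
```

Without the tanh, the two layers would collapse into a single weighted sum, which is exactly an ordinary linear model.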

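[Original] Python Supervised Learning_Kernelized Support Vector Machines (SVM)

Kernelized Support Vector Machines. In the section on linear models, we used linear support vector machines for classification. Kernelized support vector machines (SVMs) are an extension that allows for more complex models that are not defined simply by hyperplanes in the input space. They can be used for both regression and classification: classification is implemented in SVC, and regression in SVR. For details, see Chapter 12 of The Elements of Statistical Learning by Hastie, Tibshirani, and Friedman. 1. Linear models and nonlinear features. Linear models in low-dimensional spaces easily...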

2020-01-28 12:57:44 791 1
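
The section heading "Linear models and nonlinear features" in the excerpt above can be illustrated with a one-dimensional toy problem: no single threshold on x separates the classes, but a threshold on the added feature x squared does. The data and the helper `separable_by_threshold` are assumptions made for this sketch; no SVM is fitted here.

```python
def separable_by_threshold(values, labels):
    """True if one threshold puts all class-0 values on one side and class-1 on the other."""
    zeros = [v for v, lab in zip(values, labels) if lab == 0]
    ones = [v for v, lab in zip(values, labels) if lab == 1]
    return max(zeros) < min(ones) or max(ones) < min(zeros)


# Hypothetical 1-D data: class 1 lies on both sides of class 0.
xs = [-3, -2, -1, 0, 1, 2, 3]
ys = [1, 1, 0, 0, 0, 1, 1]

# In the original feature, a linear (threshold) rule cannot separate the classes.
linear_ok = separable_by_threshold(xs, ys)

# After adding the nonlinear feature x**2, a single threshold is enough.
squared = [x ** 2 for x in xs]
squared_ok = separable_by_threshold(squared, ys)

print(linear_ok, squared_ok)
```

Kernel methods build on the same observation, but avoid computing the expanded features explicitly.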

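[Original] Python Supervised Learning_Decision Trees

Decision Trees. Decision trees are widely used for classification and regression. Each node in a decision tree represents either a question or a terminal node that contains an answer. 1. Building a decision tree. The questions in a decision tree are called tests. In machine learning, data usually does not come as binary yes/no features but as continuous features, and the tests used to split continuous data have the form "Is the feature greater than a?" Starting from the top node (called the root), the data is partitioned recursively until every data point in a node shares the same target value; such a node is called pure. When a decision tree is used for regression, the whole tree is traversed,...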

2020-01-19 23:07:18 734
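
A single test of the form "is the feature greater than a?" can be chosen by trying candidate thresholds and keeping the one that leaves the purest child nodes. The sketch below does this for one feature. The excerpt does not say which purity measure is used, so Gini impurity is an assumption here, as are the function names and the toy data.

```python
def gini(labels):
    """Gini impurity of a list of class labels (0.0 means the node is pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))


def best_threshold(values, labels):
    """Return (threshold, weighted impurity) of the best split on one feature."""
    pairs = sorted(zip(values, labels))
    best = (None, float("inf"))
    for (v1, _), (v2, _) in zip(pairs, pairs[1:]):
        if v1 == v2:
            continue  # no threshold fits between two equal values
        t = (v1 + v2) / 2.0  # candidate: midpoint between neighboring values
        left = [lab for v, lab in pairs if v <= t]
        right = [lab for v, lab in pairs if v > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(pairs)
        if score < best[1]:
            best = (t, score)
    return best


# Hypothetical toy data: one continuous feature, two classes.
values = [1.0, 2.0, 3.0, 7.0, 8.0, 9.0]
labels = [0, 0, 0, 1, 1, 1]
threshold, impurity = best_threshold(values, labels)
print(threshold, impurity)
```

A full tree repeats this search recursively on each child node, over all features, until the nodes are pure or a stopping rule applies.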

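[Original] Python Supervised Learning_Five Common Regressions (Linear Models) + Naive Bayes Classifiers_Linear Regression, Ridge Regression, Lasso, Linear Models for Classification, Linear Models for Multiclass Classification

Next come five kinds of regression (linear models). Linear models for regression (which can be thought of as the equation of a line, or a weighted sum): the prediction is a line for a single feature, a plane for two features, and so on. Linear models are very powerful for datasets with many features! X, y = mglearn.datasets.make_forge() mglearn.plots.plot_linear_regression_wave() 1. Linear regression (ordinary least squares). The two parameters w (weights/...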

2020-01-18 15:37:13 1831
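
For a single feature, ordinary least squares has a closed-form solution for the weight w and the intercept b, which makes the "equation of a line" reading in the excerpt above easy to verify. This is a plain-Python sketch with made-up data; the function name `fit_line` is an assumption, and the post itself uses mglearn and scikit-learn.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = w * x + b with a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope: covariance of x and y divided by the variance of x.
    w = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    # Intercept: the fitted line passes through the point of means.
    b = mean_y - w * mean_x
    return w, b


# Hypothetical data generated from y = 2x + 1 without noise.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
w, b = fit_line(xs, ys)
print(w, b)
```

Ridge and lasso fit the same kind of model but add a penalty on the size of w, which matters most when there are many features.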

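[Original] Python Supervised Learning_K-Nearest Neighbors Regression

k-nearest neighbors regression. Regression with 1 neighbor: X, y = mglearn.datasets.make_wave(n_samples=40) mglearn.plots.plot_knn_regression(n_neighbors=1) plt.show() Regression with 3 neighbors: X, y = mglearn.datasets.make_wave(n_samples=40) mglearn.plots...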

2020-01-18 11:33:02 377 1
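
What changes between 1 neighbor and 3 neighbors in the excerpt above is only how many nearby targets are averaged. A minimal one-feature sketch in plain Python follows; the function `knn_regress` and the training data are assumptions and stand in for the make_wave dataset used in the post.

```python
def knn_regress(train_x, train_y, query, k):
    """Predict the mean target of the k training points closest to query."""
    nearest = sorted(zip(train_x, train_y), key=lambda pair: abs(pair[0] - query))[:k]
    return sum(y for _, y in nearest) / k


# Hypothetical 1-D training data.
train_x = [0.0, 1.0, 2.0, 3.0]
train_y = [0.0, 1.0, 4.0, 9.0]

pred_k1 = knn_regress(train_x, train_y, query=1.2, k=1)  # copies the closest target
pred_k3 = knn_regress(train_x, train_y, query=1.2, k=3)  # averages three targets
print(pred_k1, pred_k3)
```

With one neighbor the prediction passes through every training point; with more neighbors it becomes smoother but fits the training data less closely.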

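[Original] Python Supervised Learning_K-Nearest Neighbors Classification

First, load the forge dataset and plot the case with 1 neighbor; both axes are feature values. X, y = mglearn.datasets.make_forge() # mglearn.discrete_scatter(X[:, 0], X[:, 1], y) # plt.legend(["Class 0", "Class 1"], loc=4) # plt.xlabel("First feature") #...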

2020-01-17 13:20:48 344
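
The classification rule behind the plot described above is a majority vote among the k closest training points. The sketch below shows it in plain Python with squared Euclidean distance; the function `knn_classify` and the two-feature toy data are assumptions and are not the forge dataset.

```python
from collections import Counter


def knn_classify(train_X, train_y, query, k):
    """Return the majority label among the k training points closest to query."""
    by_distance = sorted(
        zip(train_X, train_y),
        key=lambda pair: sum((a - b) ** 2 for a, b in zip(pair[0], query)),
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Hypothetical training data: two features per point, two classes.
train_X = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
train_y = [0, 0, 0, 1, 1, 1]

pred_a = knn_classify(train_X, train_y, query=(2.0, 2.0), k=1)
pred_b = knn_classify(train_X, train_y, query=(8.0, 9.0), k=3)
print(pred_a, pred_b)
```

With k=1 the prediction is simply the label of the single closest point; larger k gives smoother decision boundaries.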

[Original] Engineering Mathematics_Exercise_2

#include <stdio.h> #include <stdlib.h> #include <math.h> #include <time.h> #define MaxNumber 30 typedef double ElemType; typedef struct matrix { int col; int column; Ele...

2019-10-13 18:51:23 163

[Original] Basic Usage of Essential Python Libraries and Tools

NumPy: import numpy as np x = np.array([[1, 2, 3], [4, 5, 6]]) print("x:\n{}".format(x)) SciPy: import numpy as np from scipy import sparse eye = np.eye(4) print("\nNumpy array:\n{}".format(eye)) #...

2019-10-07 21:15:13 260

[Original] Engineering Mathematics_Exercise_1

#include <stdio.h> #include <stdlib.h> #include <math.h> #include <time.h> #define MaxNumber 30 typedef double ElemType; typedef struct matrix { int col; int column; Ele...

2019-10-05 11:01:20 179

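[Original] Python Exercise_1

Test_1 y = 1 x = y << 2 print(x) # 4, left shift by two bits x = y | 2 print(x) # 3, bitwise OR x = y & 2 print(x) # 0, because a result bit is 1 only when both bits are 1, so 1 & 2 is 0. Built-in types: Number String List Dictionary Tuple File. Numbers: -24....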

2019-10-04 16:05:48 230

[Original] Data Structure_LinkList

#include <stdio.h> #include <stdlib.h> #define TRUE 1 #define FALSE 0 typedef char ElemType; typedef struct Node { ElemType data; struct Node* next; // stores the count, not the index }Node, *LinkList;...

2019-10-03 20:24:33 173

[Original] Data Structure_SeqList

#include <stdio.h> #define MAX_SIZE 100 #define TRUE 1 #define FALSE 0 typedef int ElemType; typedef struct { ElemType data[MAX_SIZE]; int length; // stores the count, not the index }SeqList; void InputList(...

2019-10-03 19:08:00 218

[Original] C++ Exercise_4

Test_1 #include <iostream> // maybe we can use an enum enum{white,black,yeelow}; int main(){ using namespace std; int code; cout << "Enter please: "; cin >> code; if (code ==...

2019-10-03 14:43:22 133

[Original] C++ Exercise_3

Test_1 #include <iostream> #include <cstring> int main(){ using namespace std; //for (cin>>x; x==0; cin>>x) //----------------------------------// //for (char ch = 'a'...

2019-10-03 12:45:27 219

[Original] C++ Exercise_2

Test_1 #include <iostream> #include <cstring> int main(){ using namespace std; char animal[20] = "rabbit"; char* ps = new char[strlen(animal) + 1]; strcpy(ps, animal); // why is there a warning?...

2019-10-03 03:14:42 700

[Original] C++ Exercise_1

Test_1 #include <iostream> using namespace std; int main() // updated to avoid name clashes {std::cout << "hello,world" << std::endl;} Test_2 using namespace std; int a, b; std::cout << "E...

2019-10-03 01:42:15 167