A Summary of Python Implementations of Machine Learning Models

Latest recommended article published on 2024-07-02 11:45:31

moonfansLTH



Category: Study notes

Article link: https://blog.csdn.net/moonfansLTH/article/details/80719662


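Summary (generated automatically by CSDN): This post summarizes feature processing in machine learning, including standardization and min-max scaling with sklearn.preprocessing. It also introduces the random forest model, in particular the use of RandomForestRegressor, and discusses how to choose model parameters such as n_estimators and max_features. Finally, it mentions the need to build a summary diagram of the models.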

Goal: record the Python implementations of models encountered while studying

Feature processing

Standardization

from sklearn import preprocessing

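Official documentation: http://scikit-learn.org/stable/modules/preprocessing.html#preprocessing
Key point: the data must be converted to an array first
Standardization (normalization): (x - mean) / sigma
- Fit the scaler, which computes the per-feature mean and standard deviation: scaler = preprocessing.StandardScaler().fit(X_train)
- Apply the scaling: scaler.transform(X_train)
Min-max scaling to a range: (x - min) / (max - min)
- min_max_scaler = preprocessing.MinMaxScaler()
- X_train_minmax = min_max_scaler.fit_transform(X_train)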
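
The two scaling recipes above can be put together in a minimal sketch. The toy arrays X_train and X_test are made up for illustration and are not from the original notes. The point of fitting on the training data only is that the same mean, standard deviation, minimum and maximum are then reused on new data.

```python
import numpy as np
from sklearn import preprocessing

# Toy data, invented for illustration only.
X_train = np.array([[1.0, -1.0, 2.0],
                    [2.0, 0.0, 0.0],
                    [0.0, 1.0, -1.0]])
X_test = np.array([[-1.0, 1.0, 0.0]])

# Standardization: (x - mean) / sigma, computed per column on the training set.
scaler = preprocessing.StandardScaler().fit(X_train)
X_train_std = scaler.transform(X_train)
X_test_std = scaler.transform(X_test)  # reuses the training mean and sigma

# Min-max scaling: (x - min) / (max - min), computed per column.
min_max_scaler = preprocessing.MinMaxScaler()
X_train_minmax = min_max_scaler.fit_transform(X_train)
X_test_minmax = min_max_scaler.transform(X_test)  # may fall outside [0, 1]
```

After this, each column of X_train_std has zero mean and unit variance, and each column of X_train_minmax lies in [0, 1]. Test values can fall outside that range because the scaler only knows the training minimum and maximum.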

Random forest

from sklearn.ensemble import RandomForestRegressor

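Official documentation: http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html
Key parameters:
- n_estimators: integer, optional. The number of trees in the forest. The default was 10 in older scikit-learn releases and has been 100 since version 0.22.
- max_features: optional. The number of features each tree considers when searching for the best split. The default was "auto" in older releases; that value was deprecated in version 1.1 and later removed, and the regressor now defaults to 1.0 (all features).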
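
As a rough sketch of how these two parameters are passed, the example below fits a RandomForestRegressor on synthetic data. The dataset from make_regression, the train/test split, and the specific values n_estimators=100 and max_features=0.5 are assumptions chosen for illustration, not recommendations from the original notes.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

# Synthetic regression data, used here only so the example runs on its own.
X, y = make_regression(n_samples=200, n_features=8, n_informative=4,
                       noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# n_estimators: number of trees; max_features=0.5: each split considers
# half of the features (a float is interpreted as a fraction).
model = RandomForestRegressor(n_estimators=100, max_features=0.5,
                              random_state=0)
model.fit(X_train, y_train)

y_pred = model.predict(X_test)
r2 = model.score(X_test, y_test)  # coefficient of determination on held-out data
```

Setting both parameters explicitly avoids depending on defaults that differ between scikit-learn versions.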
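Open questions:
- How should the number of trees be chosen?
- What is the default classifier model? Is it distinguished by criterion (gini, mse)? How would naive Bayes classification be done?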
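
One possible way to explore these questions, sketched under assumptions: compare a few candidate values of n_estimators with cross-validation, and inspect a fitted forest to see what its sub-models are. The synthetic data, the candidate list, and the choice of 3 folds with R^2 scoring are illustrative only. In scikit-learn, the trees of a RandomForestRegressor are DecisionTreeRegressor instances. Naive Bayes is a separate model family (sklearn.naive_bayes, for example GaussianNB), not a setting of the random forest.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeRegressor

# Synthetic data so the sketch is self-contained.
X, y = make_regression(n_samples=150, n_features=6, n_informative=3,
                       noise=5.0, random_state=0)

# Hypothetical candidate values; a real search would likely go higher.
candidate_sizes = [5, 20, 50]
cv_scores = {}
for n_trees in candidate_sizes:
    forest = RandomForestRegressor(n_estimators=n_trees, random_state=0)
    # Mean R^2 over 3 cross-validation folds for this forest size.
    scores = cross_val_score(forest, X, y, cv=3, scoring="r2")
    cv_scores[n_trees] = scores.mean()

# Pick the size with the best mean cross-validated score.
best_n = max(cv_scores, key=cv_scores.get)

# Refit with the chosen size and check the type of the sub-models.
forest = RandomForestRegressor(n_estimators=best_n, random_state=0).fit(X, y)
base_is_tree = isinstance(forest.estimators_[0], DecisionTreeRegressor)
```

In practice the score usually levels off as trees are added, so a common approach is to pick the smallest value beyond which the cross-validated score stops improving noticeably, trading accuracy against training time.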