自动机器学习
自动机器学习(AutoML)是将机器学习应用于现实问题的端到端流程自动化的过程。AutoML 使真正意义上的机器学习成为可能。
AutoML库
Featuretools
import pandas as pd
#第一个表格展示数据库信息
# First dataset contains the basic information for databases.
databases_df=pd.DataFrame({"database_id":[2234,1765,8796,2237,3398], "creation_date":["2018-02-01","2017-03-02","2017-05-03","2013-05-12", "2012-05-09"]})
databases_df.head()
# 以下是数据库事务的代码:
# Second dataset contains the information of transaction for each database id
db_transactions_df= pd.DataFrame({"transaction_id":[26482746,19384752, 48571125,78546789,19998765,26482646,12484752,42471125,75346789, 16498765,65487547,23453847,56756771,45645667,23423498,12335268, 76435357,34534711,45656746,12312987],
"database_id":[2234,1765,2234,2237,1765,8796,2237,
8796,3398,2237,3398,2237,2234,8796,1765,2234,2237,1765,8796, 2237],
"transaction_size":[10,20,30,50,100,40,60,60,10,
20,60,50,40,40,30,90,130,40,50,30],
"transaction_date":["2018-02-02","2018-03-02",
"2018-03-02","2018-04-02","2018-04-02","2018-05-02","2018-06-02", "2018-06-02","2018-07-02","2018-07-02","2018-01-03","2018-02-03", "2018-03-03","2018-04-03","2018-04-03","2018-07-03", "2018-07-03", "2018-07-03","2018-08-03","2018-08-03"]})
db_transactions_df.head()
# Entities for each of datasets should be defined
entities = {
"databases":(databases_df,"database_id"),
"transactions":(db_transactions_df,"transaction_id")
}
# Relationships between tables should also be defined as below
relationships=[("databases","database_td","transactions", "database_id")]
print(entities)
auto-sklearn
!pip3 install auto-sklearn
import autosklearn.classification
cls = autosklearn.classification.AutoSklearnClassifier()
cls.fit(X_train, y_train)
predictions = cls.predict(X_test)
实例
auto-sklearn_Classification
auto-sklearn要求Python>=3.7
python3 -V
cat /etc/redhat-release
我租用的阿里云CentOS8服务器的python是3.6,需要升级
centos8(linux):编译升级python版本:从python3.6到python3.11
python_install.sh
#!/bin/bash
wget https://www.python.org/ftp/python/3.7.9/Python-3.7.9.tgz
tar -zxvf Python-3.7.9.tgz
cd Python-3.7.9
./configure --prefix=/usr/local/soft/python3.7 --enable-optimizations
make
make install
#!/bin/bash
/usr/local/soft/python3.7/bin/python3 --version
/usr/local/soft/python3.7/bin/pip3 -V
#替换原有python并查看效果
whereis python3
mv /usr/bin/python3 /usr/bin/python3.6.bak
ln -s /usr/local/soft/python3.7/bin/python3 /usr/bin/python3
#查看原有的pip的路径并替换
mv /usr/bin/pip3 /usr/bin/pip3.bak
ln -s /usr/local/soft/python3.7/bin/pip3 /usr/bin/pip3
#测试效果
python3 --version
pip3 -V
参考
《自动机器学习入门与实践 使用Python》