数据exdata.txt见最后:
第一:导入工具包
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
二:读取数据
path = 'ex1data1.txt'
data = pd.read_csv(path,header = 0,names = ['Population','Profit'])
data.tail()
Population | Profit | |
---|---|---|
91 | 5.8707 | 7.20290 |
92 | 5.3054 | 1.98690 |
93 | 8.2934 | 0.14454 |
94 | 13.3940 | 9.05510 |
95 | 5.4369 | 0.61705 |
三:数据可视化
data.plot(kind='scatter',x='Population',y='Profit')
plt.show()
四:数据处理
data.insert(0,'Ones',1)
cols = data.shape[1]#二维:[0]行,[1]列
x = data.iloc[:,:cols-1] #保留前两列 'Ones','population'
y = data.iloc[:,cols-1