Pandas Basics
Pandas is built on top of NumPy.
1.1 Reading a CSV file
import pandas as pd
food_info=pd.read_csv(r"F:\唐宇迪机器学习资料\机器学习\Python库代码(4个)\2-数据分析处理库pandas\food_info.csv")
print(type(food_info))
#print(food_info.dtypes)
#print(help(pd.read_csv))
<class 'pandas.core.frame.DataFrame'>
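To try the same call without the original file, the sketch below reads a small in-memory CSV instead. The `csv_text` string and the `sample` name are made up for illustration; the three rows copy a few values from the food_info preview in this post.

```python
import io
import pandas as pd

# Hypothetical stand-in for food_info.csv: a few columns and rows only.
csv_text = """NDB_No,Shrt_Desc,Water_(g),Energ_Kcal
1001,BUTTER WITH SALT,15.87,717
1002,BUTTER WHIPPED WITH SALT,15.87,717
1003,BUTTER OIL ANHYDROUS,0.24,876
"""

# read_csv accepts a file path or any file-like object such as StringIO.
sample = pd.read_csv(io.StringIO(csv_text))

print(type(sample))   # the result is a DataFrame, as with the real file
print(sample.shape)   # (number of rows, number of columns)
print(sample.dtypes)  # the dtype inferred for each column
```

Reading from a `StringIO` object keeps the example independent of any local path; with a real file, only the first argument changes.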
The Shrt_Desc column has dtype object, which can be treated as a string type.
The pandas dtypes correspond to the following kinds of values:
object–string
int–int
float–float
datetime–time value
bool–bool
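The mapping above can be checked on a small frame with one column per type. This is only a sketch: the frame `df` and its column names are hypothetical, and depending on the pandas version, text columns may be reported as object or as a dedicated string dtype.

```python
import pandas as pd

# Hypothetical frame with one column for each type in the list above.
df = pd.DataFrame({
    "name": ["butter", "cheese"],                               # text
    "kcal": [717, 353],                                         # integers
    "water_g": [15.87, 42.41],                                  # floats
    "measured": pd.to_datetime(["2020-01-01", "2020-01-02"]),   # datetimes
    "is_dairy": [True, True],                                   # booleans
})

# dtypes returns a Series mapping each column name to its dtype.
print(df.dtypes)
```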
food_info.head()
| | NDB_No | Shrt_Desc | Water_(g) | Energ_Kcal | Protein_(g) | Lipid_Tot_(g) | Ash_(g) | Carbohydrt_(g) | Fiber_TD_(g) | Sugar_Tot_(g) | ... | Vit_A_IU | Vit_A_RAE | Vit_E_(mg) | Vit_D_mcg | Vit_D_IU | Vit_K_(mcg) | FA_Sat_(g) | FA_Mono_(g) | FA_Poly_(g) | Cholestrl_(mg) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1001 | BUTTER WITH SALT | 15.87 | 717 | 0.85 | 81.11 | 2.11 | 0.06 | 0.0 | 0.06 | ... | 2499.0 | 684.0 | 2.32 | 1.5 | 60.0 | 7.0 | 51.368 | 21.021 | 3.043 | 215.0 |
1 | 1002 | BUTTER WHIPPED WITH SALT | 15.87 | 717 | 0.85 | 81.11 | 2.11 | 0.06 | 0.0 | 0.06 | ... | 2499.0 | 684.0 | 2.32 | 1.5 | 60.0 | 7.0 | 50.489 | 23.426 | 3.012 | 219.0 |
2 | 1003 | BUTTER OIL ANHYDROUS | 0.24 | 876 | 0.28 | 99.48 | 0.00 | 0.00 | 0.0 | 0.00 | ... | 3069.0 | 840.0 | 2.80 | 1.8 | 73.0 | 8.6 | 61.924 | 28.732 | 3.694 | 256.0 |
3 | 1004 | CHEESE BLUE | 42.41 | 353 | 21.40 | 28.74 | 5.11 | 2.34 | 0.0 | 0.50 | ... | 721.0 | 198.0 | 0.25 | 0.5 | 21.0 | 2.4 | 18.669 | 7.778 | 0.800 | 75.0 |
4 | 1005 | CHEESE BRICK | 41.11 | 371 | 23.24 | 29.68 | 3.18 | 2.79 | 0.0 | 0.51 | ... | 1080.0 | 292.0 | 0.26 | 0.5 | 22.0 | 2.5 | 18.764 | 8.598 | 0.784 | 94.0 |
5 rows × 36 columns
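head() shows a preview of the data that was just read; by default it displays the first 5 rows. To display only the first three rows, pass 3 as the argument.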
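A quick way to confirm the default and the explicit row count is a small throwaway frame. The 10-row frame `df` below is hypothetical and stands in for food_info.

```python
import pandas as pd

# Hypothetical 10-row frame standing in for food_info.
df = pd.DataFrame({"n": range(10)})

first_five = df.head()    # no argument: the first 5 rows
first_three = df.head(3)  # explicit argument: the first 3 rows

print(len(first_five), len(first_three))
```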
food_info.head(3)
NDB_No | Shrt_Desc | Water_(g) | Energ_Kcal | Protein_(g) |
---|