pandas
top啦它
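Merge the data from the two files a.txt and b.txt and save the merged data in c.txt.
import pandas as pd

# Load both input files
dataa = pd.read_csv('a.txt')
datab = pd.read_csv('b.txt')
# Left join: keep every row of a.txt; with no `on` argument, pandas joins on the columns the two files share
dataab = pd.merge(dataa, datab, how='left')
# Save the merged data without the index column
dataab.to_csv('c.txt', index=False)
Original · 2021-01-07 16:15:08 · 1254 views · 0 comments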
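The merge above depends on which columns a.txt and b.txt happen to share. Here is a minimal sketch with the join key written out explicitly. The file contents and the `id`, `name`, and `score` columns are made up for illustration, since the post does not show the real files.

```python
import io

import pandas as pd

# Hypothetical stand-ins for a.txt and b.txt (the real files are not shown).
a_txt = io.StringIO("id,name\n1,Ann\n2,Bob\n3,Cid\n")
b_txt = io.StringIO("id,score\n1,90\n3,75\n4,60\n")

dataa = pd.read_csv(a_txt)
datab = pd.read_csv(b_txt)

# Naming the key with on= makes the join column explicit; without it,
# pd.merge joins on every column name the two frames have in common.
dataab = pd.merge(dataa, datab, how="left", on="id")

# With no path argument, to_csv returns the CSV text instead of writing a file.
c_txt = dataab.to_csv(index=False)
print(c_txt)
```

A left join keeps all rows of the left table, so the row with `id` 2 stays in the result with an empty `score`, and the row with `id` 4 from the right table is dropped.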
The SettingWithCopyWarning in Pandas
import pandas as pd

fpath = "datas/beijing_tianqi/beijing_tianqi_2018.csv"
df = pd.read_csv(fpath)
# Strip the ℃ suffix and convert the temperature columns to integers
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.loc[:,"yWendu"] = df["yWendu"].str.replace("℃","").astype('int32')
# print(df.head())
# 1. Reproduce the warning
condition = …
Original · 2020-10-07 18:13:26 · 109 views · 0 comments
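The snippet is cut off before the condition is defined, so what follows is only a sketch of the usual pattern behind this warning and two common ways around it. The sample rows and the March filter are assumptions. Only the column names `ymd`, `bWendu`, `yWendu`, and `wencha` appear in the posts.

```python
import pandas as pd

# Hypothetical miniature of the weather table.
df = pd.DataFrame({
    "ymd": ["2018-03-01", "2018-03-02", "2018-04-01"],
    "bWendu": [8, 9, 20],
    "yWendu": [-3, -1, 9],
})

# Assumed filter: rows from March 2018.
condition = df["ymd"].str.startswith("2018-03")

# Chained indexing such as df[condition]["wencha"] = ... assigns into a
# temporary object, which is the pattern SettingWithCopyWarning flags.

# Option 1: take an explicit copy when an independent subset is wanted.
df_march = df[condition].copy()
df_march["wencha"] = df_march["bWendu"] - df_march["yWendu"]

# Option 2: select rows and column in a single .loc call on the original frame.
df.loc[condition, "wencha"] = df["bWendu"] - df["yWendu"]

print(df)
print(df_march)
```

With option 2 the rows outside the condition get a missing value in the new column, because only the selected rows are written.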
Handling missing values in Pandas
import pandas as pd

studf = pd.read_excel("datas/student_excel/student_excel.xlsx",skiprows=2)
# print(studf)
# Detect null values
# print(studf.isnull())
# print(studf["分数"].isnull())
# print(studf["分数"].notnull())
# print(studf.loc[studf["分数"].notnull(),:])
# Drop columns that are entirely null
studf.dropna(axis=…
Original · 2020-10-07 16:27:13 · 127 views · 0 comments
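The Excel file is not available and the snippet stops at the first `dropna` call, so here is a self-contained sketch of the detect, drop, and fill steps on a small frame. Only the `分数` (score) column name comes from the post. The `name`, `subject`, and `unused` columns and all values are assumptions.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the student sheet.
studf = pd.DataFrame({
    "name": ["Ming", None, None, None, "Wang", None],
    "subject": ["Chinese", "Math", "English", None, "Chinese", "Math"],
    "分数": [85.0, 80.0, 90.0, np.nan, 85.0, np.nan],
    "unused": [np.nan] * 6,
})

# Detect nulls in one column (True where the value is missing).
missing_scores = studf["分数"].isnull()

# Drop columns that are entirely null, then rows that are entirely null.
studf = studf.dropna(axis="columns", how="all")
studf = studf.dropna(axis="index", how="all")

# Fill the remaining gaps: a constant for scores, forward fill for names.
studf = studf.fillna({"分数": 0})
studf = studf.assign(name=studf["name"].ffill())

print(studf)
```

Each step returns a new frame that is assigned back to `studf`, which avoids relying on `inplace=True`.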
Pandas statistical functions
import pandas as pd

fpath = "datas/beijing_tianqi/beijing_tianqi_2018.csv"
df = pd.read_csv(fpath)
# print(df.head(3))
# Strip the ℃ suffix and convert the temperature columns to integers
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.loc[:,"yWendu"] = df["yWendu"].str.replace("℃","").astype('int32')
# print(df.head(3…
Original · 2020-10-07 11:38:06 · 258 views · 0 comments
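The snippet ends before any statistics are computed. A minimal sketch of the kind of calls the title suggests, on made-up data, might look like this. The `weather` column and all values are assumptions. The sketch assigns with `df[...] =` because on recent pandas versions a full-column `.loc` assignment may keep the original object dtype.

```python
import pandas as pd

# Hypothetical miniature of the weather table, with the ℃ suffix still attached.
df = pd.DataFrame({
    "bWendu": ["3℃", "2℃", "2℃", "0℃", "3℃"],
    "yWendu": ["-6℃", "-5℃", "-5℃", "-8℃", "-6℃"],
    "weather": ["sunny", "cloudy", "sunny", "snow", "sunny"],
})

# Convert the temperature text to integers.
df["bWendu"] = df["bWendu"].str.replace("℃", "").astype("int32")
df["yWendu"] = df["yWendu"].str.replace("℃", "").astype("int32")

# Summary statistics for the numeric columns (count, mean, std, min, quartiles, max).
summary = df[["bWendu", "yWendu"]].describe()

# Single-value statistics.
mean_high = df["bWendu"].mean()
max_high = df["bWendu"].max()
min_low = df["yWendu"].min()

# Distinct values and their frequencies for a categorical column.
kinds = df["weather"].unique()
counts = df["weather"].value_counts()

# Pearson correlation between daily high and daily low.
corr = df["bWendu"].corr(df["yWendu"])

print(summary)
print(counts)
print(corr)
```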
How to add new data columns in Pandas
import pandas as pd

fpath = "datas/beijing_tianqi/beijing_tianqi_2018.csv"
df = pd.read_csv(fpath)
# print(df.head())
# Strip the ℃ suffix and convert the temperature columns to integers
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.loc[:,"yWendu"] = df["yWendu"].str.replace("℃","").astype('int32')
# 1
# Add wencha d…
Original · 2020-10-07 11:14:58 · 574 views · 0 comments
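Only the start of the first method survives in the snippet. As a sketch, here are four common ways to add a column; these are not necessarily the ones the post goes on to list. The sample values, the thresholds, and the names `wendu_type`, `bWendu_f`, and `big_wencha` are hypothetical.

```python
import pandas as pd

# Hypothetical temperatures, already converted to integers.
df = pd.DataFrame({"bWendu": [3, 35, 20], "yWendu": [-6, 25, 9]})

# 1. Direct assignment from a vectorised expression.
df["wencha"] = df["bWendu"] - df["yWendu"]

# 2. apply with a row-wise function (axis=1 passes each row as a Series).
def classify(row):
    if row["bWendu"] > 33:
        return "hot"
    if row["yWendu"] < -5:
        return "cold"
    return "mild"

df["wendu_type"] = df.apply(classify, axis=1)

# 3. assign returns a new frame and leaves df unchanged.
df_f = df.assign(bWendu_f=lambda x: x["bWendu"] * 9 / 5 + 32)

# 4. Conditional assignment: set a default, then overwrite the selected rows.
df["big_wencha"] = "no"
df.loc[df["wencha"] > 10, "big_wencha"] = "yes"

print(df)
print(df_f)
```

Direct assignment is usually the fastest of these. `apply` with `axis=1` runs Python code once per row, so it is convenient but slow on large tables.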
Five ways to query data in Pandas
import pandas as pd

df = pd.read_csv("datas/beijing_tianqi/beijing_tianqi_2018.csv")
print(df.head())
print(df.index)
print('------------------------')
# Use the date column as the index so rows can be looked up by date
df.set_index('ymd',inplace=True)
print(df.index)
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.lo…
Original · 2020-10-07 10:31:05 · 799 views · 0 comments
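The five methods themselves are cut off in the snippet. A plausible reading is the five selector types that `df.loc` accepts. The sketch below shows those on made-up rows and is an assumption about what the post covers.

```python
import pandas as pd

# Hypothetical miniature of the weather table, indexed by date string.
df = pd.DataFrame({
    "ymd": ["2018-01-01", "2018-01-02", "2018-01-03", "2018-01-04"],
    "bWendu": [3, 2, 2, 0],
    "yWendu": [-6, -5, -5, -8],
})
df = df.set_index("ymd")

# 1. A single label for the row and for the column returns one value.
one = df.loc["2018-01-03", "bWendu"]

# 2. Lists of labels return a DataFrame.
some = df.loc[["2018-01-01", "2018-01-03"], ["bWendu", "yWendu"]]

# 3. A label slice includes both endpoints.
span = df.loc["2018-01-02":"2018-01-04", "bWendu"]

# 4. A boolean condition; combine conditions with & and parentheses.
cold = df.loc[(df["bWendu"] <= 2) & (df["yWendu"] <= -5), :]

# 5. A callable that receives the frame and returns a boolean mask.
freezing = df.loc[lambda d: d["yWendu"] < -5, :]

print(one, some, span, cold, freezing, sep="\n\n")
```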
The Pandas data structures DataFrame and Series
"""
dataframe, series
s1 = pd.Series([1,'a',5,2.7])
print(s1)
print(s1.index)
print(s1.values)
s2 = pd.Series([1,'a',5.2,7],index=['d','b','a','c'])
print(s2)
print(s2.index)
print(s2.values)
sdata = {'Ohio':35000,'Texas':72000,'Oregon':16000,'Utah':5000…
Original · 2020-10-07 09:43:47 · 58 views · 0 comments
Reading mysql, xlsx, and txt data with pandas
"""
Read a txt file
fpath = "datas/crazyant/access_pvuv.txt"
pvuv = pd.read_csv(
    fpath,
    sep="\t",
    header=None,
    names=['pdate','pv','uv']
)
print(pvuv)
"""
"""
Read from mysql
import pymysql
conn = pymysql.connect(
    host='127.0.0.1',
    user='root',…
Original · 2020-10-07 09:14:29 · 98 views · 0 comments