pandas
top啦它
Merge the data from two files, a.txt and b.txt, and save the merged data to c.txt.
import pandas as pd
dataa = pd.read_csv('a.txt')
datab = pd.read_csv('b.txt')
dataab = pd.merge(dataa, datab, how='left')
dataab.to_csv('c.txt', index=False)
Original · 2021-01-07 16:15:08 · 1243 views · 0 comments
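A minimal sketch of the same left merge that runs without files on disk. The in-memory buffers and the column names `id`, `name`, and `score` are assumptions made for illustration; the original post does not show what a.txt and b.txt contain.

```python
import io
import pandas as pd

# Hypothetical contents standing in for a.txt and b.txt
a_txt = io.StringIO("id,name\n1,Ann\n2,Bob\n3,Cy\n")
b_txt = io.StringIO("id,score\n1,90\n3,75\n")

dataa = pd.read_csv(a_txt)
datab = pd.read_csv(b_txt)

# Name the join key explicitly; without `on`, merge joins on all shared columns
dataab = pd.merge(dataa, datab, on="id", how="left")

# Write to a buffer here; pass 'c.txt' instead to write a real file
out = io.StringIO()
dataab.to_csv(out, index=False)
print(out.getvalue())
```

With `how='left'`, every row of the left table is kept, and rows without a match on the right get missing values in the right-hand columns.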
Pandas SettingWithCopyWarning
fpath = "datas/beijing_tianqi/beijing_tianqi_2018.csv"
df = pd.read_csv(fpath)
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.loc[:,"yWendu"] = df["yWendu"].str.replace("℃","").astype('int32')
# print(df.head())
# 1. Reproduce
condition =…
Original · 2020-10-07 18:13:26 · 100 views · 0 comments
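The preview is cut off before the condition is defined, so the sketch below uses its own. It shows two common ways to avoid chained assignment, which is the usual source of this warning. The sample rows, the March filter, and the column name `wencha` are assumptions. Recent pandas versions with Copy-on-Write may no longer emit the warning, but both patterns remain valid.

```python
import pandas as pd

# Hypothetical sample in the shape of the weather data
df = pd.DataFrame({
    "ymd": ["2018-03-01", "2018-03-02", "2018-04-01"],
    "bWendu": [8, 9, 20],
    "yWendu": [-3, 0, 8],
})
condition = df["ymd"].str.startswith("2018-03")  # assumed filter

# Option 1: take an explicit copy of the subset, then modify the copy
df_march = df[condition].copy()
df_march["wencha"] = df_march["bWendu"] - df_march["yWendu"]

# Option 2: select rows and the target column in a single .loc call on df
df.loc[condition, "wencha"] = df["bWendu"] - df["yWendu"]
print(df)
```

Option 1 leaves `df` untouched. Option 2 writes into `df` itself, and rows that do not match the condition get NaN in the new column.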
Handling missing values in Pandas
studf = pd.read_excel("datas/student_excel/student_excel.xlsx", skiprows=2)
# print(studf)
# Detect null values
# print(studf.isnull())
# print(studf["分数"].isnull())
# print(studf["分数"].notnull())
# print(studf.loc[studf["分数"].notnull(),:])
# Drop columns that are entirely null
studf.dropna(axis=…
Original · 2020-10-07 16:27:13 · 122 views · 0 comments
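A possible end-to-end version of the cleanup the preview starts: drop all-null columns and rows, then fill the remaining gaps. The DataFrame below is a hypothetical stand-in for the Excel sheet, and the English column names are assumptions (the original uses a `分数` column).

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the student Excel sheet
studf = pd.DataFrame({
    "name": ["Ann", None, "Bob", None],
    "subject": ["math", "art", None, None],
    "score": [90.0, np.nan, 75.0, np.nan],
    "empty": [np.nan] * 4,
})

studf = studf.dropna(axis="columns", how="all")  # drop columns that are entirely null
studf = studf.dropna(axis="index", how="all")    # drop rows that are entirely null
studf["score"] = studf["score"].fillna(0)        # treat a missing score as 0
studf["name"] = studf["name"].ffill()            # carry the previous name forward
print(studf)
```

Forward fill suits sheets where a merged cell leaves the name blank on follow-up rows; whether 0 is the right replacement for a missing score depends on the data.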
Pandas statistical functions
fpath = "datas/beijing_tianqi/beijing_tianqi_2018.csv"
df = pd.read_csv(fpath)
# print(df.head(3))
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.loc[:,"yWendu"] = df["yWendu"].str.replace("℃","").astype('int32')
# print(df.head(3…
Original · 2020-10-07 11:38:06 · 249 views · 0 comments
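The preview stops before any statistics are computed. As a rough sketch of what such a walkthrough usually covers, here are a few standard pandas calls on a hypothetical sample; the `tianqi` column and all values are assumptions, not data from the original CSV.

```python
import pandas as pd

# Hypothetical sample; temperatures are already numeric here
df = pd.DataFrame({
    "bWendu": [8, 9, 20, 9],
    "yWendu": [-3, 0, 8, 1],
    "tianqi": ["sunny", "cloudy", "sunny", "sunny"],
})

summary = df.describe()                   # count, mean, std, min, quartiles, max per numeric column
mean_high = df["bWendu"].mean()           # a single aggregate
kinds = df["tianqi"].unique()             # distinct values of a categorical column
counts = df["tianqi"].value_counts()      # frequency of each value
corr = df["bWendu"].corr(df["yWendu"])    # Pearson correlation between two columns

print(summary)
print(mean_high, list(kinds))
print(counts)
print(corr)
```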
How to add new data columns in Pandas
fpath = "datas/beijing_tianqi/beijing_tianqi_2018.csv"
df = pd.read_csv(fpath)
# print(df.head())
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.loc[:,"yWendu"] = df["yWendu"].str.replace("℃","").astype('int32')
# 1
# Add wenchad…
Original · 2020-10-07 11:14:58 · 571 views · 0 comments
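Three common ways to add a column, sketched on hypothetical data: direct assignment, `apply`, and `assign`. The column names `wencha`, `wendu_type`, and `bWendu_f` and the thresholds are assumptions. The cleanup step assigns with `df[col] = ...`, which replaces the column and its dtype; in newer pandas versions, `df.loc[:, col] = ...` may keep the old object dtype instead.

```python
import pandas as pd

# Hypothetical rows in the raw format of the weather file
df = pd.DataFrame({
    "bWendu": ["8℃", "9℃", "35℃"],
    "yWendu": ["-3℃", "0℃", "24℃"],
})
for col in ("bWendu", "yWendu"):
    # Strip the unit and convert to integers
    df[col] = df[col].str.replace("℃", "", regex=False).astype("int32")

# 1. Direct assignment: daily temperature range
df["wencha"] = df["bWendu"] - df["yWendu"]

# 2. apply with a row-wise function (thresholds are illustrative)
def temp_type(row):
    if row["bWendu"] > 33:
        return "hot"
    if row["yWendu"] < -2:
        return "cold"
    return "mild"

df["wendu_type"] = df.apply(temp_type, axis=1)

# 3. assign returns a new DataFrame and leaves df unchanged
df2 = df.assign(bWendu_f=lambda x: x["bWendu"] * 9 / 5 + 32)
print(df2)
```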
5 ways to query data in Pandas
df = pd.read_csv("datas/beijing_tianqi/beijing_tianqi_2018.csv")
print(df.head())
print(df.index)
print('------------------------')
df.set_index('ymd', inplace=True)
print(df.index)
df.loc[:,"bWendu"] = df["bWendu"].str.replace("℃","").astype('int32')
df.lo…
Original · 2020-10-07 10:31:05 · 783 views · 0 comments
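The preview does not list the five methods. One plausible reading, offered as an assumption, is the five kinds of selector that `df.loc` accepts: a single label, a list of labels, a label slice, a boolean condition, and a callable. The sample rows are hypothetical.

```python
import pandas as pd

# Hypothetical rows, indexed by date as in the preview
df = pd.DataFrame({
    "ymd": ["2018-01-01", "2018-01-02", "2018-01-03", "2018-01-04"],
    "bWendu": [3, 2, 2, 0],
    "yWendu": [-6, -5, -5, -8],
}).set_index("ymd")

# 1. Single label for row and column: returns a scalar
one = df.loc["2018-01-03", "bWendu"]

# 2. List of labels: returns a Series (or a DataFrame for several columns)
some = df.loc[["2018-01-01", "2018-01-03"], "bWendu"]

# 3. Label slices: both endpoints are included
span = df.loc["2018-01-02":"2018-01-04", "bWendu":"yWendu"]

# 4. Boolean condition
cold = df.loc[df["yWendu"] < -5, :]

# 5. Callable that receives the DataFrame and returns a mask
picked = df.loc[lambda d: (d["bWendu"] <= 2) & (d["yWendu"] >= -5), :]

print(one, list(some), span.shape)
print(cold)
print(picked)
```

When combining conditions, each one needs its own parentheses, and the operators are `&` and `|` rather than `and` and `or`.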
Pandas data structures: DataFrame and Series
"""dataframe, series
s1 = pd.Series([1,'a',5,2.7])
print(s1)
print(s1.index)
print(s1.values)
s2 = pd.Series([1,'a',5.2,7],index=['d','b','a','c'])
print(s2)
print(s2.index)
print(s2.values)
sdata = {'Ohio':35000,'Texas':72000,'Oregon':16000,'Utah':5000…
Original · 2020-10-07 09:43:47 · 53 views · 0 comments
Reading MySQL, xlsx, and txt data with pandas
"""Read a txt file
fpath = "datas/crazyant/access_pvuv.txt"
pvuv = pd.read_csv(fpath, sep="\t", header=None, names=['pdate','pv','uv'])
print(pvuv)
"""
"""Read from MySQL
import pymysql
conn = pymysql.connect(host='127.0.0.1', user='root',…
Original · 2020-10-07 09:14:29 · 91 views · 0 comments