pandas
SherryGo
这个作者很懒,什么都没留下…
展开
-
python获取ElasticSearch数据
1、导入es的python库from elasticsearch import Elasticsearchfrom elasticsearch.helpers import bulkfrom elasticsearch import helpersimport pandas as pdimport os2、连接eses = Elasticsearch(hosts="http://192.168.194.16:9200/",timeout=30, max_retries=10, r..原创 2021-12-22 15:47:02 · 2468 阅读 · 0 评论 -
pandas处理字符串
import pandas as pddf = pd.DataFrame({'name':["test_1", "test_2"]})字符串处理之split问题1:将name列拆为两列df['name'].str.split('_', expand=True)这样返回一个DataFrame结构,如果未设置expand,返回时一个Series结构。字符串处理之extr...原创 2018-12-03 11:38:27 · 3099 阅读 · 0 评论 -
pandas选取元素loc,iloc
pandas可以通过loc和iloc来筛选元素,ix不推荐使用 data = {'AAA':[4,5,6,7], 'BBB':[10,20,30,40], 'CCC':[100,50,-30, -50]}df = pd.DataFrame(data=data, index=['foo','bar','boo','kar']);dfpandas.ilocPurely integer-...原创 2018-08-06 16:58:58 · 2000 阅读 · 0 评论 -
Pandas groupby保存到dataFrame
直接使用reset_index方法删除原来索引 test_df = pd.DataFrame({'a':[1,1,3,2],'b':[1,4,6,9],'c':[1,4,8,12]})g_df =test_df['c'].groupby([test_df['a'], test_df['b']]).sum()c_df = pd.DataFrame(g_df)c_df.reset_inde...原创 2018-08-06 11:44:57 · 19721 阅读 · 1 评论 -
pandas多索引使用
import numpy as npimport pandas as pdcols = pd.MultiIndex.from_tuples([(x,y) for x in ['A','B','C','D'] for y in ['O','I']])df_test_2 = pd.DataFrame(np.random.randn(2,8), index=['n','m'], columns...原创 2018-08-06 11:33:47 · 2431 阅读 · 0 评论 -
pandas使用apply多列生成一列数据
import pandas as pddef my_min(a, b): return min(abs(a),abs(b))s = pd.Series([10.0247,10.0470, 10.0647,10.0761,15.0800,10.0761,10.0647,10.0470,10.0247,10.0,9.9753,9.9530,9.9353,9.9239,18.92,9.9239...原创 2018-06-20 16:25:55 · 14436 阅读 · 0 评论 -
pandas读写excel
导入pandas数据包import pandas as pd读取excelpd.read_excel(u'E:\\test.xlsx')DataFrame写入excelwriter = pd.ExcelWriter('output.xlsx')df1.to_excel(writer,'Sheet1')df2.to_excel(writer,'Sheet2')writer.save()...原创 2018-06-05 11:24:28 · 713 阅读 · 0 评论 -
python datetime返回特定时间-which-date
import datetimedef which_date(start_date,time): """ This function takes as input a string depicting a date in YYYY/mm/dd format and a string stating a time period in the form of "X day(s)" or "Y ...原创 2018-05-08 14:44:34 · 1510 阅读 · 0 评论 -
pandas映射,replace和map
replacedf['消费性别倾向'] = df['消费性别倾向'].replace(2,'女')df['消费性别倾向'] = df['消费性别倾向'].replace('2','女')df['消费性别倾向'] = df['消费性别倾向'].replace(['1','2','3','4','5'],'女')df['消费性别倾向'] = df['消费性别倾向'].replace(['6...原创 2018-05-08 13:21:06 · 7609 阅读 · 0 评论 -
pandas重复行处理
1、检测重复行import pandas as pddframe = pd.DataFrame({'color':['white', 'white', 'red','red', 'white'],'value':[2,1,3,3,2]})#检测重复行dframe.duplicated()2、删除重复行#删除重复行,该函数返回的是删除重复行后的DataFrame对象dframe.drop_d...原创 2018-05-08 11:39:15 · 21317 阅读 · 0 评论