python
owenbb
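A simple groupby aggregation with Pandas
Pandas groupby works like the SQL query: select city, max(temperature) from city_weather group by city. groupby first splits the data into groups, then applies an aggregation or transformation function to each group.

    import pandas as pd
    import numpy as np
    %matplotlib inline
    df = pd.DataFr...

Original · 2019-11-02 11:27:27 · 1953 views · 1 comment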
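
The SQL comparison above can be sketched as follows. This is a minimal illustration, not the post's own code: the `city_weather` DataFrame and its values are made up for the example.

```python
import pandas as pd

# Hypothetical sample data standing in for a city_weather table.
city_weather = pd.DataFrame({
    "city": ["Beijing", "Beijing", "Shanghai", "Shanghai"],
    "temperature": [30, 35, 28, 33],
})

# Rough equivalent of:
#   select city, max(temperature) from city_weather group by city
max_temp = city_weather.groupby("city")["temperature"].max()
print(max_temp)
```

The result is a Series indexed by city; calling `.reset_index()` on it gives a two-column DataFrame closer to the SQL output.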
Fixing Python errors
1. UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbf in position 0: invalid start byte

    data = pd.read_csv('./data/data.csv', encoding='gbk')
    data

Original · 2019-08-06 09:36:01 · 1262 views · 0 comments
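
When the encoding of a file is not known in advance, one option is to try a short list of candidates in order. The sketch below assumes the file is either UTF-8 or GBK; the helper name `read_csv_with_fallback` and the sample file contents are hypothetical.

```python
import os
import tempfile

import pandas as pd

# Write a small GBK-encoded CSV; the contents are made up for illustration.
tmp_dir = tempfile.mkdtemp()
csv_path = os.path.join(tmp_dir, "data.csv")
with open(csv_path, "w", encoding="gbk") as f:
    f.write("城市,温度\n北京,30\n")


def read_csv_with_fallback(path, encodings=("utf-8", "gbk")):
    """Try each encoding in turn and return the first DataFrame that parses."""
    last_error = None
    for enc in encodings:
        try:
            return pd.read_csv(path, encoding=enc)
        except UnicodeDecodeError as err:
            last_error = err
    raise last_error


data = read_csv_with_fallback(csv_path)
print(data)
```

If the encoding is known, passing it directly, as in the excerpt above, is simpler.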
Reading and writing multiple Excel sheets with pandas

    sheet1 = pd.read_excel(path, sheet_name='sheet1')
    sheet2 = pd.read_excel(path, sheet_name='sheet2')
    sheet3 = pd.read_excel(path, sheet_name='sheet3')
    wite_path = './data/final/'+i
    write = pd.ExcelWriter...

Original · 2019-07-23 14:57:24 · 1842 views · 0 comments
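
A round trip through a multi-sheet workbook might look like the sketch below. It assumes an Excel engine such as openpyxl is installed; the file name, sheet contents, and variable names are made up for the example.

```python
import os
import tempfile

import pandas as pd

tmp_dir = tempfile.mkdtemp()
xlsx_path = os.path.join(tmp_dir, "demo.xlsx")  # hypothetical output file

# Hypothetical frames, one per sheet.
frames = {
    "sheet1": pd.DataFrame({"a": [1, 2]}),
    "sheet2": pd.DataFrame({"b": [3, 4]}),
}

# The context manager saves and closes the workbook on exit.
with pd.ExcelWriter(xlsx_path) as writer:
    for name, frame in frames.items():
        frame.to_excel(writer, sheet_name=name, index=False)

# sheet_name=None loads every sheet into a dict keyed by sheet name.
all_sheets = pd.read_excel(xlsx_path, sheet_name=None)
print(list(all_sheets))
```

Reading with `sheet_name=None` avoids one `read_excel` call per sheet when all sheets are needed.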
Saving a classification_report as a DataFrame for plotting

    import pandas as pd
    import numpy as np
    def classifaction_report_csv(report):
        report_data = []
        lines = report.split('\n')
        for line in lines[2:-3]:
            row = {}
            row_data = line....

Original · 2019-06-12 21:14:17 · 1303 views · 0 comments
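
The excerpt parses the text report line by line. An alternative, which is a different technique from the post's, is to ask scikit-learn for a dictionary via `output_dict=True` and build the DataFrame from that. The labels below are made up for the example.

```python
import pandas as pd
from sklearn.metrics import classification_report

# Hypothetical true and predicted labels.
y_true = [0, 0, 1, 1, 1, 2]
y_pred = [0, 1, 1, 1, 0, 2]

# output_dict=True returns nested dicts instead of a formatted string.
report = classification_report(y_true, y_pred, output_dict=True)

# "accuracy" is a single float rather than a per-class dict, so set it aside.
accuracy = report.pop("accuracy", None)

# One row per class (plus the averaged rows), one column per metric.
report_df = pd.DataFrame(report).transpose()
print(report_df)
```

From here, something like `report_df[["precision", "recall", "f1-score"]].plot(kind="bar")` would draw the per-class metrics.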
Reading a very large CSV in chunks with pandas and filtering by value

    path = '../data/'
    reader = pd.read_csv(path+'train_20190518.csv', iterator=True, names=['label','uid','adid','operTime','siteid','slotid','contentid','netType'])
    loop = True
    chu...

Original · 2019-05-30 20:00:42 · 777 views · 0 comments
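
The excerpt is cut off before its loop, so the sketch below is not a reconstruction of it. It shows one common way to filter while reading in chunks, using `chunksize` instead of `iterator=True`; the in-memory CSV, the reduced column list, and the filter condition are assumptions made for the example.

```python
import io

import pandas as pd

# Made-up rows; only the first three column names from the excerpt are used.
csv_text = "1,u1,a1\n0,u2,a2\n1,u3,a3\n0,u4,a4\n1,u5,a5\n"

# chunksize makes read_csv yield DataFrames of at most that many rows.
chunks = pd.read_csv(
    io.StringIO(csv_text),
    names=["label", "uid", "adid"],
    chunksize=2,
)

# Filter each chunk as it arrives so only matching rows are kept in memory.
kept = [chunk[chunk["label"] == 1] for chunk in chunks]
filtered = pd.concat(kept, ignore_index=True)
print(filtered)
```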
Notes on Effective Python
6. Do not specify start, end, and stride together in a single slice: list[start:end:stride]. 7. Use list comprehensions instead of map and filter.
Original · 2019-05-14 12:37:47 · 355 views · 0 comments
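
Both items can be illustrated briefly. The list `values` and the even-squares task are made up for the example.

```python
values = list(range(10))

# Item 6: rather than values[2:8:2], apply the stride and the slice separately.
strided = values[::2]   # [0, 2, 4, 6, 8]
result = strided[1:4]   # [2, 4, 6]

# Item 7: a list comprehension in place of map and filter.
with_map_filter = list(map(lambda x: x ** 2, filter(lambda x: x % 2 == 0, values)))
with_comprehension = [x ** 2 for x in values if x % 2 == 0]

print(result, with_comprehension)
```

Splitting the slice creates an intermediate list, so the order of the two steps should be chosen to keep that list small.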
Operations on Python types
1. list
List concatenation: list = list1 + list2

    sen_list = []
    for i in tqdm(data.index):
        sen_list += data.coreEmotions_list[i]
    pd.value_counts(sen_list)

List intersection
2. dict
3. pandas
datafram...
Original · 2019-04-01 19:23:15 · 247 views · 0 comments
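
The list operations named above can be sketched with the standard library alone. The sample lists are made up, and `collections.Counter` is used here in place of the excerpt's `pd.value_counts`.

```python
from collections import Counter

# Hypothetical sample lists.
list1 = ["happy", "sad", "happy"]
list2 = ["sad", "angry"]

# Concatenation builds a new list and leaves both inputs unchanged.
combined = list1 + list2

# Count how often each value occurs.
counts = Counter(combined)

# Intersection: convert to sets, which drops duplicates and ordering.
common = sorted(set(list1) & set(list2))

print(combined, counts, common)
```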
Building a small dataset for testing

    import os
    filenames = os.listdir('./image_10000/')
    filenames
    txt = []
    for x in filenames:
        x = x.split('jpg')[0]+'txt'
        txt.append(x)
    txt
    import shutil
    def cp_img(file,source,new_folder): ...

Original · 2019-03-09 14:11:31 · 209 views · 0 comments
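
The body of `cp_img` is elided in the excerpt, so the following is only a sketch of the general idea: copy a few images together with their same-named `.txt` files into a new folder. The helper `copy_subset`, the temporary folders, and the assumption that every `.jpg` has a matching `.txt` are all hypothetical.

```python
import os
import shutil
import tempfile

# Build a throwaway source folder with empty placeholder files.
source = tempfile.mkdtemp()
new_folder = tempfile.mkdtemp()
for name in ["a.jpg", "a.txt", "b.jpg", "b.txt", "c.jpg", "c.txt"]:
    with open(os.path.join(source, name), "w") as f:
        f.write("")


def copy_subset(source, new_folder, limit):
    """Copy the first `limit` .jpg files and their same-named .txt files."""
    images = sorted(n for n in os.listdir(source) if n.endswith(".jpg"))[:limit]
    for image in images:
        # splitext avoids surprises when "jpg" also appears inside the name.
        label = os.path.splitext(image)[0] + ".txt"
        shutil.copy(os.path.join(source, image), new_folder)
        shutil.copy(os.path.join(source, label), new_folder)
    return images


copied = copy_subset(source, new_folder, limit=2)
print(sorted(os.listdir(new_folder)))
```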
Python problems I ran into (2)
1. Adjusting the threshold

    prediction = model.predict(X_test)
    prediction[prediction>=0.5] = 1
    prediction[prediction<0.5] = 0

2. Fitting a normal distribution

    (mu, sigma) = norm.fit(train['SalePrice'])
    print( '\n mu = {:.2f} and sigma...

Original · 2018-10-26 14:34:28 · 342 views · 0 comments
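
Both steps can be illustrated with NumPy alone. The scores and prices below are made up. The second part computes the sample mean and the population standard deviation directly, which are the maximum-likelihood estimates that a normal fit such as `scipy.stats.norm.fit` is expected to return; treat that equivalence as an assumption of this sketch.

```python
import numpy as np

# 1. Thresholding: turn scores into 0/1 labels in one vectorized step.
prediction = np.array([0.1, 0.5, 0.72, 0.49])  # hypothetical model scores
threshold = 0.5
labels = (prediction >= threshold).astype(int)

# 2. Normal fit by hand: mean and population standard deviation (ddof=0).
prices = np.array([100.0, 200.0, 300.0])  # hypothetical prices
mu = prices.mean()
sigma = prices.std()

print(labels)
print("mu = {:.2f} and sigma = {:.2f}".format(mu, sigma))
```

Building `labels` as a new array also leaves the original scores intact, unlike the in-place assignment in the excerpt.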
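Python problems I ran into
1. Reset the index after dropping rows

    data_y = data_y.drop([0,1]).reset_index(drop=True)
    data_y

2. Keep only the rows whose index is divisible by 4; a for loop is too slow

    ### for loop
    for i in data_y.index:
        if i%4!=0:
            data_y = data_y.drop([i])
    data_y = data_y.rese...

Original · 2018-04-12 11:44:25 · 1288 views · 0 comments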
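
The excerpt is cut off before any faster alternative appears. One vectorized option, offered here as a sketch and not as the post's solution, is a boolean mask on the index. The sample frame is made up.

```python
import pandas as pd

data_y = pd.DataFrame({"value": range(10)})  # hypothetical data

# Drop the first two rows, then renumber the index from 0.
data_y = data_y.drop([0, 1]).reset_index(drop=True)

# Keep rows whose index is divisible by 4, without a Python-level loop.
kept = data_y[data_y.index % 4 == 0].reset_index(drop=True)
print(kept)
```

With a default integer index, `data_y.iloc[::4]` selects the same rows by position.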
Plotting in Python
1. Add labels and a legend to a plot

    plt.plot(datat.index,datat)
    plt.xlabel('index', fontsize=15)
    plt.legend(['t_bottom','t_top'],loc = 'upper right',fontsize = 10)
    plt.show()

2. Place the legend at the far right

    plt.legend(bbox_to_anchor=(1.05...

Original · 2018-07-10 16:34:23 · 4822 views · 0 comments
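
A self-contained version of the two steps might look like the sketch below. The data values are made up, and the anchor `(1.05, 1)` with `loc="upper left"` is an assumption chosen for illustration, since the excerpt's own arguments are cut off. The Agg backend is selected so the example runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, no window needed
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical temperature readings.
datat = pd.DataFrame({
    "t_bottom": [20.0, 21.5, 23.0],
    "t_top": [25.0, 26.0, 27.5],
})

fig, ax = plt.subplots()
ax.plot(datat.index, datat.to_numpy())  # one line per column
ax.set_xlabel("index", fontsize=15)

# bbox_to_anchor positions the legend in axes coordinates; an x value above 1
# puts it outside the plot area, to the right.
legend = ax.legend(
    ["t_bottom", "t_top"],
    bbox_to_anchor=(1.05, 1),
    loc="upper left",
    fontsize=10,
)
fig.tight_layout()
```

Here `loc` names the corner of the legend box that is pinned to the anchor point, so `"upper left"` keeps the box entirely to the right of the axes.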
Reading all CSV files in a folder with Python

    ### Read all CSV files in a folder
    import os
    # List all files in the folder
    os.listdir('../data/simulation_data_generation/pdata2_1000')

Output: '1+0.1+139+0.6.csv', '1+0.2+290+0.6.csv', '1+0.5+411+0.8.csv', '1+0.9+62+0.5.csv', '10+0.4+4...
Original · 2018-03-11 16:56:00 · 24574 views · 4 comments
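
The rest of the post is cut off, so the following is a sketch of one way to go from a folder listing to a single DataFrame. It uses `glob` to match only `.csv` files; the temporary folder, file names, and contents are made up for the example.

```python
import glob
import os
import tempfile

import pandas as pd

# Build a throwaway folder with two CSV files and one unrelated file.
folder = tempfile.mkdtemp()
pd.DataFrame({"x": [1, 2]}).to_csv(os.path.join(folder, "a.csv"), index=False)
pd.DataFrame({"x": [3]}).to_csv(os.path.join(folder, "b.csv"), index=False)
with open(os.path.join(folder, "notes.txt"), "w") as f:
    f.write("not a csv")

# Match only .csv files; sorting makes the read order deterministic.
csv_paths = sorted(glob.glob(os.path.join(folder, "*.csv")))

# Read each file and stack the rows into a single DataFrame.
frames = [pd.read_csv(p) for p in csv_paths]
combined = pd.concat(frames, ignore_index=True)
print(combined)
```

Unlike `os.listdir`, `glob.glob` returns paths that already include the folder, so they can be passed to `read_csv` directly.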