python
伴生伴熟
知行合一
Accessing a URL with Python 3
Fetch a page with requests:
    # -*- coding:utf-8 -*-
    import requests
    r = requests.get('https://www.baidu.com/')
    print(r.content)
Original · 2020-10-15 10:16:34 · 1283 views · 0 comments
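DataFrame window function rolling()
Data work often needs to sum or average the most recent few values. rolling() makes this simple and computationally efficient. Signature from the official documentation:
    DataFrame.rolling(window, min_periods=None, center=False, win_type=None, on=None, axis=0, closed=None)
Parameters:
window: size of the window, an int, i.e. how many preceding values to include (think of it as taking the most recent...
Original · 2020-03-09 17:59:42 · 27929 views · 1 comment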
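A minimal sketch of the rolling sum and rolling mean mentioned above, assuming a small made-up Series. The variable names are illustrative; only `window` and `min_periods` from the signature are used.

```python
import pandas as pd

# Illustrative data, not taken from the original post.
s = pd.Series([1, 2, 3, 4, 5])

# Sum of the current value and the two before it.
# The first two results are NaN because the window is not yet full.
rolling_sum = s.rolling(window=3).sum()

# min_periods=1 lets a partial window produce a result.
rolling_mean = s.rolling(window=3, min_periods=1).mean()

print(rolling_sum)
print(rolling_mean)
```

Lowering `min_periods` trades strictness for coverage: early rows get a value, but it is computed from fewer observations.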
DataFrame row/column shift function (shift)
Shifts the rows or columns of a DataFrame, or takes the value N rows away as a new column. The code:
    # -*- coding:utf-8 -*-
    import pandas as pd
    import numpy as np
    # create the DataFrame
    df = pd.DataFrame(np.random.randint(5, size=(3, 2)), index=pd.date_range('1/1/2020', periods=3), col...
Original · 2020-03-05 22:11:41 · 8909 views · 0 comments
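A short sketch of taking the value N rows away as a new column, assuming a made-up `value` column. The column names `prev_value` and `next_value` are hypothetical.

```python
import pandas as pd

# Illustrative frame with a daily index, similar in shape to the excerpt.
df = pd.DataFrame({'value': [10, 20, 30]},
                  index=pd.date_range('2020-01-01', periods=3))

# shift(1) moves values down one row: each row sees the previous row's value.
df['prev_value'] = df['value'].shift(1)

# shift(-1) moves values up one row: each row sees the next row's value.
df['next_value'] = df['value'].shift(-1)

print(df)
```

Rows that have no neighbour in the requested direction are filled with NaN, so the shifted columns become floating point.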
Python DataFrame astype: converting column types
Use astype to convert the type of a DataFrame column:
    # -*- coding: UTF-8 -*-
    import pandas as pd
    df = pd.DataFrame([{'col1':'a', 'col2':'1'}, {'col1':'b', 'col2':'2'}])
    print df.dtypes
    df['col2'] = df['col2'].astype('int')
    print '-...
Original · 2016-08-17 13:59:11 · 137978 views · 0 comments
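The same conversion written for Python 3, as a sketch. The explicit `'int64'` target is a choice made here so the result does not depend on the platform's default integer width.

```python
import pandas as pd

df = pd.DataFrame([{'col1': 'a', 'col2': '1'}, {'col1': 'b', 'col2': '2'}])
print(df.dtypes)  # col2 holds strings at this point

# Convert the string column to 64-bit integers.
df['col2'] = df['col2'].astype('int64')
print(df.dtypes)
```

`astype` raises a ValueError if any value cannot be parsed as an integer, so it suits columns that are already known to be clean.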
ValueError: time data '5:45:56 AM' does not match format '%I:%M:%S %p'
While using PyQt5 from the latest Anaconda 4.2.0, I recently hit the following error:
    Traceback (most recent call last):
      File "/opt/package/test1.py", line 81, in <module>
      File "/opt/package/test1.py", line 41, in __init__
        print 6666, test(time_s...
Original · 2016-10-14 17:46:17 · 5135 views · 0 comments
Python DataFrame: printing each row on a single line
When a DataFrame has many columns, its printed output wraps onto several lines, which makes the data hard to read. For example:
    # -*- coding: utf-8 -*-
    import numpy as np
    import pandas as pd
    df = pd.DataFrame(np.random.randn(1, 20))
    print df
Output: 0 1...
Original · 2016-10-09 17:09:59 · 29535 views · 2 comments
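One possible way to avoid the wrapping is to widen pandas' display settings. The excerpt above is cut off before its own solution, so the options chosen here (`display.width`, `display.max_columns`) and the width of 1000 are assumptions, not necessarily what the post used.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(np.random.randn(1, 20))

# Temporarily allow very wide output and show every column.
with pd.option_context('display.width', 1000, 'display.max_columns', None):
    text = repr(df)

print(text)
```

`pd.set_option` with the same keys applies the settings for the whole session instead of a single block.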
Python pandas Series
A simple Series example:
    # -*- coding:utf-8 -*-
    import pandas as pd
    # build a Series from a list
    data = [1, 2, 3, 4]
    print pd.Series(data)
    print pd.Series(data=data, index=['a', 'b', 'c', 'd'])
    print pd.Series(data=data, index=['...
Original · 2016-09-19 10:09:27 · 529 views · 0 comments
redis.exceptions.ResponseError: MOVED 6918
While connecting to redis from Python, I recently got this puzzling error:
    Traceback (most recent call last):
      File "D:/chen_test/redist_test/redis_copy2.py", line 10, in <module>
        print r.set('test', 111)
      File "D:\Anaconda\lib\site-packag...
Original · 2016-10-19 11:35:25 · 12675 views · 4 comments
Python: converting the type of every element in a list
Convert the type of every element in a list.
Method 1:
    new = list()
    a = ['1', '2', '3']
    for x in a:
        new.append(int(x))
    print(new)
Method 2:
    a = ['1', '2', '3']
    b = [int(x) for x in a]
    print(b)
Original · 2017-03-30 16:01:27 · 40800 views · 6 comments
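numpy functions: using arange() and reshape()
    import numpy as np
One-dimensional array with the default start:
    print(np.arange(5))
One-dimensional array with a custom start:
    print(np.arange(1, 5))
One-dimensional array with a custom start and step:
    print(np.arange(2, 10, 2))
Build a two-dimensional array from a one-dimensional one:
    data = np.arange(8)
    print(data.reshape((2, 4)))
Build a three-dimensional array from a one-dimensional one:
    data = np.arange(60...
Original · 2017-11-16 10:42:00 · 38326 views · 2 comments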
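A compact sketch of the reshape rule behind these examples: the product of the target dimensions has to equal the number of elements. The 24-element array and the shapes below are chosen here for illustration, since the excerpt's three-dimensional example is truncated.

```python
import numpy as np

flat = np.arange(24)                    # 24 elements: 0 .. 23

matrix = np.arange(8).reshape((2, 4))   # 2 * 4 == 8
cube = flat.reshape((2, 3, 4))          # 2 * 3 * 4 == 24

# -1 asks numpy to infer that dimension from the element count.
auto = flat.reshape((4, -1))            # becomes (4, 6)

print(matrix)
print(cube.shape, auto.shape)
```

A mismatched shape such as `flat.reshape((5, 5))` raises a ValueError rather than padding or truncating.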
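Python: recursing over parent/child node data and printing each node's path and level
Recursing over parent/child node data. Data layout:
    id  name      p_id
    1   School A  0
    2   Major 1   1
    3   Major 2   1
    4   Major 3   1
    5   Major 4   1
    6   Class 11  2
    7   Class 12  2
    8   Class 21  3
Goal: print the path and level of every node.
Code: ...
Reposted · 2018-03-15 20:39:51 · 4279 views · 0 comments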
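The post's own code is elided, so the following is only a sketch of one way to reach the stated goal with the sample rows above. The function name `node_path` and the `/` separator are hypothetical, and names are shown in translation.

```python
# (id, name, p_id) rows from the sample data; p_id 0 means "no parent".
rows = [
    (1, 'School A', 0), (2, 'Major 1', 1), (3, 'Major 2', 1),
    (4, 'Major 3', 1), (5, 'Major 4', 1), (6, 'Class 11', 2),
    (7, 'Class 12', 2), (8, 'Class 21', 3),
]
nodes = {node_id: (name, parent_id) for node_id, name, parent_id in rows}


def node_path(node_id):
    """Return the list of names from the root down to node_id."""
    name, parent_id = nodes[node_id]
    if parent_id not in nodes:      # reached a root node
        return [name]
    return node_path(parent_id) + [name]


paths = {node_id: node_path(node_id) for node_id in nodes}
for node_id, path in paths.items():
    # The level is simply the length of the path.
    print(node_id, '/'.join(path), 'level', len(path))
```

This assumes the data contains no cycles; a cyclic parent reference would recurse until Python's recursion limit is hit.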
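Python DataFrame: recursively finding all parent nodes of every node
Storage format of the data:
    pid   id
    None  a
    a     a1
    a     a2
    a1    a11
    a11   a111
    a     a12
Goal: get all pids (ancestors) of every id. The code:
    import pandas as pd
    df = pd.DataFrame( { 'id': ['a', 'a1',...
Original · 2018-03-27 17:47:11 · 6657 views · 1 comment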
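A sketch of collecting every ancestor for each id, built from the sample table above. The excerpt's code is truncated, so the helper `ancestors` and the result column `all_pids` are hypothetical names and this is not necessarily the post's approach.

```python
import pandas as pd

df = pd.DataFrame({
    'pid': [None, 'a', 'a', 'a1', 'a11', 'a'],
    'id': ['a', 'a1', 'a2', 'a11', 'a111', 'a12'],
})

# Map each id to its direct parent.
parent_of = dict(zip(df['id'], df['pid']))


def ancestors(node_id):
    """Return all parents of node_id, nearest first."""
    parent = parent_of.get(node_id)
    if pd.isna(parent):             # None or NaN marks a root
        return []
    return [parent] + ancestors(parent)


df['all_pids'] = df['id'].apply(ancestors)
print(df)
```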
Python: recursively parsing nested JSON
Sample JSON data:
    {"node":"a","subNodes":[{"node":"a1","subNodes":[{"node":"a11","subNodes":[{"node":"a111","subNodes":[]},{"node":"a112","subNodes":[]}]},{"node":"a12","subNodes":[]}]},{"node":"a2","subNodes"...
Original · 2019-08-16 17:54:31 · 12401 views · 0 comments
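A minimal sketch of walking a structure with the same `node` / `subNodes` keys. The sample JSON above is truncated, so the document below is a smaller stand-in written for this example, and the generator name `walk` is hypothetical.

```python
import json

# Smaller stand-in document using the same keys as the sample.
raw = ('{"node":"a","subNodes":['
       '{"node":"a1","subNodes":[{"node":"a11","subNodes":[]}]},'
       '{"node":"a2","subNodes":[]}]}')


def walk(item, parents=()):
    """Yield the path (as a tuple of node names) of item and all descendants."""
    path = parents + (item['node'],)
    yield path
    for child in item['subNodes']:
        yield from walk(child, path)


paths = ['/'.join(p) for p in walk(json.loads(raw))]
print(paths)
```

Using a generator keeps the traversal lazy, so callers can stop early or stream the paths without building the whole list.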
Python DataFrame where
Row filtering on a DataFrame:
    # -*- coding: UTF-8 -*-
    import pandas as pd
    df = pd.DataFrame({'col1': ['a', 'b', 'c', 'a'], 'col2': [1, 2, 3, 4]})
    # select the rows whose col1 value is 'a' or 'b'
    lists1 = ['a', 'b']
    print df[df['col1'].isin(list...
Original · 2016-08-12 11:04:06 · 5979 views · 0 comments
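The `isin` filter from the excerpt in Python 3 form, together with its negation. The variable names `wanted`, `matched` and `others` are illustrative.

```python
import pandas as pd

df = pd.DataFrame({'col1': ['a', 'b', 'c', 'a'], 'col2': [1, 2, 3, 4]})
wanted = ['a', 'b']

# Boolean mask: True where col1 is one of the wanted values.
matched = df[df['col1'].isin(wanted)]

# ~ inverts the mask, similar to SQL's NOT IN.
others = df[~df['col1'].isin(wanted)]

print(matched)
print(others)
```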
Python DataFrame union all
Implement something like SQL's UNION ALL with DataFrames, combining several DataFrames into one.
Method 1:
    # -*- coding: UTF-8 -*-
    import pandas as pd
    df1 = pd.DataFrame([{'col1':'a', 'col2':1}])
    df2 = pd.DataFrame([{'col1':'b', 'col2':2}])
    df3 = pd.D...
Original · 2016-08-12 10:35:05 · 9497 views · 0 comments
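One way to get UNION ALL behaviour in current pandas is `pd.concat`. The excerpt's own methods are cut off, so this is a sketch of the general technique rather than a reconstruction of the post; the third frame's values are made up.

```python
import pandas as pd

df1 = pd.DataFrame([{'col1': 'a', 'col2': 1}])
df2 = pd.DataFrame([{'col1': 'b', 'col2': 2}])
df3 = pd.DataFrame([{'col1': 'c', 'col2': 3}])  # illustrative values

# Stack the frames vertically; ignore_index renumbers the rows 0..n-1.
combined = pd.concat([df1, df2, df3], ignore_index=True)
print(combined)
```

Like UNION ALL, `concat` keeps duplicate rows; calling `drop_duplicates()` on the result gives the plain UNION equivalent.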
Python DataFrame join
Implement SQL's inner join, left join and right join with DataFrames. First prepare two tables, df1 and df2:
    import pandas as pd
    df1 = pd.DataFrame([{'col1':'a', 'co12':1}, {'col1':'b', 'co12':2}])
    df2 = pd.DataFrame([{'col1':'a', 'co13':...
Original · 2016-08-12 15:56:05 · 12945 views · 2 comments
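A sketch of the three joins using `pd.merge` and its `how` parameter. The frames below are stand-ins with made-up values and the column names `col2` / `col3`, since the excerpt's second table is truncated.

```python
import pandas as pd

df1 = pd.DataFrame({'col1': ['a', 'b'], 'col2': [1, 2]})
df2 = pd.DataFrame({'col1': ['a', 'c'], 'col3': [11, 33]})

inner = pd.merge(df1, df2, on='col1', how='inner')   # keys in both tables
left = pd.merge(df1, df2, on='col1', how='left')     # every key of df1
right = pd.merge(df1, df2, on='col1', how='right')   # every key of df2

print(inner)
print(left)
print(right)
```

Keys without a match on the other side get NaN in the columns that come from that side.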
Python: current time, date formatting, and converting between strings and dates
python + date + time + datetime + utc
Original · 2016-06-17 09:16:35 · 15237 views · 1 comment
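The post body is not included in this listing, so the following is only a sketch of the operations named in the title, using the standard `datetime` module. The format string and the sample timestamp are chosen here for illustration.

```python
from datetime import datetime, timezone

fmt = '%Y-%m-%d %H:%M:%S'

now_local = datetime.now()               # current local time (naive)
now_utc = datetime.now(timezone.utc)     # current UTC time (timezone-aware)

# datetime -> string
text = now_local.strftime(fmt)

# string -> datetime
parsed = datetime.strptime('2016-06-17 09:16:35', fmt)

print(text)
print(parsed, now_utc.isoformat())
```

`strptime` raises a ValueError when the string does not match the format exactly, which makes it a cheap validity check as well.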
A custom round in Python
    def user_round(number, ndigits=4):
        # format to ndigits decimal places, then convert back to float
        tmp = "%."+str(ndigits)+"f"
        return float(tmp %number)
    print user_round(1.001401)
    print user_round(1.001502)
Original · 2016-07-01 16:16:18 · 656 views · 0 comments
Python: using *args and **kwargs
    # -*- coding:utf-8 -*-
    def test(*args, **kwargs):
        print 'args = ', args
        print 'kwargs = ', kwargs
        print '----end----'
    if __name__ == '__main__':
        test(1, '2', 3)
        test(a=1, b='2', c=3)
Original · 2016-07-13 17:59:04 · 611 views · 0 comments
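Installing mysqldb for anaconda on Linux
CentOS ships its own Python, so yum only installs mysqldb into the system's default Python environment. Changing the Python path that yum uses makes yum itself fail, and building from source also produced all sorts of errors. I eventually found a simple way to install it, recorded here.
Install with the conda command bundled with anaconda:
    /opt/anaconda/bin/conda install mysql-python
Note: when I ran the command above I once hit the following error: -bash...
Original · 2016-06-21 17:34:55 · 7723 views · 0 comments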
Python: building a nested dictionary
Build the matching nested dictionary from a delimiter-separated key and a value.
Method 1:
    k1, v1 = 'a.b.c', 1
    def nested_dict(k, v):
        key_list = k.split('.')
        key_list.reverse()
        for i in key_list:
            result = dict()
            result[i] = v...
Original · 2016-07-20 10:14:40 · 11479 views · 1 comment
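A sketch of the same idea, wrapping the value from the innermost key outwards. The excerpt's function is cut off, so `build_nested` and its `sep` parameter are hypothetical names and this is not a reconstruction of the original.

```python
def build_nested(dotted_key, value, sep='.'):
    """Turn ('a.b.c', 1) into {'a': {'b': {'c': 1}}}."""
    result = value
    # Start from the last key and wrap the running result one level at a time.
    for key in reversed(dotted_key.split(sep)):
        result = {key: result}
    return result


nested = build_nested('a.b.c', 1)
print(nested)
```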
SyntaxError: Non-ASCII character '\xe4' in file
Failing code:
    a = '你好'
    print a
Error message:
    SyntaxError: Non-ASCII character '\xe4' in file test.py on line 2, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details
Cause: a Python encoding problem, because Python's default encoding...
Original · 2016-07-20 15:19:37 · 1046 views · 0 comments
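Python reflection (introspection)
Reflection (introspection) in simple terms: get an instance of a class from the class name; get a method from the method name and call it.
A simple example:
    # -*- coding: utf-8 -*-
    class User(object):
        def __init__(self):
            self.name = 'abc'
            self.age = 18
        def introduce(se...
Original · 2016-07-20 17:09:50 · 577 views · 0 comments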
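A sketch of both ideas: looking a class up by name and calling a method by name with `getattr`. The body of `introduce` and the `registry` dictionary are assumptions made for this example, since the excerpt is cut off inside the method definition.

```python
class User(object):
    def __init__(self):
        self.name = 'abc'
        self.age = 18

    def introduce(self):
        # Hypothetical body; the original method is truncated in the excerpt.
        return 'My name is %s, age %d' % (self.name, self.age)


# Explicit name -> class mapping, so the lookup does not depend on
# where this code happens to be executed.
registry = {'User': User}

cls = registry['User']                  # class from its name
user = cls()                            # instance of that class
method = getattr(user, 'introduce')     # bound method from its name
message = method()
print(message)
```

`getattr(obj, name, default)` accepts a third argument, which avoids an AttributeError when the name may be missing.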
Python: the difference between range and xrange
    # -*- coding:utf-8 -*-
    a = range(0, 10)
    print type(a), a
    b = xrange(0, 10)
    print type(b), b
Output:
    <type 'list'> [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
    <type 'xrange'> xrange(10)
The output shows that range returns a list, while xrange returns...
Original · 2016-08-02 10:16:44 · 473 views · 0 comments
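The comparison above is specific to Python 2. In Python 3, `xrange` no longer exists and `range` itself is the lazy object, as this small sketch shows; the size comparison is only illustrative.

```python
import sys

lazy = range(0, 10)       # a lazy sequence, not a list
as_list = list(lazy)      # materialise it explicitly when a list is needed

print(type(lazy).__name__, lazy)
print(as_list)

# A range stores only start, stop and step, so its size does not grow
# with the number of values it represents.
small = sys.getsizeof(range(10))
large = sys.getsizeof(range(10 ** 6))
print(small, large)
```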
Python multithreading (threading)
Outline of the program:
1. Define two functions, a and b.
2. Within the same main process, run a every 2 seconds and b every 5 seconds.
    # -*- coding:utf-8 -*-
    import time
    import threading
    def a(param):
        for i in range(2):
            print "This is func a() %s. %s" %(param, time.ctime())...
Original · 2016-08-01 18:15:21 · 415 views · 0 comments
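A sketch of the outline above with two threads running at different intervals. The excerpt is truncated, so the single `worker` function, the shared `log` list, and the shortened intervals (0.02 s and 0.05 s instead of 2 s and 5 s) are assumptions made to keep the example quick to run.

```python
import threading
import time

log = []
lock = threading.Lock()


def worker(name, interval, repeats):
    """Record `name` every `interval` seconds, `repeats` times."""
    for _ in range(repeats):
        with lock:                      # protect the shared list
            log.append(name)
        time.sleep(interval)


threads = [
    threading.Thread(target=worker, args=('a', 0.02, 2)),
    threading.Thread(target=worker, args=('b', 0.05, 2)),
]
for t in threads:
    t.start()
for t in threads:
    t.join()                            # wait for both threads to finish

print(log)
```

The order of entries in `log` depends on thread scheduling, so only the counts per name are deterministic.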
Python DataFrame: handling NaN
Replace the NaN values in a DataFrame with the values you want:
    import pandas as pd
    df1 = pd.DataFrame([{'col1':'a', 'col2':1}, {'col1':'b', 'col2':2}])
    df2 = pd.DataFrame([{'col1':'a', 'col3':11}, {'col1':'c', 'col3':33}])
    data = pd.merge(left...
Original · 2016-08-12 17:00:26 · 28618 views · 0 comments
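A sketch of replacing NaN with `fillna` after a merge that produces missing values. The excerpt's `pd.merge` call is truncated, so the outer join and the fill values used here are assumptions.

```python
import pandas as pd

df1 = pd.DataFrame([{'col1': 'a', 'col2': 1}, {'col1': 'b', 'col2': 2}])
df2 = pd.DataFrame([{'col1': 'a', 'col3': 11}, {'col1': 'c', 'col3': 33}])

# An outer join keeps keys from both sides, leaving NaN where there is no match.
merged = pd.merge(df1, df2, on='col1', how='outer')

filled = merged.fillna(0)                               # one value everywhere
per_column = merged.fillna({'col2': -1, 'col3': 0})     # a value per column

print(merged)
print(filled)
print(per_column)
```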
Python JSON to CSV example
    # -*- coding:utf-8 -*-
    import pandas as pd
    import sys
    def json2csv(json_path, json_file, csv_path= None, csv_file=None):
        if csv_path is None:
            csv_path = json_path
        if csv_file is None:...
Original · 2016-08-10 15:19:34 · 2526 views · 1 comment
Python DataFrame groupby
Using groupby on a DataFrame:
    # -*- coding: UTF-8 -*-
    import pandas as pd
    df = pd.DataFrame([{'col1':'a', 'co12':1}, {'col1':'b', 'co12':2}, {'col1':'c', 'co12':3}, {'col1':'a', 'co12':44}])
    print df.groupby(b...
Original · 2016-08-12 16:27:32 · 6364 views · 0 comments
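A sketch of grouping by one column and aggregating another. The excerpt's `groupby` call is cut off, so the aggregations chosen here and the column name `col2` are assumptions; the values mirror the sample frame.

```python
import pandas as pd

df = pd.DataFrame({'col1': ['a', 'b', 'c', 'a'], 'col2': [1, 2, 3, 44]})

# Sum of col2 per distinct col1 value, indexed by col1.
totals = df.groupby('col1')['col2'].sum()

# Several aggregations at once, with col1 kept as a regular column.
summary = df.groupby('col1', as_index=False).agg(
    total=('col2', 'sum'),
    rows=('col2', 'count'),
)

print(totals)
print(summary)
```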
Python DataFrame basics
1. Build a DataFrame from data
    # -*- coding: UTF-8 -*-
    import pandas as pd
    # Method 1
    d1 = [{'col1':'a', 'co12':1}, {'col1':'b', 'co12':2}, {'col1':'c', 'co12':3}]
    df = pd.DataFrame(d1)
    print df
    # Method 2
    data = [['a', 1], ['...
Original · 2016-08-12 10:04:34 · 1354 views · 0 comments
Python DataFrame rename
    # -*- coding: UTF-8 -*-
    import pandas as pd
    df = pd.DataFrame([{'col1':'a', 'col2':1, 'col3':'aa'}, {'col1':'b', 'col2':2, 'col3':'bb'}, {'col1':'c', 'col2':3, 'col3':'cc'}, {'col1':'a', 'col2':44, 'co...
Original · 2016-08-12 16:54:25 · 8840 views · 0 comments
ValueError: unsupported format character 'Y' (0x59) at index 70
Error message: ValueError: unsupported format character 'Y' (0x59) at index 70
Cause: the SQL executed from Python contains an expression such as DATE_FORMAT(MAX(CREATE_TIME), '%Y-%m-%d'), and its %Y clashes with Python's %s parameter placeholder.
Fix: change DATE_FORMAT(MAX(CREATE_TIME), '%Y-%...
Original · 2016-07-11 19:41:51 · 13472 views · 0 comments
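The clash can be reproduced with plain Python `%` formatting, where a literal percent sign has to be written as `%%`. The excerpt is cut off before its fix, so the doubling shown here is the general technique rather than a quotation of the post, and the table name `orders` and the `id` filter are hypothetical.

```python
broken = "SELECT DATE_FORMAT(MAX(CREATE_TIME), '%Y-%m-%d') FROM orders WHERE id = %s"
escaped = "SELECT DATE_FORMAT(MAX(CREATE_TIME), '%%Y-%%m-%%d') FROM orders WHERE id = %s"

# %Y is read as a (nonexistent) format specifier, so formatting fails.
try:
    broken % (42,)
    error = None
except ValueError as exc:
    error = str(exc)

# %% collapses to a single literal %, leaving only %s as a placeholder.
rendered = escaped % (42,)

print(error)
print(rendered)
```

Database drivers that substitute parameters with `%` formatting behave the same way when arguments are passed alongside the query, so the SQL text needs the doubled form there too.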