2021年02月_周小董

12月 11月 10月 09月 08月 07月 06月 05月 04月 03月 02月 01月

原创 [948]Pandas数据分组的函数应用（df.apply()、df.agg()、df.transform()、df.applymap()、df.groupby()）

这个函数需要自己实现，函数的传入参数根据axis来定，比如axis = 1，就会把一行数据作为Series的数据结构传入给自己实现的函数中，我们在函数中实现对Series不同属性之间的计算，返回一个结果，则apply函数会自动遍历每一行DataFrame的数据，最后将所有结果组合成一个Series数据结构并返回。Dataframe在行（axis=0）或列（axis=1）上进行分组，将一个函数应用到各个分组并产生一个新值，然后函数执行结果被合并到最终的结果对象中。默认axis=0，即逐列进行操作；

2021-02-28 22:39:58 5376

原创 [947]ImportError: [joblib] Attempting to do parallel computing without protecting

python报错ImportError: [joblib] Attempting to do parallel computing without protecting错误：ImportError: [joblib] Attempting to do parallel computing without protecting your import on a system that does not support forking. To use parallel-computing in a scr

2021-02-28 21:39:31 287

原创 [946]pandas.errors.ParserError: Error tokenizing data

pandas.errors.ParserError: Error tokenizing datamydf = pd.read_csv(filename, encoding=‘utf-8’, error_bad_lines=False) #加上error_bad_lines=False

2021-02-28 21:38:53 553

原创 [945]AttributeError: module ‘pandas‘ has no attribute ‘rolling_mean‘

文章目录AttributeError: module 'pandas' has no attribute 'rolling_mean'AttributeError: module ‘pandas’ has no attribute ‘rolling_mean’moving_avg = pd.rolling_mean(ts_log,12)上面代码报错：AttributeError: module ‘pandas’ has no attribute ‘rolling_mean’解决方法：moving

2021-02-28 21:38:09 384

原创 [944]AttributeError:‘DataFrame‘ object has no attribute ‘sort‘，‘as_matrix‘，‘ix‘

文章目录AttributeError:'DataFrame' object has no attribute 'sort'AttributeError DataFrame object has no attribute as_matrixAttributeError: 'DataFrame' object has no attribute 'ix'AttributeError:‘DataFrame’ object has no attribute ‘sort’解决办法：将“sort”改为“sort_va

2021-02-28 21:35:27 1239 3

原创 [943]thefuck的安装和使用

文章目录简介截图示例安装简介你是不是经常在终端敲错命令？敲错命令，删掉重敲，很烦有没有？当你一再敲错的时候，内心一定是崩溃的，一定在默念What The FUCK!。就这样thefuck神器就诞生了。thefuck不仅能修复字符输入顺序的错误，在很多别的你想说fuck的情况下，thefuck依然有效，反正只要你因为命令的问题报错，就请fuck一下。thefuck是一个使用Python编写的开源小工具，它可以自动纠正前一个命令的拼写错误。这个工具非常酷，尤其对于常常使用命令行的童鞋。thefuck支持

2021-02-19 22:53:03 799

原创 [942]IndexError: boolean index did not match indexed array along dimension 0

在学习回归算法的时候，使用sklearn.linear_model下的RandomizedLogisticRegression（下列简称为RLR）来做预测但是总是会遇到下面这个错误：IndexError: boolean index did not match indexed array along dimension 0; dimension is 9 but corresponding boolean dimension is 8之后就想看下这个get_support()函数原型，找到官方文档，截个

2021-02-17 21:36:51 3531 2

原创《python数据分析与挖掘实战》笔记第5章

文章目录第5章：挖掘建模5.1、分类与预测5.1.1、实现过程5.1.2、常用的分类与预测算法5.1.3、回归分析5.1.4、决策树5.1.5、人工神经网络5.1.7、 Python分类预测模型特点5.2、聚类分析第5章：挖掘建模5.1、分类与预测分类和预测是预测问题的两种主要类型，分类主要是预测分类标号（离散属性），而预测主要是建立连续值函数模型，预测给定自变量对应的因变量的值。5.1.1、实现过程(1）分类分类是构造一个分类模型，输入样本的属性值，输岀对应的类别，将每个样本映射到预先定义好

2021-02-13 22:36:50 4303 4

原创《python数据分析与挖掘实战》笔记第4章

文章目录第4章：数据预处理4.1、数据清洗4.1.1、缺失值处理4.1.1、异常值处理4.2、数据集成4.2.1、实体识别4.2.2、冗余属性识别4.3、数据变换4.3.1、简单函数变换4.3.2、规范化4.3.3、连续属性离散化4.3.4、属性构造4.3.5、小波变换4.4、数据规约4.4.1、属性规约4.4.2、数值规约4.5、Python主要数据预处理函数4.6、小结第4章：数据预处理数据预处理一方面是要提高数据的质量，另一方面是要让数据更好地适应特定的挖掘技术或工具。统计发现，在数据挖掘的过程

2021-02-13 22:35:42 1662 1

原创《python数据分析与挖掘实战》笔记第3章

文章目录第3章：数据探索3.1、数据质量分析3.2、数据特征分析3.2.1、分布分析3.2.2、对比分析3.2.3、统计量分析1.集中趋势度量2.离中趋势度量3.2.4、周期性分析3.2.5、贡献度分析3.2.6、相关性分析1. 直接绘制散点图2. 绘制散点图矩阵3. 计算相关系数3.3、python主要数据探索函数3.3.1、基本统计特征函数corr()cov()skew/kurt3.3.2、拓展统计特征函数3.3.3、统计作图函数(1) plot(2) pie(3) hist(4) boxplot(5)

2021-02-13 22:33:20 1701

原创《python数据分析与挖掘实战》笔记第2章

文章目录第2章：python数据分析简介2.2、python使用入门2.2.3、数据结构(1)列表/元组(2)字典(3)集合(4)函数式编程2.2.4、库的导入与添加2.3、python数据分析工具2.3.1、numpy2.3.2、scipy2.3.3、matplotlib2.3.4、pandas2.3.5、statsmodels2.3.6、scikit-learn2.3.7、keras2.3.8、gensim第2章：python数据分析简介2.2、python使用入门2.2.3、数据结构pytho

2021-02-13 22:30:47 770

TA关注的人

周小董