- 博客(8)
- 收藏
- 关注
原创 《模型融合》投票法、stacking和blending
import numpy as np import pandas as pdimport matplotlib.pyplot as plt import seaborn as sns%matplotlib inlineplt.rcParams["font.sans-serif"] = ["FangSong"] plt.rcParams["axes.unicode_minus"] = False import warningswarnings.filterwarnings("ignore")
2020-09-27 22:53:38 704
原创 【违约预测】TASK 04
import numpy as np import pandas as pdimport matplotlib.pyplot as plt import seaborn as sns%matplotlib inlineplt.rcParams["font.sans-serif"] = ["FangSong"] plt.rcParams["axes.unicode_minus"] = False import warningswarnings.filterwarnings("ignore")
2020-09-24 22:48:50 142
原创 【五分钟精通R语言】R数据类型、判断、循环
R的基本运算a = c(1,2,3,4)b = c(3,4,5,6)print(a + b)print(a ^ b) # a ** b print(a %% b) # 整除取余print(a %/% b) # 整除v <- a # 向左赋值b -> w # 向右赋值 ls() # 列出所有变量print( 1 %in% v) # 相当于 inprint(a %*% b) # 相当于 a*a.Ts = 1:10[1] 4 6 8 10[1]
2020-09-22 16:48:02 2030 1
原创 【分箱操作】决策树、卡方、分位数、等距和映射分箱操作代码实现
from sklearn.tree import DecisionTreeClassifierimport pandas as pdimport numpy as npdata = pd.read_csv('train.csv',index_col = 'id')data.head()决策树分箱def optimal_binning_boundary(x: pd.Series, y: pd.Series) -> list: ''' 利用决策树获得最优分箱的边界
2020-09-21 23:38:24 2424 1
原创 【可视化】matplotlib.animation_动图
import numpy as npimport pandas as pd from matplotlib.animation import FuncAnimationfig, ax = plt.subplots() # 创建图表和axesdef update(i):‘’’函数为更新axes信息i 可以理解为迭代词数返回一个axes'''return tableani = FuncAnimation(fig=fig, # 更新的画布func=update, # 更新函数fr
2020-09-15 16:33:44 1077
原创 【贷款违约预测】task1and2 理解和数据探索
import numpy as np # 导入numpy库import pandas as pd # 导入pandas库import matplotlib as mpl # 导入matplotlib库import matplotlib.pyplot as plt import seaborn as sns # 导入seaborn库%matplotlib inlineplt.rcParams['font.sans-
2020-09-15 08:40:16 877
原创 【地图可视化 】 folium
Table of Contents1 MAP create2 Heatmap3 CircleMarker4 folium.CircleMarker 标记5 folium.PolyLine(6 map save# generated dataimport numpy as npdata = ( np.random.normal(size=(100, 3)) *
2020-09-13 09:31:52 1810 2
原创 【DCIC】task1
import pandas as pd import numpy as npimport seaborn as snsimport matplotlib.pyplot as pltdf = pd.read_csv('taxiGps20200618.csv')df RUNNING_STATUS GPS_SPEED DRIVING_DIRECTION GPS_DATE LONGITUDE
2020-09-11 17:44:23 225
空空如也
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人