python
apple-nul
Data mining, natural language processing, Fintech, blockchain, econometrics
python 3.5 SyntaxError: invalid character in identifier
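This error usually means the code contains Chinese (full-width) punctuation; replacing it with the ASCII equivalents fixes it.
Original · 2018-04-17 10:10:38 · 243 views · 0 comments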
python chatterbot (example)
from chatterbot import ChatBot
from chatterbot.trainers import ListTrainer
conversation = [
    "Hello",
    "Hi there!",
    "How are you doing?",
    "I'm doing great.",
    "That is good to hear"...
Reposted · 2018-07-17 21:17:12 · 1415 views · 0 comments
xgboost feature importance
from sklearn.model_selection import train_test_split
from sklearn import metrics
from sklearn.datasets import make_hastie_10_2
from xgboost.sklearn import XGBClassifier
from xgboost import plot_impo...
Reposted · 2019-02-16 21:56:58 · 1886 views · 1 comment
Reading .mat files in Python
The code is as follows:
import scipy.io as sio
matfn = '/Users/wang/Desktop/read-paper/outlier/github-outlier/pyod-master/notebooks/data/letter.mat'
data = sio.loadmat(matfn)
print(data.keys())  # list the variable names...
Original · 2019-02-14 15:36:44 · 647 views · 0 comments
python wordcloud matplotlib (plotting)
############## matplotlib ##############
import matplotlib.pyplot as plt
import numpy as np
# Plot the curves
x = np.linspace(0, 10, 100)
ps = plt.plot(x, np.sin(x), x, np.cos(x))
# Add text
t1 = plt.text(1, -0.5, "hello")
# Change the text coordinates...
Original · 2018-07-17 21:12:05 · 1059 views · 0 comments
sklearn/naive_bayes/training/classification
# -*- coding: utf-8 -*-
"""
Created on Mon Apr 23 10:39:20 2018
@author: NAU
"""
# -*- coding: utf-8 -*-
"""
Created on Sun Apr 22 19:29:14 2018
@author: NAU
"""
# Import packages
from sklearn.feature_extracti...
Original · 2018-09-14 20:31:08 · 495 views · 0 comments
python sklearn example
# Import modules
from sklearn import datasets
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.neighbors import KNeighborsClassifier
# Create the data
iris = datasets.load_iris()
iris_X =...
Reposted · 2018-07-17 20:51:43 · 617 views · 0 comments
tfidf/kmeans/pca/sklearn
# -*- coding: utf-8 -*-
"""
Created on Wed Apr 18 11:56:02 2018
@author: NAU
"""
# Import packages
import random
import sys
from sklearn import feature_extraction
from sklearn.feature_extraction.text import TfidfT...
Original · 2018-09-14 20:30:32 · 516 views · 1 comment
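Installing packages with conda and pip
Installing with conda: open Anaconda Prompt; install a package with conda install package; check the installed packages with conda list.
Installing with pip: step 1: open cmd; step 2: run pip; step 3: pip install xx.whl (path to the file).
Installing lxml: step 1: open cmd; step 2: cd F:\WANPI931014\我的经验 (the folder containing the file) ...
Original · 2018-07-17 20:33:21 · 9053 views · 0 comments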
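After either installation route, it can help to confirm from Python that the package is actually visible to the interpreter in use. The following is a minimal sketch using only the standard library, assuming Python 3.8 or later (for `importlib.metadata`); the helper names `is_installed` and `installed_version` are hypothetical and not part of the original post.

```python
import importlib.util
from importlib import metadata


def is_installed(module_name):
    """Return True if the top-level module can be located for import."""
    return importlib.util.find_spec(module_name) is not None


def installed_version(distribution_name):
    """Return the installed version string, or None if the distribution is absent."""
    try:
        return metadata.version(distribution_name)
    except metadata.PackageNotFoundError:
        return None


# "json" ships with Python; "lxml" is present only if it was installed.
for name in ("json", "lxml"):
    print(name, "importable:", is_installed(name))
print("lxml version:", installed_version("lxml"))
```

If a package installs without errors but still reports as not importable, the usual cause is that pip or conda installed it into a different environment than the one running the script.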
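Regular expressions
Python's strip() method removes the specified characters from both ends of a string; the characters to remove go inside the parentheses (by default, whitespace).
. (dot): matches any character except a newline. For example, "ATT.T" matches "ATTCT" and "ATTFT" but not "ATTTCT".
^ (caret): matches the start of the string. "^AUG" matches "AUGAGC" but not "AAUGC". Inside a character set it means negation.
$ (dollar): matches the end of the string, or just at a new line...
Reposted · 2018-07-17 20:48:59 · 151 views · 0 comments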
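The rules above can be checked directly with Python's standard `re` module. This is a small sketch that reuses the strings from the text; the variable names are chosen for illustration only.

```python
import re

# strip() with no argument removes leading and trailing whitespace;
# with an argument it removes any of the listed characters from both ends.
stripped_default = "  ATTCT \n".strip()
stripped_chars = "xxATTCTxx".strip("x")

# "." matches any single character except a newline.
dot_matches = [s for s in ("ATTCT", "ATTFT", "ATTTCT") if re.fullmatch(r"ATT.T", s)]

# "^" anchors the match at the start of the string.
caret_matches = [s for s in ("AUGAGC", "AAUGC") if re.search(r"^AUG", s)]

# Inside a character set, "^" negates the set: every character that is not A or U.
negated = re.findall(r"[^AU]", "AUGAGC")

# "$" matches at the end of the string, or just before a trailing newline.
dollar_plain = re.search(r"AGC$", "AUGAGC") is not None
dollar_newline = re.search(r"AGC$", "AUGAGC\n") is not None

print(stripped_default, stripped_chars)
print(dot_matches, caret_matches, negated)
print(dollar_plain, dollar_newline)
```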
word2vec with the gensim package
import gensim
inputs = open('C:\\Users\\NAU\\Desktop\\neg_tag_del.txt', 'r', encoding='utf8')
outputs = open('C:\\Users\\NAU\\Desktop\\neg_feature.txt', 'w', encoding='utf8')
sentence = inputs.readlin...
Original · 2019-05-12 12:52:09 · 298 views · 0 comments
Text clustering and topic output with sklearn KMeans
from sklearn import feature_extraction
from sklearn.feature_extraction.text import TfidfTransformer
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.cluster import KMeans
corpu...
Original · 2018-12-31 14:49:34 · 2100 views · 0 comments