- 博客(27)
- 资源 (20)
- 问答 (1)
- 收藏
- 关注
转载 带圆圈的数字和markdown常用表达式记录(持续更新中)
大括号(记得两边都加上$):p=\left{\begin{aligned}x & = & \cos(t) \y & = & \sin(t) \z & = & \fra
2018-11-13 21:49:34 19229 5
原创 CVP(Critical value pruning)examples with python implemention
The python implemention for CVP(Critical Value Pruning) is here:https://github.com/appleyuchi/Decision_Tree_PruneThe C4.5 model is transformed from C-model to Python model,details can be referred in...
2018-11-30 22:29:08 655
原创 卡方检验contingency(列联表)python计算与实验结果分析
代码如下import numpy as npfrom scipy.stats import chi2_contingencyd = np.array([[37, 49, 23], [150, 100, 57]])print chi2_contingency(d)运行结果如下:(7.6919413561281065,0.021365652322337315,1.2,array(...
2018-11-29 20:16:48 5882 2
原创 python卡方分布计算
根据p-value计算分位点import scipy.statsprint scipy.stats.chi2.ppf(0.05, 5)根据分位点计算p-valuefrom scipy import statsprint 1 - stats.chi2.cdf(critical_value, DF)
2018-11-28 20:48:12 6030
原创 CVP(Critical Value Pruning)illustration with clear principle in details
The following is a contingency table[1]:H0:Xijn=NiNjn2H_0:\frac{X_{ij}}{n}=\frac{N_iN_j}{n^2}H0:nXij=n2NiNjH1:Xijn≠NiNjn2H_1:\frac{X_{ij}}{n}≠\frac{N_iN_j}{n^2}H1:nXij̸=n2NiNjNij=XijN...
2018-11-27 22:45:30 663
原创 The proof of “chi-square statistics follows chi-square distribution”
The target is to prove:∑i=1i=r∑j=1j=s[Xij−Ni⋅(Njn)]2Ni⋅(Njn)∼χ2[(r−1)(s−1)]①\sum_{i=1}^{i=r} \sum_{j=1}^{j=s}\frac{[X_{ij}-N_i·(\frac{N_j}{n})]^2}{N_i·(\frac{N_j}{n})}\sim\chi^2{[(r-1)(s-1)]}①i=1∑i...
2018-11-27 15:44:49 1996 4
转载 关于辅酶Q10的相关常识与选购要点(转)
Q10有类型={泛醌(ubiquinone)泛醇(Ubiquinol) Q10有类型=\left\{\begin{aligned}泛醌(ubiquinone)\\泛醇(Ubiquinol)\\\end{aligned}\right.Q10有类型={泛醌(ubiquinone)泛醇(Ubiquinol)注意这个东西在美国属于食品添加剂性质,是绝对不可能通过所谓的FDA药典的。我们需...
2018-11-24 16:21:04 4770
原创 Error in a paper about CVP
Dear Professor Ricco RAKOTOMALALA:the Reference is:<An Empirical Comparison of Selection Measures for Decision-Tree Induction>The following datasets from above referencetry to find what
2018-11-21 23:13:45 316
原创 MEP(minimum error pruning) principle with python implemention
According to《Estimating Probabilities: A Crucial Task in Machine learning》:Θ=p(C)p(C∣V1)p(C)p(C∣V1V2)p(C∣V1)p(C∣V1V2V3)p(C∣V1V2)⋅⋅⋅⋅①whereΘ=p(C)\frac{p(C|V_1)}{p(C)}\frac{p(C|V_1V_2)}{p(C|V_1)}\fra...
2018-11-18 21:48:24 653 1
原创 详细解释到底啥是共轭先验(用本科知识来解释)
我们直奔主题。根据百度百科上的解释:如果后验分布与先验分布属于同类(分布簇),则先验分布与后验分布被称为共轭分布,而先验分布被称为似然函数的共轭先验。上面这个定义有点复杂,我们待会儿再回过头来看这个定义P(θ∣x)=P(x∣θ)⋅P(θ)∫P(x∣θ′)⋅P(θ′)dθ′P(\theta|x)=\frac{P(x|\theta)·P(\theta)}{\int P(x|\theta&am...
2018-11-16 19:50:37 6559
原创 Two Examples of Minimum Error Pruning(reprint)
Attention,this article is from the following link with my own notes:http://www.cse.unsw.edu.au/~billw/cs9414/notes/ml/06prop/id3/id3-2.htmlExpected Error PruningApproximate expected error assuming ...
2018-11-15 16:04:05 410
原创 maven构建scala工程并最终运行的所有细节(scala maven helloworld)
琢磨了很久,网上虽然各种例子,但是好多都是依赖IDE的,要么就是生成jar文件后没有主清单目录,写这篇博客一次性全部解决掉。1,文件目录(python2.7) appleyuchi@ubuntu:~/Desktop/test3/scala-module-dependency-sample/maven-sample$ lspom.xml src.├── pom.xml├── s...
2018-11-12 13:32:35 2767
转载 sbt構建一個spark工程(scala+spark+sbt)
準備工作,文件結構如下:(python2.7) appleyuchi@ubuntu:~/Desktop/WordCount$ tree.├── build.sbt├── src│ └── main│ └── scala│ └── WordCount.scala其中...
2018-11-12 13:32:17 960
原创 pyspark读写SequenceFile
完整代码如下:# -*- coding: utf-8 -*-import sysreload(sys)sys.setdefaultencoding('utf-8')# @Author: appleyuchi# @Date: 2018-07-19 14:59:02# @Last Modified by: appleyuchi# @Last Modified time: 2...
2018-11-12 13:31:50 1752 2
原创 Unable to load native-hadoop library for your platform
环境:ubuntu-linux 16.04spark-2.3.1-bin-hadoop2.7hadoop-2.7.7可能的原因:1.so文件版本不对查看命令:file libhadoop.so.1.0.0libhadoop.so.1.0.0: ELF 64-bit LSB shared object, x86-64, version 1 (SYSV), dynamically ...
2018-11-12 13:31:32 2218
原创 maven零基础从配置到运行helloworld(java maven helloworld)
首先是maven的安装和配置http://mirrors.shu.edu.cn/apache/maven/maven-3/3.5.4/binaries/apache-maven-3.5.4-bin.tar.gz下载apache-maven-3.5.4解压到~/bigdata/apache-maven-3.5.4---------------------------~/.bashrc配...
2018-11-12 13:31:15 863
转载 sbt安装、配置、運行(scala sbt helloworld)
下载sbt-1.1.6.tgz---------------------------------------------------------------------------------------------------------------------------------------------------.bashrcexport PATH=~/bigdata/sb...
2018-11-12 13:30:55 790
原创 hive与spark的匹配版本汇总
版本信息来自于hive的源码包的pom.xml文件中,汇总下:apache-hive-1.2.2-src <spark.version>1.3.1</spark.version>apache-hive-2.1.1-src <spark.version>1.6.0</spark.version>apache-hive-2.3.3-src <...
2018-11-12 13:30:30 7217
原创 Pessimistic error pruning illustration with C4.5-python implemention
------------------get the datasets-----------------------------------We use the following datasets:https://archive.ics.uci.edu/ml/machine-learning-databases/abalone/abalone.dataTarget:predict the ...
2018-11-11 19:00:59 1659 2
原创 Pessimistic Error Pruning example of C4.5
This example is from 《An Empirical Comparison of Pruning Methodsfor Decision Tree Induction》How to read these node and leaves?For example:node 30:15 are classified as “class1”2 are mis-classifi...
2018-11-09 20:34:49 539
原创 C4.5-Release8的代码架构图
代码链接:http://www.rulequest.com/Personal/c4.5r8.tar.gzfile_structure.dotdigraph abc{"c4.5.c";"besttree.c";"build.c";"info.c";"contin.c";"discr.c&
2018-11-08 22:12:18 377
原创 Error Based Pruning剪枝算法、代码实现与举例
EBP(Error Based Pruning):下列算法转载自链接:https://login.sina.com.cn/crossdomain2.php?action=login&amp;amp;amp;entry=blog&amp;amp;amp;r=http%3A%2F%2Fblog.sina.com.cn%2Fs%2Fblog_64ecfc2f0101r3o5.html%3Fsudaref%3Dwww.baidu.c...
2018-11-08 16:28:39 1395
原创 U25%(1,16) and U25%(1,168)on《C4.5:programs for machine learning》
when calculatingUCFU_{CF}UCF(e,N)CF: Confidence Level(here is 25%)e:misclassifying counts of current subtree we focus onN:counts of sub-datasets relevant to current subtree who is under judgment ...
2018-11-07 23:30:51 751
原创 Earliest PEP Algorithm Principles
PEP:Pessimistic Error Pruning本文分为三部分:第一部分:1986年的PEP剪枝算法原理第二部分:1997年的PEP剪枝算法原理以及与前者的区别-----------------------------第一部分:《Simplifying Decision Trees》-1986作者:J.R. Quinlan该论文的2.3 Pessimistic Pruni...
2018-11-06 22:23:26 430
原创 Weka的-3.6.10的C4.5与Quinlan教授的C4.5算法的区别
使用数据集:http://archive.ics.uci.edu/ml/machine-learning-databases/car/weka-3.6.10的结果是:safety = low: unacc (576.0)safety = med| persons = 2.0: unacc (192.0)| persons = 4.0| | buying = vhigh...
2018-11-06 17:27:11 363
原创 《C4.5: Programs for Machine Learning》chaper4实验结果重现
使用自带的数据集:实验结果如下:剪枝前:physician fee freeze = n:| adoption of the budget resolution = y: democrat (151.0)| adoption of the budget resolution = u: democrat (1.0)| adoption of the budget resolu...
2018-11-05 19:47:27 983
原创 History of pruning algorithm development and python implementation(finished)
All the python-implementation for 7 post-pruning Algorithmsare in the following link:https://github.com/appleyuchi/Decision_Tree_PruneTable of Decision Trees:name of treeinventername of arti...
2018-11-03 23:22:27 4274 6
ngrok-stable-linux-amd64.zip
2019-11-18
VBoxGuestAdditions_5.2.18.iso
2019-04-27
pearson证明卡方检验的原文-1900年
2018-12-07
用于ubuntu16.04下安装mongodb时缺少的库文件
2018-08-08
浙大版《概率论与数理统计》(第四版)第十一章实验数据、实验步骤与实验结果
2018-05-17
virtualbox-5.1_5.1.22-115126~Ubuntu~xenial_amd64.deb
2017-05-21
Visual Assist X 10.6.1823 破解版
2012-02-05
MSP仿真器(支持FET430PIF、FET430UIF、MSP430BSL对MSP430芯片进行编程)
2011-03-28
《半导体集成电路课》朱正涌(第二版)课后习题答案仿真(第四章)
2010-06-18
无warning无error,但不知为何没有输出的代码,求指点~
2015-09-20
TA创建的收藏夹 TA关注的收藏夹
TA关注的人