数据挖掘/机器学习进阶
-Shonna-
这个作者很懒,什么都没留下…
展开
-
Basic基础——数学,线代,概率基础
Basic基础原创 2016-08-13 17:06:46 · 397 阅读 · 0 评论 -
Learning to Rank(基于学习的排序):
Pointwise:McRank; Pairwise:RankingSVM,RankNet,Frank,RankBoost; Listwise:AdaRank,SoftRank,LamdaMART;原创 2016-08-13 17:20:57 · 568 阅读 · 0 评论 -
Outlier Detection(异常点检测)
Outlier Detection(异常点检测): Statistic-based(基于统计),Distance-based(基于距离),Density-based(基于密度),Clustering-based(基于聚类)。原创 2016-08-13 17:20:15 · 2669 阅读 · 0 评论 -
Feature Selection(特征选择)
Feature Selection(特征选择): MutualInformation(互信息),Document Frequence(文档频率),Information Gain(信息增益),Chi-squared Test(卡方检验),Gini(基尼系数)。原创 2016-08-13 17:19:35 · 637 阅读 · 0 评论 -
Optimization(最优化)
Optimization(最优化): Non-constrained Optimization(无约束优化):Cyclic Variable Methods(变量轮换法),Pattern Search Methods(模式搜索法),Variable Simplex Methods(可变单纯形法),Gradient Descent Methods(梯度下降法),Newton Metho原创 2016-08-13 17:18:50 · 1938 阅读 · 0 评论 -
SimilarityMeasure&Distance Measure(相似性与距离度量)
SimilarityMeasure&Distance Measure(相似性与距离度量): EuclideanDistance(欧式距离),Manhattan Distance(曼哈顿距离),Chebyshev Distance(切比雪夫距离),Minkowski Distance(闵可夫斯基距离),Standardized EuclideanDistance(标准化欧氏距离),Maha原创 2016-08-13 17:17:56 · 2271 阅读 · 0 评论 -
Recommendation Engine(推荐引擎)
Recommendation Engine(推荐引擎): DBR(Demographic-basedRecommendation 基于人口统计学的推荐),CBR(Context-based Recommendation 基于内容的推荐),CF(Collaborative Filtering协同过滤),UCF(User-based CollaborativeFiltering Reco原创 2016-08-13 17:17:13 · 974 阅读 · 0 评论 -
Association Mining(关联挖掘)
Association Mining(关联挖掘): Apriori,FP-growth(FrequencyPattern Tree Growth 频繁模式树生长算法),AprioriAll,Spade。原创 2016-08-13 17:16:26 · 608 阅读 · 0 评论 -
Text Mining(文本挖掘)
Text Mining(文本挖掘): VSM(Vector SpaceModel向量空间模型),Word2Vec(词向量学习模型),TF(Term Frequency词频),TF-IDF(TermFrequency-Inverse Document Frequency 词频-逆向文档频率),MI(Mutual Information 互信息),ECE(Expected CrossEntr原创 2016-08-13 17:15:41 · 5643 阅读 · 0 评论 -
Dimensionality Reduction(降维)
Dimensionality Reduction(降维): LDA(LinearDiscriminant Analysis/Fisher Linear Discriminant 线性判别分析/Fish线性判别),PCA(Principal ComponentAnalysis 主成分分析),ICA(Independent ComponentAnalysis 独立成分分析),SVD(Sing原创 2016-08-13 17:14:46 · 333 阅读 · 0 评论 -
Deep Learning(深度学习)
Auto-encoder(自动编码器),SAE(Stacked Auto-encoders堆叠自动编码器:Sparse Auto-encoders稀疏自动编码器、Denoising Auto-encoders去噪自动编码器、ContractiveAuto-encoders 收缩自动编码器),RBM(Restricted BoltzmannMachine 受限玻尔兹曼机),DBN(Deep Be原创 2016-08-13 17:14:13 · 454 阅读 · 0 评论 -
NN(Neural Network神经网络)
NN(Neural Network神经网络): ANN(ArtificialNeural Network 人工神经网络),BP(Error Back Propagation 误差反向传播),HN(Hopfield Network), RNN(Recurrent Neural Network,循环神经网络),SRN(Simple Recurrent Network,简单的循环神经原创 2016-08-13 17:13:20 · 4548 阅读 · 0 评论 -
PGM(ProbabilisticGraphical Models概率图模型)
PGM(ProbabilisticGraphical Models概率图模型): BN(BayesianNetwork/Bayesian Belief Network/ Belief Network 贝叶斯网络/贝叶斯信度网络/信念网络),MC(Markov Chain 马尔科夫链),HMM(Hidden MarkovModel 马尔科夫模型),MEMM(Maximum EntropyM原创 2016-08-13 17:12:22 · 852 阅读 · 0 评论 -
Classification&Regression(分类&回归)
Classification&Regression(分类&回归): LR(LinearRegression 线性回归),LR(Logistic Regression逻辑回归),SR(SoftmaxRegression 多分类逻辑回归),GLM(Generalized LinearModel 广义线性模型),RR(Ridge Regression 岭回归/L2正则最小二乘回归),LASSO原创 2016-08-13 17:09:42 · 1618 阅读 · 0 评论 -
Clustering(聚类)
Clustering(聚类): K-Means,K-Mediods,二分K-Means,FK-Means,Canopy,Spectral-KMeans(谱聚类),GMM-EM(混合高斯模型-期望最大化算法解决),K-Pototypes,CLARANS(基于划分),BIRCH(基于层次),CURE(基于层次),DBSCAN(基于密度),CLIQUE(基于密度和基于网格),2014年Scie原创 2016-08-13 17:08:47 · 625 阅读 · 0 评论 -
Data Pre-processing(数据预处理)
数据预处理原创 2016-08-13 17:08:03 · 1702 阅读 · 0 评论 -
Tool(工具):
Tool(工具): MPI,Hadoop生态圈,Spark,BSP,Weka,Mahout,Scikit-learn,PyBrain…原创 2016-08-13 17:21:35 · 368 阅读 · 0 评论