机器学习
猿与禅
think more , write less , more value !
君子藏器于身,待时而动,争其必然,顺其自然
展开
-
数据挖掘-目录-关联分析
Apriori(频繁项集挖掘并行化)FP AssociationRules FPGrowth FPTree原创 2017-03-23 19:06:28 · 404 阅读 · 0 评论 -
数据挖掘-目录-推荐(recommendation)
matrix factorization Alternating Least Squares (ALS)原创 2017-03-23 17:12:15 · 678 阅读 · 0 评论 -
数据挖掘-目录-特征处理(feature)
BinarizerBucketizerChiSqSelectorCountVectorizerDCTElementwiseProductHashingTFIDFInteractionMinMaxScalerNGramNormalizerOneHotEncoderPCAPolynomialExpansionQuantileDiscretizerSQLTransformerStandardScalerS转载 2017-03-23 17:35:06 · 1182 阅读 · 0 评论 -
spark-MLlib-架构
sparkmlib-架构转载 2017-03-24 16:49:47 · 1003 阅读 · 2 评论 -
数据挖掘-目录-正则化方法
Ridge Regression Least Absolute Shrinkage Selection Operator ( LASSO )弹性网络( Elastic Net )原创 2017-03-24 13:41:27 · 604 阅读 · 1 评论 -
数据挖掘-目录-集成算法
Boosting Bootstrapped Aggregation ( Bagging ) AdaBoost 堆叠泛化( Stacked Generalization , Blending) 梯度推进机( Gradient Boosting Machine, GBM ) 随机森林( Random Forest )。原创 2017-03-24 13:16:01 · 583 阅读 · 1 评论 -
数据挖掘-目录-降维(Dimensionality Reduction)
EigenValueDecomposition(特征值分解) SingularValueDecomposition(奇异值分解) Principal Component Analysis(主成分分析)原创 2017-03-24 00:15:57 · 451 阅读 · 0 评论 -
数据挖掘-目录-深度学习(Deep Learning)
受限波尔兹曼机( Restricted Boltzmann Machine, RBN )Deep Belief Networks ( DBN )卷积网络( Convolutional Network )堆栈式自动编码器( Stacked Auto-encoders )原创 2017-03-24 12:02:07 · 562 阅读 · 1 评论 -
数据挖掘-目录-人工神经网络 (Artificial Neural Network)
感知器神经网络( Perceptron Neural Network )反向传递( Back Propagation ) Hopfield 网络自组织映射( Self-Organizing Map, SOM )学习矢量量化( Learning Vector Quantization , LVQ )原创 2017-03-24 11:59:40 · 1087 阅读 · 1 评论 -
数据挖掘-目录-基本统计
correlation Correlation PearsonCorrelation SpearmanCorrelationdistribution MultivariateGaussianKernelDensityMultivariateOnlineSummarizerMultivariateStatisticalSummaryKolmogorovSmirnov原创 2017-03-23 18:57:36 · 814 阅读 · 0 评论 -
数据挖掘-目录-impurity
Entropy Gini Impurities Impurity Variance原创 2017-03-24 00:24:53 · 622 阅读 · 1 评论 -
数据挖掘-目录-loss
AbsoluteErrorLogLossLossLossesSquaredError原创 2017-03-24 00:22:58 · 503 阅读 · 0 评论 -
数据挖掘-目录-评估
AreaUnderCurveBinaryClassificationMetricComputersBinaryConfusionMatrixBinaryLabelCounterBinaryClassificationMetricsMulticlassMetricsRankingMetricsRegressionMetricsBinaryClassificationEvaluatorEvaluator原创 2017-03-24 00:19:51 · 438 阅读 · 0 评论 -
数据挖掘-目录-最优化算法(optimization)
GradientDescent (梯度下降算法)L-BFGS(限制内存BFGS)NNLS(非负最小二乘)原创 2017-03-24 00:10:56 · 1133 阅读 · 0 评论 -
数据挖掘-目录-聚类(clustering)
K-means bisecting k-means DBSCANMAXGaussianMixtureLatent Dirichlet AllocationGaussianMixturePowerIterationClustering原创 2017-03-23 17:07:03 · 847 阅读 · 0 评论 -
数据挖掘-目录-回归分析(regression)
AFTSurvivalRegressionDecisionTreeRegressorGBTRegressorIsotonicRegressionLinearRegressionRandomForestRegressorGeneralizedLinearAlgorithmGLMRegressionModelIsotonicRegressionLabeledPointLassoLinearRegress原创 2017-03-23 17:57:53 · 854 阅读 · 0 评论 -
数据挖掘-目录-线性代数( linear algebra)
Basic Linear Algebra SubprogramsCholeskyDecompositiondistributedBlockMatrixCoordinateMatrixDistributedMatrixIndexedRowMatrixRowMatrixEigenValueDecompositionMatricesSingularValueDecompositionVectors原创 2017-03-23 19:36:47 · 997 阅读 · 0 评论 -
数据挖掘-目录-分类器(classification)
GLMNaiveBayesSupport Vector Machines Stochastic Gradient Descent LogisticRegressionDecisionTree CART Hunt ID3 C4.5 KNIMEGradient-Boosted TreesMultilayerPerceptronRandomForest原创 2017-03-23 16:59:22 · 1763 阅读 · 0 评论