- Spark 概述
- 编程指南
- 快速入门
- Spark 编程指南
- Spark Streaming
- DataFrames,Datasets 和 SQL
- Structured Streaming
- MLlib(机器学习)
- 机器学习库(MLlib)指南
- MLlib:基于RDD的API
- Data Types - RDD-based API(数据类型)
- Basic Statistics - RDD-based API(基本统计)
- Classification and Regression - RDD-based API(分类和回归)
- Collaborative Filtering - RDD-based API(协同过滤)
- Clustering - RDD-based API(聚类 - 基于RDD的API)
- Dimensionality Reduction - RDD-based API(降维)
- Feature Extraction and Transformation - RDD-based API(特征的提取和转换)
- Frequent Pattern Mining - RDD-based API(频繁模式挖掘)
- Evaluation metrics - RDD-based API(评估指标)
- PMML model export - RDD-based API(PMML模型导出)
- Optimization - RDD-based API(最优化)
- GraphX(图形处理)
- Spark R
- 部署
- 更多
http://cwiki.apachecn.org/pages/viewpage.action?pageId=2883613