参考网址
spark
http://spark.apache.org/docs/1.3.0/api/python/pyspark.html#subpackages
http://www.csdn.net/article/2015-07-10/2825184
http://www.cnblogs.com/shishanyuan/p/4699644.html
http://homepage.cs.latrobe.edu.au/zhe/ZhenHeSparkRDDAPIExamples.html 整理的spark RDD API参考
https://www.iteblog.com/archives/1657.html Spark性能优化指南——基础篇
http://litaotao.github.io/deep-into-spark-exection-model Spark原理
http://flykobe.com/index.php/2015/04/18/pyspark-and-spark/comment-page-1/ pyspark与spark的集成方式
http://www.aboutyun.com/thread-21058-1-1.html 3个有用的TensorFlow On Spark 开源项目分析
http://takwatanabe.me/pyspark/index.html pspark api 参考
http://geek.csdn.net/news/detail/203492 Spark App自动化分析和故障诊断
http://www.aboutyun.com/thread-6047-1-1.html hive优化----hive使用经验
http://blog.csdn.net/qq_34531825/article/details/52689654 Spark2.0机器学习系列之12: 线性回归及L1、L2正则化区别与稀疏解
https://github.com/claesenm/spark-ml-inventory spark相关开源库
TensorFlow
http://wiki.jikexueyuan.com/project/tensorflow-zh/how_tos/reading_data.html TensorFlow程序读取数据
https://sec.xiaomi.com/article/13 TensorFlow初学者在使用过程中可能遇到的问题及解决办法
http://geek.csdn.net/news/detail/235465 TensorFlow Wide And Deep 模型详解与应用
特征工程
http://blog.csdn.net/Dream_angel_Z/article/details/49388733 机器学习之特征工程
https://www.zhihu.com/question/28641663 机器学习中,有哪些特征选择的工程方法
https://zhuanlan.zhihu.com/p/26444240?utm_source=weibo&utm_medium=social 机器学习特征工程实用技巧大全
http://liuzhiqiangruc.iteye.com/blog/2145143 http://blog.csdn.net/calculusearch/article/details/52751218 http://blog.csdn.net/lujiandong1/article/details/52412123http://blog.csdn.net/u013818406/article/details/70494800 关于连续值离散化
http://phunters.lofter.com/post/86d56_194e956 http://dataunion.org/13206.html 关于推荐系统中的特征工程
其他
https://brenocon.com/blog/2012/03/cosine-similarity-pearson-correlation-and-ols-coefficients/
https://wenku.baidu.com/view/c616e7c008a1284ac85043cd.html
http://blog.csdn.net/hguisu/article/details/7866173
http://blog.csdn.net/jiaomeng/article/details/1495500
http://www.cnblogs.com/chaosimple/archive/2013/07/31/3227271.html 数据归一化和两种常用的归一化方法
http://www.cnblogs.com/heaad/archive/2011/03/08/1977733.html 机器学习中的相似性度量
http://www.jianshu.com/p/c5b8268d273b youtube的推荐系统
http://www.infoq.com/cn/articles/personalized-recommendation-practice-and-optimization 个性化推荐:实践与优化
http://blog.sina.com.cn/s/blog_5357c0af0102uxof.html http://www.52ml.net/318.html 推荐系统中所使用的混合技术介绍
http://www.cnblogs.com/EE-NovRain/p/3810737.html 各大公司广泛使用的在线学习算法FTRL详解
http://blog.csdn.net/wemedia/details.html?id=38193 这位成功转型机器学习的老炮,想把他多年的经验分享给你
http://www.docin.com/p-1583970966.html 标签共现的标签聚类算法
http://archive.ics.uci.edu/ml/index.php 流行训练数据集
http://bradyzhu.iteye.com/blog/2271057 http://blog.csdn.net/han_xiaoyang/article/details/49797143 机器学习实例:逻辑回归应用之Kaggle泰坦尼克之灾
http://www.lining0806.com/spark%E4%B8%8Epandas%E4%B8%ADdataframe%E5%AF%B9%E6%AF%94/ Spark与Pandas中DataFrame对比(详细)
http://blog.sina.com.cn/s/blog_5357c0af0102uxoh.html 机器学习常见的几个误区
https://github.com/PAIR-code/facets http://www.infoq.com/cn/news/2017/07/goole-sight-facets-ai 谷歌开源可视化工具Facets
http://bourneli.github.io/ml/2017/05/25/gdbt-lr-facebook-paper.html GBDT特征转换+LR总结
http://blog.csdn.net/shine19930820/article/details/71713680 GBDT原理及利用GBDT构造新的特征-Python实现
http://dataunion.org/14072.html 干货:结合Scikit-learn介绍几种常用的特征选择方法
http://blog.csdn.net/itplus/article/details/37969519 word2vec 中的数学原理详解
http://blog.csdn.net/myan/article/details/73435469 观点:深度学习,先跟上再说
http://www.uml.org.cn/ai/201708011.asp Youtube 短视频推荐系统变迁:从机器学习到深度学习
https://tracholar.github.io/machine-learning/2017/03/10/factorization-machine.html FM因子机深入解析
https://www.leiphone.com/news/201702/T5e31Y2ZpeG1ZtaN.html TensorFlow和Caffe、MXNet、Keras等其他深度学习框架的对比
https://mooc.study.163.com/smartSpec/detail/1001319001.htm 吴恩达最新深度学习课程在网易云课堂的中文版
https://zh.gluon.ai 李沐深度学习教程
https://tech.meituan.com/dl.html 深度学习在美团点评推荐平台排序中的运用
https://www.zhihu.com/question/22553761 如何简单形象又有趣地讲解神经网络是什么?
http://blog.csdn.net/lilyth_lilyth/article/details/48032119 CTR预估中GBDT与LR融合方案
http://www.csdn.net/article/2015-09-07/2825630 http://fastml.com/evaluating-recommender-systems/ 推荐系统评价:NDCG方法概述
http://blog.csdn.net/u010138758/article/details/69936041 信息检索中常用的评价指标:MAP,nDCG,ERR,F-measure
https://www.zybuluo.com/hanbingtao/note/433855 零基础入门深度学习系列
http://scs.ryerson.ca/~aharley/vis/conv/ 可视化
八卦文
https://mp.weixin.qq.com/s?__biz=MzI4MjYzNDA5Ng==&mid=2247487207&idx=3&sn=cd38f47a47a62d3039f0312ed0b07e05&pass_ticket=OZ7fIASx4mIuFtdr%2FiuCnSlaNlow7LFR4Vb3w1R3tSrQ1QiSa38c%2FaDRULPJHX7ZKaggle爆文:一个框架解决几乎所有机器学习问题
https://mp.weixin.qq.com/s?__biz=MjM5MTQzNzU2NA==&mid=2651653428&idx=1&sn=666d1107531fcca441f6297c7d28bbab&pass_ticket=OZ7fIASx4mIuFtdr%2FiuCnSlaNlow7LFR4Vb3w1R3tSrQ1QiSa38c%2FaDRULPJHX7Zhttp://www.iamwire.com/2016/10/approaching-almost-any-machine-learning-problem/142291?from=groupmessage&isappinstalled=0 解决机器学习问题有通法!看这一篇就够了!
https://towardsdatascience.com/the-10-deep-learning-methods-ai-practitioners-need-to-apply-885259f402c1 The 10 Deep Learning Methods AI Practitioners Need to Apply
视频
https://pan.baidu.com/share/init?shareid=3176952038&uk=3980940323 CAO珍藏 密码puf7
https://www.coursera.org/learn/machine-learning 吴恩达Machine Learning 课程
https://www.deeplearning.ai/ https://www.coursera.org/learn/neural-networks-deep-learning 吴恩达Deep Leanring课程
http://open.163.com/movie/2012/2/I/D/M8FH262HJ_M8FU27PID.html 加州理工学院公开课:机器学习与数据挖掘
电子书
http://pan.baidu.com/share/init?shareid=1894200811&uk=2466840270 密码ov1s