《大数据机器学习实践探索》 ---- 使用spark MLlib进行机器学习（1.简介 -- 从机器学习说起）

shiter

于 2021-04-13 13:41:31 发布

阅读量340

点赞数

分类专栏：大数据机器学习实践探索基于大数据的机器学习原理与最佳实践文章标签： pyspark mllib ml

本文链接：https://blog.csdn.net/wangyaninglm/article/details/115561697

版权

大数据机器学习实践探索同时被 2 个专栏收录

130 篇文章 124 订阅 ¥29.90 ¥99.00

订阅专栏

超级会员免费看

基于大数据的机器学习原理与最佳实践

81 篇文章 140 订阅 ¥29.90 ¥99.00

订阅专栏

超级会员免费看

Up until this point, we have focused on data engineering workloads with Apache Spark. Data engineering is often a precursory step to preparing your data for machine learning (ML) tasks, which will be the focus of this chapter. We live in an era in which machine learning and artificial intelligence applications are an integral part of our lives.

Chances are that whether we realize it or not, every day we come into contact with ML models for purposes such as online shopping recommendations and adver‐ tisements, fraud detection, classification, image recognition, pattern matching, and more. These ML models drive important business decisions for many companies. According to this McKinsey study, 35% of what consumers purchase on Amazon and 75% of what they