机器学习之----数据预处理

最新推荐文章于 2022-06-08 15:42:06 发布

Adam婷

最新推荐文章于 2022-06-08 15:42:06 发布

阅读量412

点赞数

分类专栏： AI程序员算法机器学习

本文链接：https://blog.csdn.net/weixin_41697507/article/details/86769419

版权

166 篇文章 8 订阅 ¥19.90 ¥99.00

订阅专栏

161 篇文章 8 订阅 ¥19.90 ¥99.00

订阅专栏

161 篇文章 4 订阅

订阅专栏

数据预处理方法实现

Step 1: Importing the required Libraries
These Two are essential libraries which we will import every time.
NumPy is a Library which contains Mathematical functions.
Pandas is the library used to import and manage the data sets.
Step 2: Importing the Data Set
Data sets are generally available in .csv format. A CSV file stores tabular data in plain text. Each line of the file is a data record. We use the read_csv method of the pandas library to read a local CSV file as a dataframe.
Step 3: Handling the Missing Data
The data we get is rarely homogeneous. Data can be missing due to various reasons and needs to be handled so that it does not reduce the performance of our machine learning model. We can replace the missing data by the Mean or Median of the entire column. We use Imputer class of sklearn.preprocessing for this task.
Step 4:

了解本专栏

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

关注关注