azure机器学习_Azure机器学习中的预测

azure机器学习 介绍 (Introduction) After discussing the basic features of azure machine learning and how to clean the data from Azure Machine learning, let us look at how to perform prediction in ...
摘要由CSDN通过智能技术生成

azure机器学习

介绍 (Introduction)

After discussing the basic features of azure machine learning and how to clean the data from Azure Machine learning, let us look at how to perform prediction in Azure Machine Learning. Prediction is one of the important aspects of machine learning as it will help to make strategic decisions.

在讨论了天蓝色机器学习的基本功能以及如何从Azure机器学习中清除数据之后,让我们看一下如何在Azure机器学习中执行预测。 预测是机器学习的重要方面之一,因为它将有助于做出战略决策。

数据选择和数据清理 (Data Selection and Data Cleaning)

Like we did in the last two articles, let us quickly select a data source, Bike Buyer, and apply the normalization if needed after choosing only the relevant columns as shown in the below screenshot.

就像我们在前两篇文章中所做的一样,让我们​​快速选择一个数据源“ Bike Buyer”,并在只选择了相关列之后根据需要应用规范化,如下面的屏幕快照所示。

Setting up with a data set in Azure Machine Learning.

In the above configuration, columns such as addresses, names, keys and attributes with different languages other than English were removed. These columns were ignored to improve the Prediction in Azure Machine Learning as those columns do not make major contributions towards buying a bike. For example, customer name and address will not make any contributions towards a bike buyer.

在上述配置中,地址,名称,键和属性之类的列已用英语以外的其他语言删除。 忽略这些列以提高Azure机器学习的预测能力,因为这些列在购买自行车方面没有做出重大贡献。 例如,客户名称和地址不会对自行车购买者做出任何贡献。

数据分割 (Data Split)

In most of the machine learning techniques, we need to have two datasets, Training and Testing datasets. The training dataset is used to build the model while the Testing dataset is used to evaluate the trained model. Though there are different techniques available for data splitting, like SQL Server, in Azure Machine Learning, the percentage split is the available technique to split the data. In this technique, random data of a given percentage will be split to train and test data.

在大多数机器学习技术中,我们需要有两个数据集,即训练和测试数据集。 训练数据集用于构建模型,而测试数据集用于评估训练后的模型。 尽管在Azure机器学习中有多种可用的数据拆分技术(例如SQL Server),但百分比拆分是拆分数据的可用技术。 在这种技术中,给定百分比的随机数据将被拆分为训练和测试数据。

Under Data Transformation -> Sample & Split, there is a control called Split Data. It is configured as shown in the below screenshot.

在“ 数据转换 -> 采样与拆分”下 ,有一个名为“ 拆分数据”的控件 如以下屏幕截图所示配置它。

Including the Split Data control.

The following are the settings for the Split Data in Prediction in Azure Machine Learning.

以下是Azure Machine Learning中的预测中的拆分数据设置。

Configuration of Split data control.

As shown in the above configuration, the train dataset is 0.7 from the dataset. Further, it is a random split and data will be split randomly to train and test dataset. When the Stratified split is set to true, data is split such a way that both training and testing dataset has almost the same sample percentage of data for the selected column. In this configuration,

  • 0
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值