使用Optimalflow Web应用程序构建无代码的自动机器学习模型

In the latest version(0.1.10) of OptimalFlow, it added a “no-code” Web App as an application demo built on OptimalFlow. The web app allows simple click and selection for all of the parameters inside of OptimalFLow, which means users could build end-to-end Automated Machine Learning workflow without coding at all! (Documentation).

在最新版本(0.1.10) OptimalFlow ,它增加了一个“无码” Web应用程序是建立在OptimalFlow的应用演示。 该Web应用程序允许对OptimalFLow内部的所有参数进行简单的单击和选择,这意味着用户无需编写代码就可以构建端到端的自动化机器学习工作流程! (文档)。

OptimalFlow was designed highly modularized at the beginning, which made it easy to continue developing. And users could build applications based on it. The web app of OptimalFlow is a user-friendly tool for people who don’t have coding experience to build an Omni-ensemble Automated Machine Learning workflow simply and quickly.

OptimalFlow从一开始就被高度模块化地设计,这使得继续开发变得容易。 用户可以基于它构建应用程序。 OptimalFlow的Web应用 对于没有编码经验的人们来说,它是一种用户友好的工具,可用来快速简便地构建Omni-ensemble自动化机器学习工作流程。

Why we use OptimalFlow? You could read another story of its introduction: An Omni-ensemble Automated Machine Learning — OptimalFlow.

为什么我们使用OptimalFlow ? 您可以阅读有关它的介绍的另一个故事: 全集成自动机器学习-OptimalFlow

Image for post

The Web App currently includes 3 function components: Pipeline Cluster Traversal Experiment(PCTE) Builder, LogsViewer, and Visualization. Users can set up the parameters for each OptimalFlow’s key module, and run the workflow all by clicking on the web page. Also, when the automated machine learning process(PCTE) has done, users could check logs information and review the visualization outputs immediately.

该Web应用程序当前包括3个功能组件:管道集群遍历实验(PCTE)构建器,LogsViewer和可视化。 用户可以为每个OptimalFlow的关键模块设置参数,并通过单击网页来运行所有工作流。 此外,完成自动机器学习过程(PCTE)后,用户可以检查日志信息并立即查看可视化输出。

Comparing other popular ‘no-code’ machine learning platforms, the Web App is born from OptimalFlow’s unique Omni-ensemble Automated Machine Learning framework and built on Pipeline Cluster Traversal Experiments(PCTE) theory. It’s also light, scalable, and open-source to users. It was designed for data scientists who need more focus on local implementation and can be easily transplanted among machines. When the data science team wants to deploy it on a server, Flask based Web App can be quickly implemented with high flexibility.

与其他流行的“无代码”机器学习平台相比,该Web App源自OptimalFlow独特的Omni-ensemble自动机器学习框架,并基于管道集群遍历实验(PCTE)理论构建。 对用户而言,它也是轻量级,可伸缩的和开源的。 它是为数据科学家设计的,他们需要更多地关注本地实施,并且可以轻松地在机器之间进行移植。 当数据科学团队希望将其部署在服务器上时,基于Flask的Web App可以快速灵活地实现。

如何部署Web App: (How to deploy Web App:)

步骤1:安装OptimalFlow(Step 1: Install OptimalFlow:)

If you didn’t install OptimalFlow’s latest version, you should do that from PYPI, simply using PIP. To install OptimalFlow’s latest version, run this command in your terminal or prompt:

如果您未安装OptimalFlow 最新版本,您应该使用PIP从PYPI进行操作。 要安装OptimalFlow的最新版本,请在您的终端或提示符下运行以下命令:

pip install --upgrade optimalflow

步骤2:下载Web App的源代码: (Step 2: Download Web App’s source code:)

There’re 2 ways to download its source code:

有两种下载其源代码的方法:

  • Method 1 Get it from OptimalFlow GitHub. Download all codes and sub-folders within the folder ‘webapp’.

    方法1从OptimalFlow GitHub获取它。 下载文件夹“ webapp ”中的所有代码和子文件夹。

Image for post

Method 2 — Get from OptimalFlow’s package. You could find it in your Python core folder. Here’s the example where you can find the ‘Webapp’ folder.

方法2 —从OptimalFlow的程序包中获取。 您可以在Python核心文件夹中找到它。 在此示例中,您可以找到“ Webapp ”文件夹。

Image for post

步骤3:启动服务器端口: (Step 3: Start a server port:)

Move the downloaded ‘Webapp’ folder to the place you want to store it. Using your Terminal or Prompt enter its directory and run the app.py (ignore ‘py’ here when you are not playing it in VS Code)

将下载的“ Webapp”文件夹移至您要存储的位置。 使用终端或提示符进入其目录并运行app.py(如果您不在VS Code中播放,请在此处忽略“ py”)

Image for post

And you will see the output similar to the picture below:

您将看到类似于下图的输出:

Image for post

Done! You can now open your browser (recommend Chrome), and type 127.0.0.1:500 to open the web app on your laptop.

做完了! 现在,您可以打开浏览器(推荐使用Chrome),然后键入127.0.0.1:500在笔记本电脑上打开网络应用。

建立管道集群遍历实验(PCTE)工作流程: (Build Pipeline Cluster Traversal Experiment(PCTE) Workflow:)

When you’ve done the deployment steps, and start running the server, you will see the web app in your browser like the picture below:

完成部署步骤并开始运行服务器后,您将在浏览器中看到该Web应用程序,如下图所示:

Image for post

To build a PCTE Automated Machine Learning Workflow, you need to enter the parameters for each OptimalFlow’s module: autoPP, autoFS, and autoCV.

要构建PCTE自动化机器学习工作流程,您需要输入每个OptimalFlow的参数 模块: autoPPautoFSautoCV

But before that, you need to move the dataset you want to build model into the ./input folder. You will see a sample dataset called “breast-cancer.csv” there.

但是在此之前,您需要将要构建模型的数据集移动到./input文件夹中。 您将在此处看到一个名为“ breast-cancer.csv”的样本数据集。

After doing that, we could select the dataset by clicking the button under “Load Dataset”:

之后,我们可以通过单击“加载数据集”下的按钮来选择数据集:

Image for post

Then, you need to finish the settings for autoPP, autoFS, and autoCV step by step.

然后,您需要逐步完成autoPPautoFSautoCV的设置。

Please NOTE: be careful to the setting “sparsity” and “cols” values in autoPP parameters panel, if the combination of the feature preprocessing can’t meet the restriction values you’ve set, OptimalFlow will not able to continue running the following modules. More details could be found in autoPP’s Documentation.

请注意:请注意autoPP参数面板中设置“稀疏度”和“ cols”值,如果功能预处理的组合不能满足您设置的限制值, OptimalFlow将无法继续运行以下模块。 可以在autoPP的文档中找到更多详细信息。

Image for post

Next, you need to finish set parameters in autoFS module, i.e. set how many top features you want to select, and what algorithm you want PCTE to go through. More details about autoFS module here.

接下来,您需要在autoFS模块中完成设置参数,即设置要选择的主要功能以及PCTE要通过的算法。 有关autoFS模块的更多详细信息,请参见此处

Image for post

For the autoCV module’s settings, you could click the autoCVSet Parameters’ button, and enter the tuning strategy and algorithms comparison scope you want OptimalFlow to automatedly test. More details about autoCV module here.

对于autoCV模块的设置,您可以单击autoCV的设置参数”按钮,然后输入要OptimalFlow自动测试的调整策略和算法比较范围。 有关autoCV模块的更多详细信息,点击这里

Image for post

Finally, when you finished all setting steps(the progress bar should be fulfilled 100%), you can click the ‘Build PCTE Workflow’ button, to run the Omni-ensemble Automated Machine Learning operation. This function is based on autoPipe module, more details about autoPipe is here.

最后,当您完成所有设置步骤(进度条应满足100%)时,您可以单击“ Build PCTE Workflow ”按钮,以运行Omni-ensemble自动化机器学习操作。 此功能基于autoPipe模块,有关autoPipe的更多详细信息在这里

Image for post

Here are the sample outputs when the process’s done. You will find the Top 5 Optimal models with evaluation metrics there. More details and use cases about OptimalFlow modules and notebook samples could be found in Documentation.

这是该过程完成后的示例输出。 您将在其中找到具有评估指标的前5个最佳模型。 有关OptimalFlow模块和笔记本样本的更多详细信息和用例,参见“文档”

Image for post

LogsViewer: (LogsViewer:)

You could view the logs information related to each module in ‘LogsViewer’ page. Please note, you could only use it after the previous automated machine learning process’s done.

您可以在“ LogsViewer ”页面中查看与每个模块相关的日志信息。 请注意,您只能在上一个自动机器学习过程完成后使用它。

Image for post

可视化: (Visualization:)

You could also review the visualization outputs from the ‘Visualization’ page. For the classification problems, it will present ‘Pipeline Cluster Traversal Experiments Model Retrieval Diagram’ and ‘Pipeline Cluster Model Evaluation Dynamic Table’. And for the regression problem, it only will present ‘Pipeline Cluster Model Evaluation Dynamic Table’. All visualization outputs are based on Plotly, so you could play with them by dragging or scrolling. You will find more details on the example page.

您也可以从“可视化”页面查看可视化输出。 对于分类问题,将显示“管道集群遍历实验模型检索图”和“管道集群模型评估动态表”。 对于回归问题,它只会显示“管道集群模型评估动态表”。 所有可视化输出均基于Plotly,因此您可以通过拖动或滚动来使用它们。 您可以在示例页面上找到更多详细信息。

Image for post

更新: (Updates:)

搜索空间 (SearchingSpace)

In OptimalFlow’s next version 0.1.11, you could edit the estimators’ Searching Space via the Web App.

在OptimalFlow的下一版本0.1.11中,您可以通过Web App编辑估算器的搜索空间。

Image for post
Image for post

综上所述: (In Summary:)

OptimalFlow is an easy-use API tool to achieve Omni-ensemble automated machine learning with simple code, and it’s also a best practice library to prove Pipeline Cluster Traversal Experiments (PCTE) theory.

OptimalFlow是一种易于使用的API工具,可通过简单的代码实现Omni集成的自动化机器学习,它还是证明管道集群遍历实验(PCTE)理论的最佳实践库。

The Web App in its latest version is a handful application built on OptimalFlow Library, and make the Automated Machine Learning workflow building process easy and no-code at all.

最新版本的Web应用程序是基于OptimalFlow库构建的少数应用程序,它使自动机器学习工作流程的构建过程变得简单而没有代码。

Its 6 modules could not only be connected to implement PCTE process, but also could be used individually to optimize traditional machine learning workflow’s components. You can find their individual use cases in Documentation.

它的6个模块不仅可以连接以实施PCTE流程,而且可以单独用于优化传统机器学习工作流程的组件。 您可以在“文档”中找到其各自的用例。

Image for post

Related Readings about OptimalFlow:

有关OptimalFlow的相关阅读:

Ensemble Feature Selection in Machine Learning using OptimalFlow — Easy Way with Simple Code to Select top Features

使用OptimalFlow集成机器学习中的特征选择-通过简单的代码选择主要特征的简便方法

Ensemble Model Selection & Evaluation in Machine Learning using OptimalFlow — Easy Way with Simple Code to Select the Optimal Model

使用OptimalFlow进行机器学习中的集成模型选择和评估-通过简单的代码选择最优模型的简便方法

End-to-end OptimalFlow Automated Machine Learning Tutorial with Real Projects-Formula E Laps Prediction Part 1

具有实际项目的端到端OptimalFlow自动化机器学习教程-公式E失效预测第1部分

End-to-end OptimalFlow Automated Machine Learning Tutorial with Real Projects-Formula E Laps Prediction Part 2

具有实际项目的端到端OptimalFlow自动化机器学习教程-公式E失效预测第2部分

关于我:(About me:)

I am a healthcare & pharmaceutical data scientist and big data Analytics & AI enthusiast. I developed OptimalFlow library to help data scientists building optimal models in an easy way, and automate Machine Learning workflow with simple codes.

我是医疗保健和制药数据科学家以及大数据分析和AI爱好者。 我开发了OptimalFlow库,以帮助数据科学家以简单的方式构建最佳模型,并使用简单的代码使机器学习工作流程自动化。

As a big data insights seeker, process optimizer, and AI professional with years of analytics experience, I use machine learning and problem-solving skills in data science to turn data into actionable insights while providing strategic and quantitative products as solutions for optimal outcomes.

作为具有多年分析经验的大数据洞察力寻求者,流程优化者和AI专业人员,我使用数据科学中的机器学习和问题解决技能将数据转化为可行的洞察力,同时提供战略和定量产品作为最佳结果的解决方案。

You can connect with me on LinkedIn or GitHub.

您可以在LinkedInGitHub上与我联系。

翻译自: https://towardsdatascience.com/build-no-code-automated-machine-learning-model-with-optimalflow-web-app-8acaad8262b1

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值