Amazon SageMaker and MLflow Integration Project Tutorial

Article link: https://blog.csdn.net/gitblog_00470/article/details/142506267


amazon-sagemaker-mlflow-fargate: Managing your machine learning lifecycle with MLflow and Amazon SageMaker. Project repository: https://gitcode.com/gh_mirrors/am/amazon-sagemaker-mlflow-fargate

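1. Project Overview

This project shows how to deploy MLflow on AWS Fargate and use it together with Amazon SageMaker to manage the machine learning lifecycle. With it, you can use Amazon SageMaker to develop, train, tune, and deploy Scikit-Learn-based machine learning models (such as a random forest model), and use MLflow to track experiment runs and models.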

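Key Features

MLflow tracking server: hosts a serverless MLflow server on AWS Fargate, with S3 as the artifact store and RDS as the backend store.
Experiment tracking: uses MLflow to track experiments run on SageMaker.
Model registration: registers models trained in SageMaker in the MLflow Model Registry.
Model deployment: deploys MLflow models to SageMaker endpoints.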
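
To make the experiment-tracking feature more concrete, here is a minimal Python sketch of how a training script might log one run to the tracking server. It assumes the `mlflow` package is installed wherever the script runs. The helper names `flatten_params` and `log_training_run`, the tracking URI, and the experiment name are all hypothetical and are not taken from the project itself.

```python
from typing import Any, Dict


def flatten_params(params: Dict[str, Any], prefix: str = "") -> Dict[str, Any]:
    """Flatten nested hyperparameters into dotted keys,
    e.g. {'rf': {'max_depth': 5}} becomes {'rf.max_depth': 5}."""
    flat: Dict[str, Any] = {}
    for key, value in params.items():
        name = f"{prefix}.{key}" if prefix else str(key)
        if isinstance(value, dict):
            flat.update(flatten_params(value, name))
        else:
            flat[name] = value
    return flat


def log_training_run(tracking_uri: str, experiment: str,
                     params: Dict[str, Any], metrics: Dict[str, float]) -> str:
    """Log one run to an MLflow tracking server and return its run ID."""
    # Imported lazily so flatten_params stays usable without mlflow installed.
    import mlflow

    # Assumption: tracking_uri is the URL of the Fargate-hosted MLflow server.
    mlflow.set_tracking_uri(tracking_uri)
    mlflow.set_experiment(experiment)
    with mlflow.start_run() as run:
        mlflow.log_params(flatten_params(params))
        mlflow.log_metrics(metrics)
        return run.info.run_id
```

A training script could then call `log_training_run("http://<tracking-server-host>", "my-experiment", {"rf": {"n_estimators": 100}}, {"mse": 0.12})`, where the host and experiment name are placeholders to be replaced with your own values.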

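2. Quick Start

Prerequisites

Before you begin, make sure you meet the following requirements:

You have an AWS account.
The AWS CDK is installed and configured.
Docker is installed, for building the MLflow container image and pushing it to ECR.
You have cloned this project to your local environment.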
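
The prerequisites above can be checked with a small preflight script before deploying. The following is a sketch only: it assumes the command-line tools are exposed on `PATH` under the executable names `aws`, `cdk`, and `docker`, and the helper name `missing_tools` is hypothetical. It does not verify that credentials are configured or that the Docker daemon is running.

```python
import shutil
from typing import Iterable, List


def missing_tools(tools: Iterable[str]) -> List[str]:
    """Return the tools from `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    # Assumed executable names for the AWS CLI, the AWS CDK, and Docker.
    missing = missing_tools(["aws", "cdk", "docker"])
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("All required tools were found on PATH.")
```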

Deployment Steps

Install the AWS CDK
```
npm install -g aws-cdk@2.51.1
```

Create a virtual environment and install dependencies

```
python3 -m venv .venv
source .venv/bin/activate
pip3 install -r requirements.txt
```

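Deploy the CDK stack

```
# Look up the account ID of the current credentials (tr strips the JSON quotes)
ACCOUNT_ID=$(aws sts get-caller-identity --query Account | tr -d '"')
# Use the default region from the AWS CLI configuration
AWS_REGION=$(aws configure get region)
# Bootstrap the CDK environment, then deploy the stack
cdk bootstrap aws://$ACCOUNT_ID/$AWS_REGION
cdk deploy --parameters ProjectName=mlflow --require-approval never
```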
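
Once the deployment finishes, you may want to read back any values the stack exposes, such as the address of the tracking server. The sketch below wraps the AWS CLI command `aws cloudformation describe-stacks` and turns its JSON response into a dictionary. The stack name and the output keys depend on the project's CDK app and are not stated in this tutorial, so treat them as placeholders to look up yourself. The helper names `parse_stack_outputs` and `fetch_stack_outputs` are hypothetical.

```python
import json
import subprocess
from typing import Dict


def parse_stack_outputs(describe_stacks_json: str) -> Dict[str, str]:
    """Map OutputKey to OutputValue for the first stack in a describe-stacks response."""
    stacks = json.loads(describe_stacks_json).get("Stacks", [])
    if not stacks:
        return {}
    # A stack without outputs has no "Outputs" key, so default to an empty list.
    return {o["OutputKey"]: o["OutputValue"] for o in stacks[0].get("Outputs", [])}


def fetch_stack_outputs(stack_name: str) -> Dict[str, str]:
    """Call the AWS CLI and return the outputs of the named CloudFormation stack."""
    result = subprocess.run(
        ["aws", "cloudformation", "describe-stacks",
         "--stack-name", stack_name, "--output", "json"],
        check=True, capture_output=True, text=True,
    )
    return parse_stack_outputs(result.stdout)
```

Calling `fetch_stack_outputs("<your-stack-name>")` requires the AWS CLI to be installed and configured for the same account and region used for the deployment.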