Paramount 项目使用教程

最新推荐文章于 2024-12-10 14:53:51 发布

邓越浪Henry

最新推荐文章于 2024-12-10 14:53:51 发布

阅读量802

点赞数 5

本文链接：https://blog.csdn.net/gitblog_00287/article/details/142609353

版权

Paramount 项目使用教程

paramount Agent accuracy measurements for LLMs 项目地址: https://gitcode.com/gh_mirrors/pa/paramount

1. 项目介绍

Paramount 是一个用于评估 AI 聊天代理准确性的开源项目。它允许专家代理评估 AI 聊天，从而实现以下功能：

质量保证
捕获地面真实数据
自动化回归测试

Paramount 可以在私有环境中完全离线运行，确保数据的安全性和隐私性。

2. 项目快速启动

安装

首先，使用 pip 安装 Paramount：

pip install paramount

使用示例

以下是一个简单的使用示例，展示了如何使用 Paramount 记录 AI 函数的调用：

from paramount import record

@record()
def my_ai_function(message_history, new_question):
    # 输入
    new_message = {'role': 'user', 'content': new_question}
    updated_history = message_history + [new_message]
    
    # LLM 调用发生在这里
    
    return updated_history  # 输出

# 多次运行 my_ai_function() 后，启动 Paramount UI 以评估结果
paramount

配置

为了成功设置 Paramount，您需要在项目根目录中添加一个 paramount.toml 配置文件。该文件将自动生成默认配置，如果它尚不存在。

[record]
enabled = true
function_url = "http://localhost:9000"  # 您的 LLM API Flask 应用的 URL

[db]
type = "csv"  # 也可以使用 postgres

[db.postgres]
connection_string = ""

[api]
endpoint = "http://localhost"  # Paramount UI/API 的 URL 和端口
port = 9001
split_by_id = false  # 如果您有多个机器人并希望按 ID 拆分
identifier_colname = ""

[ui]
meta_cols = ['recorded_at']
input_cols = ['args__message_history', 'args__new_question']  # 匹配 my_ai_function() 示例
output_cols = ['1', '2']  # 1 和 2 是示例中 llm_answer 和 llm_references 的索引
chat_list = "output__1"  # 匹配输出 updated_history，必须是字典列表以显示聊天格式
chat_list_role_param = "role"  # 列表中描述角色的键
chat_list_content_param = "content"  # 列表中描述内容的键