第17篇：基于Milvus的性能测试与代码实现

最新推荐文章于 2025-04-08 19:47:55 发布

Gemini技术窝

最新推荐文章于 2025-04-08 19:47:55 发布

阅读量2k

点赞数 30

分类专栏： Milvus应用实战：AI与向量数据库的完美结合文章标签： milvus 功能测试 python

本文链接：https://blog.csdn.net/wjm1991/article/details/139815493

版权

Milvus应用实战：AI与向量数据库的完美结合专栏收录该内容

24 篇文章

订阅专栏

随着人工智能和大数据技术的快速发展，高效的向量检索系统成为了许多应用的核心组件。Milvus作为一个开源的向量数据库，专为处理大规模、高维向量数据的检索而设计。在实际应用中，评估Milvus的性能是非常重要的一环。本文将详细介绍如何对Milvus进行性能测试与基准，包括确定性能指标、基准测试方法和输出测试效果图片。

一、性能测试指标

在进行性能测试时，首先需要确定合理的性能指标。常见的性能测试指标包括：

查询时间（Query Time）： 测量执行一次查询所需的时间。
吞吐量（Throughput）： 单位时间内能够处理的查询数量。
延迟（Latency）： 查询的响应时间，通常关注P95、P99等高百分位延迟。
检索精度（Recall）： 检索结果中正确结果的比例。
内存使用（Memory Usage）： 系统在运行过程中的内存占用情况。
存储使用（Storage Usage）： 数据库存储向量数据所占用的存储空间。
CPU使用率（CPU Usage）： 系统在运行过程中的CPU占用情况。

这些指标能够全面评估Milvus在不同场景下的性能表现。

1.1 指标说明

查询时间（Query Time）

查询时间是衡量数据库在处理单个查询时所需的时间。它直接反映了数据库的响应速度。

吞吐量（Throughput）

吞吐量表示单位时间内数据库能够处理的查询数量，是衡量数据库并发处理能力的重要指标。

延迟（Latency）

延迟是指查询从发出到接收响应所需的时间。高百分位延迟（如P95、P99）反映了系统在高负载下的响应时间。

检索精度（Recall）

检索精度衡量检索结果中正确结果的比例，是衡量向量检索系统准确性的重要指标。

内存使用（Memory Usage）

内存使用反映系统在运行过程中的内存消耗情况，影响系统的稳定性和可扩展性。

存储使用（Storage Usage）

存储使用反映数据库存储向量数据所占用的存储空间，影响数据存储的成本。

CPU使用率（CPU Usage）

CPU使用率反映系统在运行过程中的CPU资源消耗情况，影响系统的处理能力和响应速度。

二、基准测试方法

基准测试是评估数据库性能的重要手段。针对Milvus的基准测试方法包括数据准备、测试环境配置、测试工具选择和测试方案设计等步骤。

2.1 数据准备

基准测试需要使用真实或模拟的数据集。常见的数据集包括SIFT1M、DEEP1B等，这些数据集涵盖了不同规模和维度的向量数据。

数据集下载

可以通过以下脚本下载SIFT1M数据集：

wget ftp://ftp.irisa.fr/local/texmex/corpus/sift.tar.gz
tar -xvzf sift.tar.gz

2.2 测试环境配置

测试环境包括硬件配置、操作系统和Milvus服务器的配置。以下是一个典型的测试环境配置示例：

硬件配置

CPU：Intel Xeon E5-2680 v4
内存：128GB
存储：SSD 1TB

软件配置

操作系统：Ubuntu 20.04
Milvus版本：2.0.0
Python版本：3.8

2.3 测试工具选择

常用的基准测试工具包括Milvus自带的Benchmark工具和第三方测试工具。本文将使用Milvus Benchmark工具进行测试。

安装Milvus Benchmark工具

pip install pymilvus
pip install milvus-benchmark

2.4 测试方案设计

测试方案设计包括测试场景、测试指标和测试步骤等内容。以下是一个基准测试方案设计示例：

测试场景

插入性能测试：测试Milvus在不同规模数据集下的插入性能。
检索性能测试：测试Milvus在不同查询负载下的检索性能。
内存和存储使用测试：测试Milvus在不同规模数据集下的内存和存储使用情况。

测试步骤

启动Milvus服务器并配置数据库。
加载数据集并进行向量插入测试。
执行检索查询并记录查询时间、延迟和吞吐量。
监控内存和存储使用情况。
分析测试结果并输出测试报告。

三、基于Milvus的性能测试与基准

3.1 数据加载和插入性能测试

首先，我们将进行数据加载和插入性能测试。以下是详细的代码实现：

from pymilvus import connections, Collection, CollectionSchema, FieldSchema, DataType
import numpy as np
import time

# 连接到Milvus服务器
connections.connect("default", host="localhost", port="19530")

# 定义集合的字段
fields = [
    FieldSchema(name="id", dtype=DataType.INT64, is_primary=True, auto_id=True),
    FieldSchema(name="embedding", dtype=DataType.FLOAT_VECTOR, dim=128)
]

# 创建集合
schema = CollectionSchema(fields, "SIFT1M dataset collection")
collection = Collection("sift1m_collection", schema)

# 加载数据集
data = np.load("sift1m.npy")

# 插入数据并记录时间
start_time = time.time()
collection.insert(data)
end_time = time.time()

print(f"Data insertion time: {end_time - start_time} seconds")

流程图：

3.2 检索性能测试

接下来，我们将进行检索性能测试。以下是详细的代码实现：

from pymilvus import Collection
import numpy as np
import time

# 连接到Milvus服务器
connections.connect("default", host="localhost", port="19530")

# 加载集合
collection = Collection("sift1m_collection")

# 随机生成查询向量
query_vector = np.random.random((1, 128)).astype(np.float32)

# 设置检索参数
search_params = {"metric_type": "L2", "params": {"nprobe": 10}}

# 执行检索并记录时间
start_time = time.time()
results = collection.search(query_vector, "embedding", search_params, limit=10)
end_time = time.time()

print(f"Query time: {end_time - start_time} seconds")
print(f"Search results: {results}")

流程图：

3.3 内存和存储使用测试

在进行性能测试的同时，我们还需要监控系统的内存和存储使用情况。以下是详细的代码实现：

import psutil
import os

# 获取内存使用情况
memory_info = psutil.virtual_memory()
print(f"Memory usage: {memory_info.percent}%")

# 获取存储使用情况
storage_info = psutil.disk_usage('/')
print(f"Storage usage: {storage_info.percent}%")

3.4 完整性能测试脚本

以下是一个完整的性能测试脚本，包括数据加载、插入性能测试、检索性能测试和内存、存储使用测试：

import numpy as np
import time
import psutil
from pymilvus import connections, Collection, CollectionSchema, FieldSchema, DataType

# 连接到Milvus服务器
connections.connect("default", host="localhost", port="19530")

# 定义集合的字段
fields = [
    FieldSchema(name="id", dtype=DataType.INT64, is_primary=True, auto_id=True),
    FieldSchema(name="embedding", dtype=DataType.FLOAT_VECTOR, dim=128)
]

# 创建集合
schema = CollectionSchema(fields, "SIFT1M dataset collection")
collection = Collection("sift1m_collection", schema)

# 加载数据集
data = np.load("sift1m.npy")

# 插入数据并记录时间
start_time = time.time()
collection.insert

(data)
end_time = time.time()

print(f"Data insertion time: {end_time - start_time} seconds")

# 随机生成查询向量
query_vector = np.random.random((1, 128)).astype(np.float32)

# 设置检索参数
search_params = {"metric_type": "L2", "params": {"nprobe": 10}}

# 执行检索并记录时间
start_time = time.time()
results = collection.search(query_vector, "embedding", search_params, limit=10)
end_time = time.time()

print(f"Query time: {end_time - start_time} seconds")
print(f"Search results: {results}")

# 获取内存使用情况
memory_info = psutil.virtual_memory()
print(f"Memory usage: {memory_info.percent}%")

# 获取存储使用情况
storage_info = psutil.disk_usage('/')
print(f"Storage usage: {storage_info.percent}%")

四、测试效果图片

为了更好地展示测试效果，可以生成相应的图表。以下是使用Matplotlib生成测试效果图的示例代码：

4.1 生成插入性能图表

import matplotlib.pyplot as plt

# 数据插入时间
insertion_times = [end_time - start_time for _ in range(10)]  # 假设进行了10次插入测试
plt.figure(figsize=(10, 5))
plt.plot(insertion_times, label="Insertion Time")
plt.xlabel("Test Iterations")
plt.ylabel("Time (seconds)")
plt.title("Data Insertion Performance")
plt.legend()
plt.show()

4.2 生成检索性能图表

# 查询时间
query_times = [end_time - start_time for _ in range(10)]  # 假设进行了10次检索测试
plt.figure(figsize=(10, 5))
plt.plot(query_times, label="Query Time")
plt.xlabel("Test Iterations")
plt.ylabel("Time (seconds)")
plt.title("Query Performance")
plt.legend()
plt.show()

4.3 生成内存和存储使用图表

# 内存使用情况
memory_usages = [psutil.virtual_memory().percent for _ in range(10)]  # 假设监控了10次
# 存储使用情况
storage_usages = [psutil.disk_usage('/').percent for _ in range(10)]  # 假设监控了10次

plt.figure(figsize=(10, 5))
plt.plot(memory_usages, label="Memory Usage (%)")
plt.plot(storage_usages, label="Storage Usage (%)")
plt.xlabel("Test Iterations")
plt.ylabel("Usage (%)")
plt.title("Memory and Storage Usage")
plt.legend()
plt.show()

五、总结

本文详细介绍了基于Milvus的性能测试与基准，包括确定性能指标、基准测试方法和测试效果的可视化。通过合理的性能测试，可以全面评估Milvus在不同场景下的性能表现，从而指导系统的优化和改进。希望本文对大家理解和实施性能测试有所帮助。
通过上述流程和代码示例，可以高效地进行Milvus的性能测试与基准评估，确保系统在实际应用中的高效稳定运行。

如果你喜欢这篇文章，别忘了收藏文章、关注作者、订阅专栏，感激不尽。