What is Milvus? The Milvus vector database

https://www.ibm.com/think/topics/milvus

What is Milvus?

Milvus is an open-source vector database developed by Zilliz. Milvus is known for providing scalable storage for large amounts of vector embeddings and supporting high-performance similarity searches of vector data.

Zilliz first developed Milvus in 2017 and contributed the project to the Linux® Foundation in 2020. Milvus is now available as both an open-source software under the Apache License 2.0 and as a fully managed cloud service from Zilliz.


What are vector databases, and why do they matter?

Vector databases store and manage datasets in the form of vectors. They can help organizations manage unstructured data, and they are critical to advanced artificial intelligence (AI) and machine learning (ML) efforts.

Vectors are arrays of numbers that represent complex concepts and objects, such as words and images.  

Unstructured data—such as text, video and audio—makes up a significant portion of enterprise data today, but traditional databases are often ill-suited to organize and manage this data.  

Organizations can feed this data to specialized deep learning embedding models, which output vector representations called “embeddings.” For example, the word “cat” might be represented by the vector [0.2, -0.4, 0.7], while the word “dog” might be represented by [0.6, 0.1, 0.5].
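Closeness between embeddings is typically measured with cosine similarity. A minimal sketch, using only the toy three-dimensional vectors above (real embeddings usually have hundreds or thousands of dimensions):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: closer to 1.0 means more similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

cat = [0.2, -0.4, 0.7]
dog = [0.6, 0.1, 0.5]
print(round(cosine_similarity(cat, dog), 3))  # → 0.657
```

The moderately high score reflects that "cat" and "dog" are related concepts; an unrelated word's embedding would score much lower.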

Transforming data into vectors enables organizations to store different kinds of unstructured data in a shared format in one vector database.  

Vectors also help organizations unlock the value of this data for AI and ML. Vectors capture the semantic relationships between elements, enabling effective processing by large language models (LLMs) and generative AI (gen AI) tools. Most advanced AI and ML applications today rely on vectors for training and content generation.

Like other vector databases, Milvus gives organizations a way to manage and organize embedding vectors. The Milvus vector database’s highly scalable storage and efficient vector search capabilities have made it a popular choice for retrieval augmented generation (RAG), recommendation systems and other AI applications.  


Milvus's architecture

Milvus is a cloud-native vector database with a multilayered, microservices-based architecture. Milvus separates storage and compute resources, which enables organizations to scale each layer independently and horizontally.

Milvus is compatible with several different embedding models. Organizations can connect their models to Milvus, which ingests the embeddings along with metadata and other pertinent information. Milvus supports streaming and batch embedding uploads.

Milvus has four layers:

  • Access layer: This is the external-facing layer, which accepts inputs from users and services and returns outputs.  

  • Coordinator service: Zilliz refers to this layer as the system’s “brain” because it orchestrates load balancing, data management, query execution and other important tasks. 

  • Worker nodes: This layer executes queries, updates data and builds indexes. 

  • Object storage layer: This layer includes a metadata store, a log broker that records real-time data changes and an object store that holds log snapshots, index files and intermediate computation results.

Milvus deployment types

  • Milvus Lite: A Python library that allows users to run Milvus in local environments. Milvus Lite currently supports Ubuntu and macOS, but not Microsoft Windows.
     
  • Milvus Standalone: A complete Milvus database packaged in a single Docker image and run on a single machine. 
     
  • Milvus Cluster: A distributed vector database that spreads services across groups of nodes in a Kubernetes cluster.  
     
  • Zilliz Cloud: The fully managed version of Milvus.

Milvus vs. other vector databases

Milvus vs. Pinecone

Both Pinecone and Milvus offer low-latency search and scalable storage, but Pinecone is a proprietary vector database and is available only as a managed service.

Milvus vs. Weaviate

Like Milvus, Weaviate is open source and supports hybrid searches. One key difference is that Milvus offers more indexing types than Weaviate.

Milvus vs. Qdrant

Another open-source vector database, Qdrant is known for its strong metadata filtering capabilities. While Qdrant is well suited for moderate-scale uses, Milvus can generally handle higher volumes of vector data.

Milvus vs. Chroma

Chroma focuses on ease of use and quick local deployments. Chroma does not have a distributed architecture, making it less scalable. While Chroma is commonly used for prototyping and testing, Milvus can support a wider range of use cases.

Common Milvus use cases

Organizations use Milvus to support numerous AI applications, including:

  • Retrieval augmented generation (RAG)

  • Recommendation systems

  • Media searches 

  • Anomaly and fraud detection

Retrieval augmented generation (RAG)

RAG is an architecture that connects AI models to external knowledge bases to help them deliver more relevant, accurate results.  

Milvus is common in RAG implementations because of its support for efficient hybrid searches. By combining the contextual understanding of semantic search with the precision of keyword search and metadata filtering, Milvus can help surface relevant docs, code snippets and other information from RAG sources.

GPTCache, an open-source semantic cache for LLMs developed by Zilliz, also helps Milvus support RAG implementations. GPTCache stores responses from generative AI apps as vector embeddings.

Connected services—such as RAG interfaces—don’t need to make an API call to the generative AI for every search. Instead, they can check the cache first and call the gen AI only if the answer isn’t there.

Recommendation systems

Milvus is popular in recommendation systems that match content, products and ads to users based on past behaviors. 

User preferences can be represented as vectors, and a similarity search can surface the vector representations of products, ads and content that are close to the user’s preferences.
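A toy version of that matching step, with all vectors invented for illustration (a production system would store item embeddings in Milvus and run a top-k similarity search):

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

user_pref = [0.9, 0.1, 0.3]  # embedding of the user's past behavior

items = {
    "sci-fi movie": [0.85, 0.15, 0.25],
    "cooking show": [0.1, 0.9, 0.2],
    "space documentary": [0.8, 0.2, 0.4],
}

# Recommend the two items whose embeddings are closest to the user's preferences.
top2 = sorted(items, key=lambda name: cosine(items[name], user_pref), reverse=True)[:2]
print(top2)  # → ['sci-fi movie', 'space documentary']
```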

Media searches

Milvus's similarity search capabilities can help streamline image search, audio search, video search and other media searches.

Anomaly and fraud detection

Milvus can be used to help spot defects in products by comparing the vectors of product images against the vectors representing those products' proper forms. Differences between the vectors might indicate defects.

Milvus can also help spot anomalies in other contexts. In cybersecurity, vectors representing authorized network activity can be compared to vectors representing known malicious activity. Likewise, in finance, vectors representing transactions can be analyzed to identify deviations that might indicate fraud.
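The defect-detection idea above reduces to a distance check against a reference embedding. In this sketch the vectors and the distance threshold are made up; a real pipeline would embed product images with a vision model and query Milvus for outliers:

```python
import math

def euclidean(a: list[float], b: list[float]) -> float:
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

reference = [1.0, 1.0, 1.0]  # embedding of the product's proper form
threshold = 0.5              # max distance still considered "normal"

scans = {
    "unit_a": [1.02, 0.98, 1.01],  # close to the reference
    "unit_b": [1.9, 0.2, 1.4],     # far from the reference: possible defect
}

flagged = [name for name, vec in scans.items() if euclidean(vec, reference) > threshold]
print(flagged)  # → ['unit_b']
```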

Key characteristics and capabilities of Milvus

Advanced search capabilities 

Milvus supports high-performance vector similarity searches, a type of vector search that returns results that are semantically similar to a query. The benefit of similarity search is that it is not limited to exact matches, as a traditional keyword search would be.  

For example, a keyword search for “best pizza restaurant” would return only results containing the words “best,” “pizza” and “restaurant.” A similarity search for the same query would also find results about highly recommended pizza places, even if the exact words "best pizza restaurant" never appear in the content.

Milvus supports several similarity search types, including top-k approximate nearest neighbor (ANN) and range ANN.  

Milvus also supports hybrid searches, which combine semantic vector searches with other criteria, such as metadata filtering or keyword search.

Hybrid searches can make searches more efficient and more relevant. Consider a search that combines keyword and vector search. The search can first use specific keywords to filter results based on exact matches and then use vector similarity search to search those filtered results for the most semantically relevant content.  
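The two-stage flow described above can be sketched as follows. The documents, embeddings and scoring here are toy stand-ins, not an actual Milvus hybrid query:

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

docs = [
    {"text": "pizza oven repair guide", "vec": [0.1, 0.9]},
    {"text": "top-rated pizza places nearby", "vec": [0.8, 0.2]},
    {"text": "pasta recipes for beginners", "vec": [0.7, 0.3]},
]

def hybrid_search(keyword: str, query_vec: list[float], top_k: int = 1) -> list[str]:
    # Stage 1: keyword filter on exact matches.
    candidates = [d for d in docs if keyword in d["text"]]
    # Stage 2: rank only the filtered set by vector similarity.
    candidates.sort(key=lambda d: cosine(d["vec"], query_vec), reverse=True)
    return [d["text"] for d in candidates[:top_k]]

print(hybrid_search("pizza", [0.9, 0.1]))  # → ['top-rated pizza places nearby']
```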

Indexing 

Milvus supports several indexing types, including hierarchical navigable small world (HNSW), inverted file (IVF) and GPU-based indexes.

Indexing vectors can help speed up searches. For example, HNSW clusters similar vectors together during the index construction process, making it easier to find relevant results faster.
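The inverted-file (IVF) approach works on a similar clustering principle: vectors are assigned to the nearest centroid at build time, and a query probes only the closest bucket instead of scanning every vector. A toy sketch (real IVF learns its centroids by clustering the data, and typically probes several buckets):

```python
import math

def dist(a: list[float], b: list[float]) -> float:
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

centroids = [[0.0, 0.0], [10.0, 10.0]]
vectors = {"v1": [0.5, 0.2], "v2": [9.8, 10.1], "v3": [0.1, 0.9]}

# Build: one inverted list per centroid.
buckets: dict[int, list[str]] = {i: [] for i in range(len(centroids))}
for name, vec in vectors.items():
    nearest = min(range(len(centroids)), key=lambda i: dist(centroids[i], vec))
    buckets[nearest].append(name)

# Query: probe only the bucket of the nearest centroid, not the whole dataset.
query = [0.3, 0.4]
probe = min(range(len(centroids)), key=lambda i: dist(centroids[i], query))
best = min(buckets[probe], key=lambda n: dist(vectors[n], query))
print(best)  # → v1
```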

Scalability 

Milvus’s layers can scale independently of one another, which can give organizations a cost- and resource-effective way to handle massive amounts of vector data and intensive searches.

Integrations and compatibility 

Milvus offers numerous software development kits (SDKs) to support development in various languages, including Python (pymilvus), Java and Go.

Milvus can also integrate with analytics tools such as Apache Spark, frameworks such as LangChain and gen AI models such as IBM watsonx™, Meta’s Llama and OpenAI’s GPT models. 

Open source

A large open-source community contributes bug fixes, updates and other enhancements to Milvus’s GitHub repos.


Related content

https://zilliz.com/

https://milvus.io/zh

Open source: https://github.com/milvus-io/milvus

🐦 Milvus is a high-performance vector database built for scale. It powers AI applications by efficiently organizing and searching vast amounts of unstructured data, such as text, images, and multi-modal information.

🧑‍💻 Written in Go and C++, Milvus implements hardware acceleration for CPU/GPU to achieve best-in-class vector search performance. Thanks to its fully distributed and K8s-native architecture, Milvus can scale horizontally, handle tens of thousands of search queries on billions of vectors, and keep data fresh with real-time streaming updates. Milvus also supports Standalone mode for single-machine deployment. Milvus Lite is a lightweight version suited to quick starts in Python, installable with pip.

Want to use Milvus with zero setup? Try Zilliz Cloud ☁️ for free. Milvus is available as a fully managed service on Zilliz Cloud, with Serverless, Dedicated and BYOC options available.

https://opensearch.org/platform/vector-engine

An open-source, all-in-one vector database for building scalable and future-proof AI apps

To power artificial intelligence (AI) apps at scale, you need a database specially designed for the way AI understands data. OpenSearch Vector Engine is designed for accuracy, speed, and scalability, enabling you to build stable AI applications on a proven platform that scales to tens of billions of vectors, with low latency and high availability built-in. Unlike traditional databases that struggle with unstructured information, vector databases are designed for high-dimensional data—delivering lightning-fast results for chatbots, recommendations, image search, and other AI use cases.

The following is an overview of the **Chroma vector database** and **LlamaIndex**: what they are, their key characteristics, how to use them together, and how they compare.

---

### 1. The Chroma vector database

#### Introduction
Chroma is a lightweight, open-source vector database designed for storing and retrieving text embedding vectors. It is especially well suited to integration with large language models (LLMs) and to powering RAG (retrieval augmented generation) systems.

#### Key characteristics
1. **Lightweight and memory-first**
   - A memory-first architecture, well suited to rapid prototyping and small projects (such as local experiments).
   - Supports a **client-server mode** for scaling to production environments[^1].
2. **Ease of use**
   - A concise Python API; storing and querying vectors takes only a few lines of code:
     ```python
     import chromadb
     client = chromadb.Client()
     collection = client.create_collection("docs")
     collection.add(ids=["id1"], documents=["Chroma is a vector database"])
     results = collection.query(query_texts=["What is Chroma?"], n_results=1)
     ```
3. **Flexibility**
   - Supports local persistent storage (e.g., saving indexes to disk)[^2].
   - Can be combined with other vector databases (such as Pinecone or Milvus) for large-scale scenarios[^1].

---

### 2. LlamaIndex

#### Introduction
LlamaIndex is a **data framework for LLM applications**, used to build indexes, manage context and retrieve unstructured data (such as documents and PDFs).

#### Key characteristics
1. **Index abstraction layer**
   - Provides a unified interface for connecting vector databases (such as Chroma), traditional databases or local file systems.
   - Simplifies index creation (e.g., tree indexes and keyword indexes).
2. **Seamless vector database integration**
   - Wraps Chroma and other databases behind a `VectorStore` interface, decoupling business logic from storage:
     ```python
     from llama_index.vector_stores import ChromaVectorStore
     from llama_index import VectorStoreIndex

     # Wrap a Chroma collection as a LlamaIndex VectorStore
     vector_store = ChromaVectorStore(chroma_collection=chroma_collection)
     index = VectorStoreIndex.from_vector_store(vector_store)  # create the index[^4]
     ```
3. **Advanced query capabilities**
   - Supports semantic search and hybrid retrieval (keyword + vector).
   - Handles context injection automatically to improve LLM question answering.

---

### 3. Using the two together

1. **Initialize Chroma**
   ```python
   import chromadb
   chroma_client = chromadb.PersistentClient(path="./chroma_db")  # persistent storage
   chroma_collection = chroma_client.get_or_create_collection("docs")
   ```
2. **Wrap it as a LlamaIndex VectorStore**
   ```python
   from llama_index.vector_stores.chroma import ChromaVectorStore
   vector_store = ChromaVectorStore(chroma_collection=chroma_collection)  # the key conversion[^3][^4]
   ```
3. **Build the index and query it**
   ```python
   from llama_index import VectorStoreIndex
   index = VectorStoreIndex.from_vector_store(vector_store)
   query_engine = index.as_query_engine()
   response = query_engine.query("What are Chroma's key characteristics?")  # semantic retrieval
   ```

---

### 4. Comparison

| **Dimension** | **Chroma** | **LlamaIndex** |
|---|---|---|
| **Positioning** | Vector storage engine | LLM data orchestration framework |
| **Core functions** | Vector storage, similarity retrieval | Index building, query routing, context management |
| **Relationship** | Called by LlamaIndex (as a storage backend) | Calls Chroma (through the VectorStore interface)[^4] |
| **Typical scenarios** | Lightweight embedding storage, fast retrieval | Complex LLM applications (such as RAG and knowledge-base Q&A) |
| **Scaling** | Scales via client-server mode | Optimizes retrieval via indexing strategies |

> ⚡️ **How they cooperate:**
> Chroma provides the **underlying vector storage**, while LlamaIndex implements **business-logic abstractions** on top of it (such as multimodal indexing and query optimization); together they form an efficient RAG system[^1][^2].

---

### 5. Typical applications

1. **Local prototyping**: Chroma's in-memory mode plus LlamaIndex indexes, for quickly validating a RAG pipeline.
2. **Production document Q&A**: Chroma persistent storage plus LlamaIndex hybrid retrieval, supporting high-concurrency queries.
3. **Multi-source data fusion**: LlamaIndex integrating Chroma with other databases (such as MySQL) for cross-modal retrieval.

---

**Related questions**

1. How does Chroma handle distributed storage for large-scale data?
2. Which non-vector index types does LlamaIndex support (e.g., tree and keyword indexes)?
3. How can retrieval accuracy be evaluated in a Chroma + LlamaIndex RAG system?
4. How does Chroma's performance compare with other vector databases such as Milvus?

[^1]: Chroma vector database study notes
[^2]: Vector storage and querying with LlamaIndex and Chroma
[^3]: How to specify a Chroma collection in LlamaIndex
[^4]: How LlamaIndex wraps Chroma