Is it more like a vector database or a library?

More like a library: ANN is an embeddable component, indexes are built offline as artifacts, and online focuses on load+execute; it complements existing search/recsys pipelines rather than replacing storage/governance.

How do I choose index types and parameters safely?

Pin an eval set and regression scripts, track recall/latency/memory together, and require comparable index artifacts for every tuning change—don’t tune by “vibes”.

What should I compare it against as alternatives?

At the library layer, compare with [FAISS](https://github.com/facebookresearch/faiss) and [hnswlib](https://github.com/nmslib/hnswlib). If you need a full managed system surface, compare with [Milvus](https://milvus.io/).

zvec Deep Dive: production ANN index, FAISS alternative

Pain Points vs Innovation

✕Traditional Pain Points	✓Innovative Solutions
When vector search is glued into app code, index formats, params, and tuning are not reproducible, making regressions hard to diagnose.	zvec makes indexes first-class artifacts: build and query are decoupled, indexes are persistent/versioned, and online focuses on load+execute for controllable regressions.
Adopting a full vector database can be operationally heavy for lightweight recall use cases.	ANN-first execution puts latency/throughput optimizations in the query layer, with configurable recall–performance tradeoffs for embedded deployment.

Deployment Guide

1. Clone the repository

bash

1git clone https://github.com/alibaba/zvec.git && cd zvec

2. Install deps and build (typically Rust/C++ toolchain)

bash

1# Follow repo build commands (e.g., cargo build --release or cmake --build)

3. Build index artifacts (offline)

bash

1# Example: zvec build-index --input embeddings.bin --output index.zv --config config.yaml

4. Load index and run queries (online)

bash

1# Example: zvec query --index index.zv --vector query.bin --topk 10

5. Create a regression baseline

bash

1# Pin a query set + expected topK, store metrics/outputs for version comparisons

Use Cases

Core Scene	Target Audience	Solution	Outcome
Embeddable ANN recall layer for semantic search	search/KB teams	embed embedding-based nearest-neighbor recall into existing retrieval	better recall within latency budgets and regression-friendly tuning
Vector recall for recommender systems with AB iteration	recsys/growth teams	build indexes offline and recall candidates online with low latency	versioned recall components with safer AB and rollbacks
Local vector retrieval component for multimodal apps	multimodal/content understanding teams	run vector retrieval on-prem or at the edge	clear data boundaries, controlled cost, and throughput scaling with hardware

zvec

What is it?

Pain Points vs Innovation

Architecture Deep Dive

Deployment Guide

1. Clone the repository

2. Install deps and build (typically Rust/C++ toolchain)

3. Build index artifacts (offline)

4. Load index and run queries (online)

5. Create a regression baseline

Use Cases

Limitations & Gotchas

Frequently Asked Questions