ConfigsMar 30, 2026·2 min read

Qdrant — High-Performance Vector Database

Vector database and search engine for AI applications. Handles billion-scale similarity search with filtering, sparse vectors, and multi-tenancy. Rust-powered. 30K+ stars.

TL;DR
Qdrant is a vector database built in Rust for fast similarity search with advanced filtering, sparse vectors, and multi-tenancy.
§01

What it is

Qdrant is a vector database and similarity search engine designed for AI applications. Written in Rust for performance, it stores high-dimensional vectors alongside arbitrary JSON payloads and supports filtered search, sparse vectors, and multi-tenancy. It runs as a standalone server or embedded in your application.

Qdrant targets developers building RAG pipelines, recommendation engines, image search, anomaly detection, or any system that needs fast nearest-neighbor lookup at scale.

§02

How it saves time or tokens

Vector databases are the backbone of retrieval-augmented generation (RAG). Instead of stuffing entire documents into an LLM prompt, you embed and index them in Qdrant, then retrieve only the relevant chunks. This cuts token usage dramatically -- a well-tuned RAG pipeline might send 2,000 tokens of context instead of 20,000, saving both cost and latency.

§03

How to use

  1. Start Qdrant with Docker:
docker run -p 6333:6333 -p 6334:6334 qdrant/qdrant
  1. Create a collection and insert vectors:
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, VectorParams, PointStruct

client = QdrantClient('localhost', port=6333)

client.create_collection(
    collection_name='docs',
    vectors_config=VectorParams(size=384, distance=Distance.COSINE),
)

client.upsert(
    collection_name='docs',
    points=[
        PointStruct(id=1, vector=[0.1]*384, payload={'title': 'Getting Started'}),
        PointStruct(id=2, vector=[0.2]*384, payload={'title': 'Advanced Usage'}),
    ]
)
  1. Search with filters:
results = client.search(
    collection_name='docs',
    query_vector=[0.15]*384,
    limit=5,
)
§04

Example

FeatureQdrant
LanguageRust
QuantizationScalar, Product, Binary
Sparse vectorsYes
Multi-tenancyBuilt-in
FilteringPayload-based, nested
ShardingAutomatic
APIREST + gRPC
§05

Related on TokRepo

§06

Common pitfalls

  • Vector dimensionality must match your embedding model. A mismatch (e.g., 384 vs 768) causes silent failures or errors at insert time.
  • Quantization reduces memory usage significantly but can lower recall. Test with your actual data before enabling in production.
  • On-disk storage mode is available for large datasets that do not fit in RAM, but search latency increases. Profile your workload to decide between in-memory and on-disk.

Frequently Asked Questions

How does Qdrant compare to Pinecone?+

Qdrant is open source and self-hostable, while Pinecone is a managed cloud service. Qdrant gives you full control over infrastructure and data residency. Pinecone offers zero-ops convenience. Performance is comparable for most workloads. Choose Qdrant if you need self-hosting or want to avoid vendor lock-in.

Can Qdrant handle billions of vectors?+

Yes. Qdrant supports sharding and on-disk storage for large-scale deployments. Billions of vectors require distributed mode with multiple nodes. Quantization (scalar or product) reduces memory per vector, making billion-scale deployments practical on commodity hardware.

Does Qdrant support hybrid search?+

Yes. Qdrant supports both dense and sparse vectors in the same collection. You can combine semantic similarity (dense) with keyword matching (sparse) using named vectors. This hybrid approach often improves retrieval quality compared to dense-only search.

What embedding models work with Qdrant?+

Any embedding model that outputs fixed-dimension vectors works with Qdrant. Common choices include OpenAI text-embedding-3-small (1536d), Cohere embed-v3 (1024d), and sentence-transformers models (384d or 768d). You generate embeddings externally and store the resulting vectors in Qdrant.

Is Qdrant free?+

Qdrant is open source under Apache 2.0 and free to self-host. Qdrant also offers a managed cloud service with a free tier for small projects and paid tiers for production workloads. The self-hosted version has no feature restrictions compared to the cloud version.

Citations (3)
🙏

Source & Thanks

Created by Qdrant. Licensed under Apache 2.0. qdrant/qdrant — 30,000+ GitHub stars

Discussion

Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.

Related Assets