Esta página se muestra en inglés. Una traducción al español está en curso.
SkillsMar 30, 2026·2 min de lectura

Qdrant — High-Performance Vector Database

Vector database and search engine for AI applications. Handles billion-scale similarity search with filtering, sparse vectors, and multi-tenancy. Rust-powered. 30K+ stars.

Listo para agents

Instalación lista para agent

Este activo puede instalarse después de elegir el runtime, revisar el plan y ejecutar el comando correspondiente.

Native · 98/100Política: permitir
Superficie agent
Cualquier agent MCP/CLI
Tipo
Skill
Instalación
Single
Confianza
Confianza: Established
Entrada
Qdrant — High-Performance Vector Database
Comando de instalación directa
npx -y tokrepo@latest install 1566710d-f5ed-46da-af8c-757475a10420 --target codex

Ejecutar después de confirmar el plan con dry-run.

TL;DR
Qdrant is a vector database built in Rust for fast similarity search with advanced filtering, sparse vectors, and multi-tenancy.
§01

What it is

Qdrant is a vector database and similarity search engine designed for AI applications. Written in Rust for performance, it stores high-dimensional vectors alongside arbitrary JSON payloads and supports filtered search, sparse vectors, and multi-tenancy. It runs as a standalone server or embedded in your application.

Qdrant targets developers building RAG pipelines, recommendation engines, image search, anomaly detection, or any system that needs fast nearest-neighbor lookup at scale.

§02

How it saves time or tokens

Vector databases are the backbone of retrieval-augmented generation (RAG). Instead of stuffing entire documents into an LLM prompt, you embed and index them in Qdrant, then retrieve only the relevant chunks. This cuts token usage dramatically -- a well-tuned RAG pipeline might send 2,000 tokens of context instead of 20,000, saving both cost and latency.

§03

How to use

  1. Start Qdrant with Docker:
docker run -p 6333:6333 -p 6334:6334 qdrant/qdrant
  1. Create a collection and insert vectors:
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, VectorParams, PointStruct

client = QdrantClient('localhost', port=6333)

client.create_collection(
    collection_name='docs',
    vectors_config=VectorParams(size=384, distance=Distance.COSINE),
)

client.upsert(
    collection_name='docs',
    points=[
        PointStruct(id=1, vector=[0.1]*384, payload={'title': 'Getting Started'}),
        PointStruct(id=2, vector=[0.2]*384, payload={'title': 'Advanced Usage'}),
    ]
)
  1. Search with filters:
results = client.search(
    collection_name='docs',
    query_vector=[0.15]*384,
    limit=5,
)
§04

Example

FeatureQdrant
LanguageRust
QuantizationScalar, Product, Binary
Sparse vectorsYes
Multi-tenancyBuilt-in
FilteringPayload-based, nested
ShardingAutomatic
APIREST + gRPC
§05

Related on TokRepo

§06

Common pitfalls

  • Vector dimensionality must match your embedding model. A mismatch (e.g., 384 vs 768) causes silent failures or errors at insert time.
  • Quantization reduces memory usage significantly but can lower recall. Test with your actual data before enabling in production.
  • On-disk storage mode is available for large datasets that do not fit in RAM, but search latency increases. Profile your workload to decide between in-memory and on-disk.

Preguntas frecuentes

How does Qdrant compare to Pinecone?+

Qdrant is open source and self-hostable, while Pinecone is a managed cloud service. Qdrant gives you full control over infrastructure and data residency. Pinecone offers zero-ops convenience. Performance is comparable for most workloads. Choose Qdrant if you need self-hosting or want to avoid vendor lock-in.

Can Qdrant handle billions of vectors?+

Yes. Qdrant supports sharding and on-disk storage for large-scale deployments. Billions of vectors require distributed mode with multiple nodes. Quantization (scalar or product) reduces memory per vector, making billion-scale deployments practical on commodity hardware.

Does Qdrant support hybrid search?+

Yes. Qdrant supports both dense and sparse vectors in the same collection. You can combine semantic similarity (dense) with keyword matching (sparse) using named vectors. This hybrid approach often improves retrieval quality compared to dense-only search.

What embedding models work with Qdrant?+

Any embedding model that outputs fixed-dimension vectors works with Qdrant. Common choices include OpenAI text-embedding-3-small (1536d), Cohere embed-v3 (1024d), and sentence-transformers models (384d or 768d). You generate embeddings externally and store the resulting vectors in Qdrant.

Is Qdrant free?+

Qdrant is open source under Apache 2.0 and free to self-host. Qdrant also offers a managed cloud service with a free tier for small projects and paid tiers for production workloads. The self-hosted version has no feature restrictions compared to the cloud version.

Referencias (3)
🙏

Fuente y agradecimientos

Created by Qdrant. Licensed under Apache 2.0. qdrant/qdrant — 30,000+ GitHub stars

Discusión

Inicia sesión para unirte a la discusión.
Aún no hay comentarios. Sé el primero en compartir tus ideas.

Activos relacionados