RAG & Search

Best AI Tools for RAG and Retrieval (2026)

RAG frameworks, vector databases, embedding tools, and knowledge-base builders. Ground your AI in real data.

30 tools

RAG Best Practices — Production Pipeline Guide 2026

Comprehensive guide to building production RAG pipelines. Covers chunking strategies, embedding models, vector databases, retrieval techniques, evaluation, and common pitfalls with code examples.

Prompt Lab 445Prompts

Marqo — Tensor Search Engine for AI-Powered Retrieval

An end-to-end vector search engine that handles embedding generation, storage, and retrieval in a single service for text and image search.

AI Open Source 152Configs

LLMWare — Unified Framework for Enterprise RAG Pipelines

Build retrieval-augmented generation workflows with small specialized models, parsing, embeddings, and vector search in one framework.

AI Open Source 62Configs

TurboVec — High-Performance Rust Vector Index with Python Bindings

A vector similarity search index built on TurboQuant quantization, written in Rust with first-class Python bindings for embedding-based retrieval and RAG workloads.

AI Open Source 51Configs

Qdrant MCP — Vector Search Engine for AI Agents

MCP server for Qdrant vector database. Gives AI agents the power to store and search embeddings for RAG, semantic search, and recommendation systems. 22,000+ stars on Qdrant.

MCP Hub 412MCP Configs

Claude Code Agent: Search Specialist — Build Search Systems

Claude Code agent for building search systems. Vector search, semantic retrieval, embedding strategies, and ranking optimization.

Skill Factory 363Skills

Supabase — The Open Source Firebase Alternative

Supabase is an open-source backend platform built on Postgres. It provides a complete backend — database, authentication, real-time subscriptions, storage, edge functions, and vector embeddings — with instant APIs and a generous free tier.

Supabase 308Skills

Embedding Drift Monitoring — Retrieval Regression Runbook

Embedding drift monitoring runbook for RAG and agent search. Uses golden queries, recall@K, rank delta, and rollback gates.

henuwangkai 283Knowledge

R2R — Production-Ready Agentic RAG System

A state-of-the-art production-ready retrieval-augmented generation system with agentic capabilities, a RESTful API, and built-in document processing, vector search, and knowledge graph support.

AI Open Source 257Configs

PageIndex — Document Index for Reasoning-Based RAG

A document indexing system that enables vectorless retrieval-augmented generation by building structured page-level indexes for LLM reasoning.

AI Open Source 235Skills

AutoRAG — Automated RAG Pipeline Optimization

An open-source AutoML-style framework for evaluating and optimizing retrieval-augmented generation pipelines by automatically testing combinations of chunking, embedding, retrieval, and generation strategies.

AI Open Source 189Configs

Cherry Studio Knowledge Base — Local RAG with 50+ Formats

Cherry Studio Knowledge Base ingests PDFs, Office docs, Markdown into a local vector index. Query offline, BYOK any LLM. Data stays on your machine.

Cherry Studio 543Knowledge

Verba — The Golden RAGtriever by Weaviate

Verba is an open-source RAG (Retrieval-Augmented Generation) chatbot from the Weaviate team. Drop in PDFs, web pages, or notes; pick a model (OpenAI, Ollama, Anthropic); and get a polished chat UI with semantic search built in.

AI Open Source 507Skills

Weaviate — Open-Source Vector Database at Scale

Weaviate is an open-source vector database for semantic search at scale. 15.9K+ GitHub stars. Hybrid search (vector + BM25), built-in RAG, reranking, multi-tenancy, and horizontal scaling. BSD 3-Claus

AI Open Source 442Skills

Chroma — Open-Source Vector Database for AI

Chroma is the open-source vector database and data infrastructure for AI applications. 27.1K+ GitHub stars. Simple 4-function API for embedding, storing, and querying documents. Supports Python, JavaS

AI Open Source 441Skills

Quivr — Opinionated RAG Framework for Any LLM

Quivr is an opinionated RAG framework supporting any LLM, multiple file types, and customizable retrieval. 39.1K+ stars. Apache 2.0.

Script Depot 422Scripts

Memvid — Serverless Memory Layer for AI Agents

An open-source memory system that replaces complex RAG pipelines with a single-file, serverless memory layer providing instant retrieval and long-term storage for AI agents.

Script Depot 400Skills

MaxKB — Self-Hosted AI Knowledge Base with RAG

MaxKB is an open-source knowledge base platform that combines document management with retrieval-augmented generation, letting teams build AI-powered Q&A systems over their own documents without sending data to third parties.

AI Open Source 399Configs

Turbopuffer MCP — Serverless Vector DB for AI Agents

MCP server for Turbopuffer serverless vector database. Sub-10ms search, zero ops, auto-scaling. Perfect for AI agent memory and RAG without managing infrastructure. 1,200+ stars.

MCP Hub 384MCP Configs

PostgreSQL — The Most Advanced Open Source Relational Database

PostgreSQL is the most powerful open-source relational database system. It combines SQL compliance, extensibility, and reliability with advanced features like JSONB, full-text search, vector embeddings (pgvector), and PostGIS — making it the database of choice for modern applications.

AI Open Source 380Skills

Haystack MCP — Connect AI Pipelines to MCP Clients

Expose Haystack RAG pipelines as MCP servers. Let Claude Code and other AI tools query your document search, QA, and retrieval pipelines through the MCP protocol.

Skill Factory 375MCP Configs

pgvector — Vector Similarity Search Inside PostgreSQL

A PostgreSQL extension that adds a native `vector` type, HNSW and IVFFlat indexes, and distance operators so semantic search, RAG and recommendation workloads can reuse the same database as the rest of the app.

Script Depot 372Skills

LlamaIndex — Data Framework for LLM Applications

Leading data framework for connecting LLMs to external data. LlamaIndex handles ingestion, indexing, retrieval, and query engines for building production RAG applications.

Prompt Lab 355Skills

Cohere Rerank — Boost RAG Accuracy with Rerank-3

Cohere Rerank scores candidates against a query using a cross-encoder. Drop it into any RAG pipeline for a claimed 30-50% boost in top-1 hit rate over vector search alone.

Cohere 317Skills

LightRAG — Graph-Enhanced Retrieval-Augmented Generation

LightRAG integrates knowledge graphs into the RAG pipeline, enabling both low-level entity retrieval and high-level thematic search for more accurate and context-rich LLM responses.

Script Depot 273Skills

CocoIndex — Incremental Data Indexing Engine for AI Agents

CocoIndex is an open-source framework for building incremental data indexing pipelines. It keeps embeddings and knowledge graphs in sync with source data using change-data-capture, enabling always-fresh context for AI agents and RAG applications.

Script Depot 207Skills

FlashRAG — Efficient RAG Research Toolkit

FlashRAG is a Python toolkit for RAG experiments: install `flashrag-dev`, build dense/sparse indexes, and iterate on retrieval configs.

AI Open Source 203Skills

nano-graphrag — Lightweight GraphRAG Implementation

A simple, hackable implementation of Microsoft GraphRAG that builds knowledge graphs from documents and uses graph-based retrieval for more accurate LLM question answering.

AI Open Source 203Configs

NornicDB — Graph+Vector DB for Agent Memory

NornicDB is a Neo4j-compatible graph+vector database for agent memory and GraphRAG; run it in Docker and manage it from a localhost admin UI.

AI Open Source 199SkillsCLI Tools

SQLBot — AI-Powered Text-to-SQL with RAG

An open-source conversational data analysis system that converts natural language questions into SQL queries using large language models and retrieval-augmented generation.

AI Open Source 119Configs

RAG in Production

Retrieval-Augmented Generation (RAG) has moved from research prototype to production standard. Most enterprise AI applications that answer questions about internal data use some form of RAG.

RAG Frameworks: RAGFlow, Haystack, and Kotaemon provide end-to-end pipelines for document ingestion, chunking, embedding, retrieval, and answer generation with source citations.

Vector Databases: Chroma, Milvus, Weaviate, LanceDB, and Pinecone store and retrieve document embeddings. The choice depends on scale (Milvus for billions of vectors), simplicity (Chroma for prototyping), or cost (LanceDB for serverless).

GraphRAG: Microsoft's GraphRAG and related tools build knowledge graphs from documents, enabling more accurate retrieval for complex queries that span multiple documents.
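To make the role of a vector database concrete, here is a minimal sketch of what "store and retrieve document embeddings" means at its core: a brute-force cosine-similarity search over an in-memory dictionary. It does not use the API of any database named above; the function names, the document IDs, and the 3-dimensional embeddings are all hypothetical and hand-written for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query_vec, index, k=2):
    """Return the k (doc_id, score) pairs most similar to query_vec.

    `index` is a plain dict of doc_id -> embedding. A real vector
    database would replace this linear scan with an approximate
    nearest-neighbor index.
    """
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in index.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings (real ones have hundreds of dimensions).
toy_index = {
    "refund-policy": [0.9, 0.1, 0.0],
    "shipping-times": [0.1, 0.9, 0.0],
    "api-rate-limits": [0.0, 0.1, 0.9],
}

results = top_k([0.8, 0.2, 0.0], toy_index, k=2)
for doc_id, score in results:
    print(f"{doc_id}: {score:.3f}")
```

The linear scan is fine for a few thousand vectors; the scale, simplicity, and cost trade-offs mentioned above start to matter once the index no longer fits comfortably in one process.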

Advanced RAG Patterns: Hybrid search (combining vector similarity with keyword matching), re-ranking (using cross-encoders to improve retrieval precision), and agentic RAG (letting AI agents decide when and how to retrieve information) represent the cutting edge of production RAG systems.
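One common way to combine a vector ranking with a keyword ranking in hybrid search is reciprocal rank fusion (RRF). The sketch below is an illustration of that technique only, not the method of any specific tool listed here; the document IDs and the two input rankings are hypothetical, and `k=60` is the constant conventionally used with RRF.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of doc IDs into one ranking.

    Each document scores 1 / (k + rank) for every list it appears in
    (rank starts at 1); the scores are summed across lists.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by doc ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a vector search and a BM25 keyword search.
vector_hits = ["doc-a", "doc-b", "doc-c"]
keyword_hits = ["doc-b", "doc-d", "doc-a"]

fused = reciprocal_rank_fusion([vector_hits, keyword_hits])
print(fused)
```

Because RRF works on ranks rather than raw scores, it avoids having to normalize cosine similarities against BM25 scores, which live on different scales. A cross-encoder re-ranker would typically run afterwards on the top of the fused list.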

RAG is the bridge between what the model knows and what your organization knows.

Frequently Asked Questions

What is RAG (Retrieval-Augmented Generation)?

RAG is a technique that gives AI models access to external knowledge by retrieving relevant documents before generating answers. Instead of relying only on training data, the system searches your documents, finds relevant passages, and passes them to the model, which uses them to produce accurate, grounded answers with source citations. This is how companies build AI assistants that "know" their internal data.
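The retrieve-then-generate flow described in this answer can be sketched in a few lines. This is a toy illustration under stated assumptions: retrieval is done by simple word overlap rather than embeddings, the passages and file names are invented, and the final call to an LLM is left out, so the code stops at the assembled prompt.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question, passages, k=2):
    """Return the k passages sharing the most words with the question."""
    q_tokens = tokenize(question)
    ranked = sorted(
        passages,
        key=lambda p: len(q_tokens & tokenize(p["text"])),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, retrieved):
    """Assemble a grounded prompt with numbered, citable sources."""
    sources = "\n".join(
        f"[{i}] ({p['source']}) {p['text']}"
        for i, p in enumerate(retrieved, start=1)
    )
    return (
        "Answer using only the sources below and cite them as [n].\n\n"
        f"Sources:\n{sources}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical internal documents.
passages = [
    {"source": "hr-handbook.md", "text": "Employees accrue 20 vacation days per year."},
    {"source": "it-policy.md", "text": "Laptops are replaced every three years."},
    {"source": "travel.md", "text": "Flights must be booked two weeks in advance."},
]

question = "How many vacation days do employees get per year?"
top = retrieve(question, passages, k=1)
prompt = build_prompt(question, top)
print(prompt)  # this string is what would be sent to the LLM
```

Numbering the sources in the prompt is what lets the model's answer carry citations back to specific documents.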

Which vector database should I use?

For prototyping: Chroma (in-memory, zero configuration). For production at scale: Milvus (billions of vectors) or Weaviate (hybrid search). For serverless or embedded use: LanceDB or Turso with vector extensions. For managed cloud: Pinecone. Most RAG assets on TokRepo include preconfigured vector database setups that you can install with a single command.

How do I improve RAG accuracy?

Three key techniques: 1) Better chunking: split documents at semantic boundaries, not at fixed character counts. 2) Hybrid retrieval: combine vector search with BM25 keyword matching. 3) Re-ranking: use a cross-encoder model to re-score the retrieved chunks before sending them to the LLM. GraphRAG (building knowledge graphs) helps most with complex queries that span multiple documents.
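As a rough illustration of the first technique, the sketch below chunks on paragraph boundaries instead of cutting at a fixed character offset. Treating a blank line as the "semantic boundary" is an assumption made for simplicity; the function name, the `max_chars` parameter, and the sample text are hypothetical.

```python
def chunk_by_paragraph(text, max_chars=200):
    """Pack whole paragraphs into chunks of at most max_chars characters.

    Paragraphs are never split: one that exceeds max_chars on its own
    becomes a single oversized chunk instead of being cut mid-sentence.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks = []
    current = ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if not current or len(candidate) <= max_chars:
            current = candidate  # paragraph fits (or starts a new chunk)
        else:
            chunks.append(current)  # close the chunk at a paragraph break
            current = para
    if current:
        chunks.append(current)
    return chunks


# Hypothetical document with three short paragraphs.
doc = (
    "Refunds are issued within 14 days.\n\n"
    "Shipping takes 3 to 5 business days.\n\n"
    "Contact support by email for anything else."
)

chunks = chunk_by_paragraph(doc, max_chars=80)
for i, chunk in enumerate(chunks, start=1):
    print(f"--- chunk {i} ---\n{chunk}")
```

With `max_chars=80` the first two paragraphs share a chunk and the third starts a new one, so no sentence is ever split across chunks. Sentence-level or heading-aware splitting follows the same pattern with a different boundary rule.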
