RAG & Search

Best AI Tools for RAG and Retrieval (2026)

RAG frameworks, vector databases, embedding tools, and knowledge base builders. Ground your AI in real data.

30 tools

RAG Best Practices — Production Pipeline Guide 2026

Comprehensive guide to building production RAG pipelines. Covers chunking strategies, embedding models, vector databases, retrieval techniques, evaluation, and common pitfalls with code examples.

Prompt Lab 290Prompts

Marqo — Tensor Search Engine for AI-Powered Retrieval

An end-to-end vector search engine that handles embedding generation, storage, and retrieval in a single service for text and image search.

AI Open Source 21Configs

AnythingLLM — All-in-One AI Desktop with MCP

Full-stack AI desktop app with RAG, agents, MCP support, and multi-model chat. AnythingLLM manages documents, embeddings, and vector stores in one private interface.

MCP Hub 325MCP Configs

Together AI Embeddings & Reranking Skill for Agents

A skill that teaches Claude Code to use Together AI's embeddings and reranking API. Covers dense vector generation, semantic search, RAG pipelines, and result reranking patterns.

Together AI 284Skills

Spring AI — AI Engineering for Java/Spring

Spring AI provides Spring-friendly APIs for AI apps. 8.4K+ stars. Chat, embeddings, RAG, vector DBs, function calling. Major providers. Apache 2.0.

Skill Factory 257Skills

Qdrant MCP — Vector Search Engine for AI Agents

MCP server for Qdrant vector database. Gives AI agents the power to store and search embeddings for RAG, semantic search, and recommendation systems. 22,000+ stars on Qdrant.

MCP Hub 251MCP Configs

Claude Code Agent: Search Specialist — Build Search Systems

Claude Code agent for building search systems. Vector search, semantic retrieval, embedding strategies, and ranking optimization.

Skill Factory 230Skills

Supabase — The Open Source Firebase Alternative

Supabase is an open-source backend platform built on Postgres. It provides a complete backend — database, authentication, real-time subscriptions, storage, edge functions, and vector embeddings — with instant APIs and a generous free tier.

Supabase 212Skills

PageIndex — Document Index for Reasoning-Based RAG

A document indexing system that enables vectorless retrieval-augmented generation by building structured page-level indexes for LLM reasoning.

AI Open Source 135Skills

Embedding Drift Monitoring — Retrieval Regression Runbook

Embedding drift monitoring runbook for RAG and agent search. Uses golden queries, recall@K, rank delta, and rollback gates.

henuwangkai 135Knowledge

R2R — Production-Ready Agentic RAG System

A state-of-the-art production-ready retrieval-augmented generation system with agentic capabilities, a RESTful API, and built-in document processing, vector search, and knowledge graph support.

AI Open Source 127Configs

AutoRAG — Automated RAG Pipeline Optimization

An open-source AutoML-style framework for evaluating and optimizing retrieval-augmented generation pipelines by automatically testing combinations of chunking, embedding, retrieval, and generation strategies.

AI Open Source 92Configs

Chroma — Open-Source Vector Database for AI

Chroma is the open-source vector database and data infrastructure for AI applications. 27.1K+ GitHub stars. Simple 4-function API for embedding, storing, and querying documents. Supports Python, JavaS

AI Open Source 304Skills

LangChain4j — LLM Integration for Java

LangChain4j integrates 20+ LLM providers and 30+ vector stores into Java apps. 11.4K+ stars. Unified API, RAG, MCP, Spring Boot. Apache 2.0.

LangChain 295Skills

Weaviate — Open-Source Vector Database at Scale

Weaviate is an open-source vector database for semantic search at scale. 15.9K+ GitHub stars. Hybrid search (vector + BM25), built-in RAG, reranking, multi-tenancy, and horizontal scaling. BSD 3-Claus

AI Open Source 293Skills

Verba — The Golden RAGtriever by Weaviate

Verba is an open-source RAG (Retrieval-Augmented Generation) chatbot from the Weaviate team. Drop in PDFs, web pages, or notes; pick a model (OpenAI, Ollama, Anthropic); and get a polished chat UI with semantic search built in.

AI Open Source 292Skills

Quivr — Opinionated RAG Framework for Any LLM

Quivr is an opinionated RAG framework supporting any LLM, multiple file types, and customizable retrieval. 39.1K+ stars. Apache 2.0.

Script Depot 274Scripts

Langflow — Visual AI Workflow Builder

Low-code visual builder for AI workflows and RAG pipelines. Drag-and-drop components for LLMs, vector stores, tools, and agents with Python extensibility.

Agent Toolkit 269Skills

PostgreSQL — The Most Advanced Open Source Relational Database

PostgreSQL is the most powerful open-source relational database system. It combines SQL compliance, extensibility, and reliability with advanced features like JSONB, full-text search, vector embeddings (pgvector), and PostGIS — making it the database of choice for modern applications.

AI Open Source 250Skills

Haystack MCP — Connect AI Pipelines to MCP Clients

Expose Haystack RAG pipelines as MCP servers. Let Claude Code and other AI tools query your document search, QA, and retrieval pipelines through the MCP protocol.

Skill Factory 243MCP Configs

Turbopuffer MCP — Serverless Vector DB for AI Agents

MCP server for Turbopuffer serverless vector database. Sub-10ms search, zero ops, auto-scaling. Perfect for AI agent memory and RAG without managing infrastructure. 1,200+ stars.

MCP Hub 241MCP Configs

Memvid — Serverless Memory Layer for AI Agents

An open-source memory system that replaces complex RAG pipelines with a single-file, serverless memory layer providing instant retrieval and long-term storage for AI agents.

Script Depot 240Skills

Cherry Studio Knowledge Base — Local RAG with 50+ Formats

Cherry Studio Knowledge Base ingests PDFs, Office docs, Markdown into a local vector index. Query offline, BYOK any LLM. Data stays on your machine.

Cherry Studio 225Knowledge

LlamaIndex — Data Framework for LLM Applications

Leading data framework for connecting LLMs to external data. LlamaIndex handles ingestion, indexing, retrieval, and query engines for building production RAG applications.

Prompt Lab 224Skills

MaxKB — Self-Hosted AI Knowledge Base with RAG

MaxKB is an open-source knowledge base platform that combines document management with retrieval-augmented generation, letting teams build AI-powered Q&A systems over their own documents without sending data to third parties.

AI Open Source 220Configs

pgvector — Vector Similarity Search Inside PostgreSQL

A PostgreSQL extension that adds a native `vector` type, HNSW and IVFFlat indexes, and distance operators so semantic search, RAG and recommendation workloads can reuse the same database as the rest of the app.

Script Depot 210Skills

Cohere Rerank — Boost RAG Accuracy with Rerank-3

Cohere Rerank scores candidates against a query using a cross-encoder. Drop into any RAG to boost top-1 hit rate by 30-50% over vector search alone.

Cohere 193Skills

LightRAG — Graph-Enhanced Retrieval-Augmented Generation

LightRAG integrates knowledge graphs into the RAG pipeline, enabling both low-level entity retrieval and high-level thematic search for more accurate and context-rich LLM responses.

Script Depot 161Skills

CocoIndex — Incremental Data Indexing Engine for AI Agents

CocoIndex is an open-source framework for building incremental data indexing pipelines. It keeps embeddings and knowledge graphs in sync with source data using change-data-capture, enabling always-fresh context for AI agents and RAG applications.

Script Depot 112Skills

FlashRAG — Efficient RAG Research Toolkit

FlashRAG is a Python toolkit for RAG experiments: install `flashrag-dev`, build dense/sparse indexes, and iterate on retrieval configs.

AI Open Source 109Skills

RAG in Production

Retrieval-Augmented Generation (RAG) has moved from research prototype to production standard. Most enterprise AI applications that answer questions about internal data use some form of RAG.

RAG Frameworks: RAGFlow, Haystack, and Kotaemon provide end-to-end pipelines for document ingestion, chunking, embedding, retrieval, and answer generation with source citations.

Vector Databases: Chroma, Milvus, Weaviate, LanceDB, and Pinecone store and retrieve document embeddings. The choice depends on scale (Milvus for billions of vectors), simplicity (Chroma for prototyping), or cost (LanceDB for serverless).

GraphRAG: Microsoft's GraphRAG and related tools build knowledge graphs from documents, enabling more accurate retrieval for complex queries that span multiple documents.
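
To make "store and retrieve document embeddings" concrete, here is a minimal sketch of the retrieval step in plain Python. It is not the API of any product listed here: the function names, document IDs, and three-dimensional vectors are all hypothetical, and the search is an exhaustive scan, whereas production vector databases typically rely on approximate indexes such as HNSW.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)

def top_k(query_vec, index, k=2):
    """Return the k (doc_id, score) pairs most similar to query_vec.

    `index` maps doc_id -> embedding vector. Every stored vector is
    compared against the query (brute force).
    """
    scored = [(doc_id, cosine_similarity(query_vec, vec))
              for doc_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Toy "embeddings" (hypothetical values, not real model output).
index = {
    "refund-policy": [0.9, 0.1, 0.0],
    "shipping-times": [0.1, 0.9, 0.1],
    "api-limits": [0.0, 0.2, 0.9],
}
results = top_k([0.8, 0.2, 0.0], index, k=2)
print([doc_id for doc_id, _ in results])
```

In a real pipeline the vectors would come from an embedding model, and the scan would be replaced by the database's own index.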

Advanced RAG Patterns: Hybrid search (combining vector similarity with keyword matching), re-ranking (using cross-encoders to improve retrieval precision), and agentic RAG (letting AI agents decide when and how to retrieve information) represent the cutting edge of production RAG systems.
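
One common way to combine a vector ranking with a keyword ranking is reciprocal rank fusion (RRF). The text above does not name a fusion method, so treat this as one possible sketch rather than how any listed tool works. The document IDs are made up, and `k=60` is a conventional default, assumed here rather than tuned.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of doc IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears
    in, with ranks starting at 1. Ties are broken by doc ID so the
    output is deterministic.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))

# Hypothetical results from the two retrievers for the same query.
vector_hits = ["doc-a", "doc-b", "doc-c"]    # from embedding similarity
keyword_hits = ["doc-b", "doc-d", "doc-a"]   # from keyword (e.g. BM25) search

fused = reciprocal_rank_fusion([vector_hits, keyword_hits])
print(fused)
```

Because RRF uses only rank positions, it avoids having to normalize vector and keyword scores onto a shared scale. A cross-encoder re-ranker could then re-score the top of the fused list.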

RAG is the bridge between what the model knows and what your organization knows.

Frequently asked questions

What is RAG (Retrieval-Augmented Generation)?

RAG is a technique that gives AI models access to external knowledge by retrieving relevant documents before generating a response. Instead of relying only on training data, the system searches your documents, finds relevant passages, and the model uses them to produce accurate, grounded answers with source citations. This is how companies build AI assistants that "know" their internal data.

Which vector database should I use?

For prototyping: Chroma (in-memory, zero configuration). For production at scale: Milvus (billions of vectors) or Weaviate (hybrid search). For serverless or embedded use: LanceDB or Turso with vector extensions. For a managed cloud service: Pinecone. Most RAG assets on TokRepo include preconfigured vector database setups that you can install with a single command.

How do I improve RAG accuracy?

Three key techniques: 1) Better chunking: split documents at semantic boundaries, not at fixed character counts. 2) Hybrid retrieval: combine vector search with BM25 keyword matching. 3) Re-ranking: use a cross-encoder model to re-score the retrieved chunks before sending them to the LLM. GraphRAG (building knowledge graphs) helps most with complex queries that span multiple documents.
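
As a rough illustration of the first technique, the sketch below packs whole paragraphs into chunks instead of cutting at a fixed character offset. It assumes a blank line marks a semantic boundary, which is a simplification. The function name, the `max_chars` budget, and the sample text are hypothetical.

```python
def chunk_by_paragraph(text, max_chars=200):
    """Greedily pack whole paragraphs into chunks of at most max_chars.

    A single paragraph longer than max_chars becomes its own oversized
    chunk rather than being cut mid-sentence.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current = [], ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars or not current:
            current = candidate          # paragraph fits, or starts a new chunk
        else:
            chunks.append(current)       # close the chunk at the boundary
            current = para
    if current:
        chunks.append(current)
    return chunks

doc = (
    "Refunds are issued within 14 days.\n\n"
    "Shipping takes 3 to 5 business days.\n\n"
    "API keys rotate every 90 days."
)
chunks = chunk_by_paragraph(doc, max_chars=80)
for i, chunk in enumerate(chunks):
    print(i, repr(chunk))
```

Real pipelines often split on headings or sentences as well and add overlap between neighboring chunks. The same greedy packing applies once the boundary units are chosen.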

Explore related categories