Verba — The Golden RAGtriever by Weaviate
Verba is an open-source RAG (Retrieval-Augmented Generation) chatbot from the Weaviate team. Drop in PDFs, web pages, or notes; pick a model (OpenAI, Ollama, Anthropic); and get a polished chat UI with semantic search built in.
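This asset is staged safely
The copied command asks the agent to read the staged files and to confirm before activating any script, MCP configuration, or global configuration.
npx -y tokrepo@latest install e0e719be-37db-11f1-9bc6-00163e2b0d79 --target codex
The files are staged first; read the staged README and install plan before activating.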
What it is
Verba is an open-source RAG (Retrieval-Augmented Generation) chatbot from the Weaviate team. It bundles document ingestion, chunking, embedding, vector search, and chat into a single application. Upload PDFs, DOCX, Markdown, plain text, web URLs, or GitHub repos, and Verba handles the entire pipeline from ingestion to conversational retrieval.
Verba is designed for teams who want to evaluate RAG patterns, demo to stakeholders, or self-host a private knowledge assistant without writing custom code.
How it saves time or tokens
Building a RAG application from scratch requires choosing and integrating a vector store, embedding model, chunking strategy, and chat UI. Verba provides all of these in one package. The zero-infra mode runs Weaviate Embedded locally, so you do not need a separate database server. Pluggable components let you swap models and embedders without changing application code.
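The "pluggable components" idea can be illustrated with a small sketch. This is not Verba's code: the `Embedder` protocol, the two toy embedder classes, and `index_documents` are hypothetical names invented here. The point is only that application code written against an interface does not change when the implementation behind it is swapped.

```python
from __future__ import annotations

from typing import Protocol


class Embedder(Protocol):
    """Hypothetical interface: any object with an embed() method qualifies."""

    def embed(self, text: str) -> list[float]: ...


class LengthEmbedder:
    """Toy embedder: a single dimension holding the character count."""

    def embed(self, text: str) -> list[float]:
        return [float(len(text))]


class VowelEmbedder:
    """Toy embedder: one dimension per vowel, counting occurrences."""

    def embed(self, text: str) -> list[float]:
        lowered = text.lower()
        return [float(lowered.count(v)) for v in "aeiou"]


def index_documents(docs: list[str], embedder: Embedder) -> dict[str, list[float]]:
    """Application code: depends only on the Embedder interface."""
    return {doc: embedder.embed(doc) for doc in docs}


docs = ["Verba", "Weaviate"]
# Swapping the embedder changes the vectors, not the calling code.
by_length = index_documents(docs, LengthEmbedder())
by_vowels = index_documents(docs, VowelEmbedder())
```

In a real deployment the two classes would wrap actual embedding models; the toy versions keep the sketch runnable without any model or network access.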
How to use
- Install and run Verba:
pip install goldenverba
verba start
# Open http://localhost:8000
- Or run via Docker Compose:
git clone https://github.com/weaviate/Verba
cd Verba
docker compose up -d
- Upload documents through the web UI and start chatting with your knowledge base.
Example
Configuring Verba to use Ollama for local, private RAG:
# Set environment variables before starting
export OLLAMA_URL='http://localhost:11434'
export OLLAMA_MODEL='llama3'
export OLLAMA_EMBED_MODEL='nomic-embed-text'
# Start Verba with Ollama as the backend
verba start
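A launch can fail in confusing ways if one of these variables is missing, so a small pre-flight check may help. The sketch below only inspects the environment for the three variable names used in the example above; `REQUIRED_VARS` and `missing_vars` are hypothetical helpers written for this illustration and are not part of Verba.

```python
import os
from collections.abc import Mapping

# Variable names taken from the Ollama example above.
REQUIRED_VARS = ("OLLAMA_URL", "OLLAMA_MODEL", "OLLAMA_EMBED_MODEL")


def missing_vars(env: Mapping[str, str], required=REQUIRED_VARS) -> list[str]:
    """Return the required names that are unset or blank in env."""
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_vars(os.environ)
    if missing:
        print("Set these before running `verba start`:", ", ".join(missing))
    else:
        print("All Ollama variables are set.")
```

Passing the environment in as a mapping, rather than reading `os.environ` inside the function, keeps the check easy to test with a plain dictionary.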
The architecture follows a pluggable pipeline:
[Document Upload] -> [Reader] -> [Chunker] -> [Embedder] -> [Weaviate Vector Store]
                                                                     |
[Chat UI] <- [LLM Generator] <- [Semantic Search] <------------------+
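The data flow in the diagram can be traced with a toy, in-memory version of each stage. This is a sketch under strong assumptions, not how Verba is implemented: word counts stand in for a neural embedding model, a Python list stands in for the Weaviate vector store, and every function name here is hypothetical. The final prompt is what would be handed to the LLM generator.

```python
import math
import re
from collections import Counter


def chunk(text: str, size: int = 8) -> list[str]:
    """Chunker: split text into pieces of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text: str) -> Counter:
    """Embedder stand-in: lowercase word counts instead of a neural model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(store: list[tuple[str, Counter]], query: str, k: int = 1) -> list[str]:
    """Search stand-in: the top-k chunks ranked by cosine similarity."""
    q = embed(query)
    ranked = sorted(store, key=lambda item: cosine(q, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(question: str, context: list[str]) -> str:
    """Generator input: retrieved chunks followed by the user's question."""
    return "Context:\n" + "\n".join(context) + "\n\nQuestion: " + question


document = (
    "Verba bundles ingestion chunking embedding search and chat. "
    "Weaviate stores the vectors used for semantic search."
)
# Ingestion path: chunk -> embed -> store.
store = [(c, embed(c)) for c in chunk(document, size=8)]
# Query path: search -> prompt for the LLM.
question = "Where are vectors stored?"
top = search(store, question)
prompt = build_prompt(question, top)
```

Here the query shares the word "vectors" with only the second chunk, so that chunk is retrieved and placed in the prompt. A real embedding model would also match paraphrases that share no words with the query.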
Related on TokRepo
- RAG tools — retrieval-augmented generation frameworks and tools
- Self-hosted tools — privacy-focused local deployments
Common pitfalls
- Large PDF uploads without chunking configuration can exceed context windows; adjust chunk size in the settings for documents over 100 pages
- The default Weaviate Embedded mode stores data in memory; restart loses all data unless you configure persistent storage
- OpenAI API keys must be set as environment variables before starting; Verba does not prompt for them at runtime
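To make the first pitfall concrete, the sketch below shows one common way chunk size and overlap interact: a sliding window over words. It is an illustration only. `chunk_words`, `size`, and `overlap` are hypothetical names, and Verba's own chunkers are configured in its settings and may split text differently.

```python
def chunk_words(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into windows of `size` words; consecutive windows share `overlap` words."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # this window already reaches the end of the text
    return chunks


sample = " ".join(f"w{i}" for i in range(10))
small = chunk_words(sample, size=4, overlap=2)
```

Smaller windows produce more chunks that each fit comfortably in a prompt, while the overlap reduces the chance that a sentence relevant to a question is cut in half at a boundary.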
FAQ
Which LLM providers does Verba support?
Verba supports OpenAI, Anthropic Claude, Ollama (for local models), OpenRouter, Cohere, and HuggingFace models. You configure the provider through environment variables or the web UI settings panel.
Can Verba run entirely on a local machine?
Yes, with Ollama. Set Ollama as both the LLM and embedding provider, and Verba runs entirely on your local machine with no external API calls. This is useful for sensitive data that cannot leave your network.
Which document formats does Verba support?
Verba supports PDF, DOCX, Markdown, plain text, web URLs, and GitHub repositories. The Reader component extracts text from each format before passing it to the chunking pipeline.
How does Verba differ from LangChain?
LangChain is a framework for building custom LLM applications. Verba is a complete, ready-to-use RAG application. If you need a quick RAG chatbot without coding, use Verba. If you need custom logic, pipelines, or integrations beyond chat, use LangChain.
Does Verba support multiple users or tenants?
Verba provides a single shared knowledge base by default. There is no built-in user authentication or per-user document separation. For multi-tenant deployments, you would need to run separate Verba instances or put Verba behind a reverse proxy that handles authentication.