Best Self-Hosted AI Tools (2026)
Run AI locally with full privacy. Open-source LLMs, chat interfaces, knowledge bases, and development tools you can self-host on your own infrastructure.
Meetily — Privacy-First AI Meeting Assistant with Local Transcription
An open-source, self-hosted AI meeting assistant that provides real-time transcription, speaker diarization, and local summarization using Whisper and Ollama, with no cloud dependency.
Self-Hosted AI Starter Kit — Local AI with n8n
Docker Compose template by n8n that bootstraps a complete local AI environment with n8n workflow automation, Ollama LLMs, Qdrant vector database, and PostgreSQL. 14,500+ stars.
Continue — Open-Source AI Code Assistant
Open-source AI code assistant for VS Code and JetBrains. Tab autocomplete, chat, inline editing with any model — OpenAI, Anthropic, Ollama, or self-hosted.
ArchiveBox — Self-Hosted Web Archiving Platform
ArchiveBox is an open-source self-hosted web archiver that saves URLs as local HTML, PDF, screenshots, WARC, and more. Feed it bookmarks, browser history, or RSS feeds and it preserves everything for offline access.
Colanode — Open-Source Local-First Slack and Notion Alternative
A self-hosted collaboration platform combining real-time team chat, a rich document editor, and a knowledge base in a single local-first application that keeps all data on your own infrastructure.
Ollama Model Library — Best AI Models for Local Use
Curated guide to the best models available on Ollama for coding, chat, and reasoning. Compare Llama, Mistral, Gemma, Phi, and Qwen models for local AI development.
Audiobookshelf — Self-Hosted Audiobook & Podcast Server
Audiobookshelf is an open-source audiobook and podcast server with progress sync, chapter navigation, mobile apps, and multi-user support — a self-hosted Audible alternative.
Actual Budget — Local-First Personal Finance App
Actual is an open-source personal finance app with envelope budgeting, bank sync, multi-device sync, and local-first architecture — a YNAB alternative.
Immich — High-Performance Self-Hosted Photo & Video Management
Immich is an open-source Google Photos alternative with auto-backup, AI-powered search, face recognition, and mobile apps — self-hosted for complete privacy.
Stirling PDF — Self-Hosted PDF Editor & Toolkit
Stirling PDF is the #1 open-source PDF tool on GitHub. Merge, split, convert, compress, OCR, sign, and edit PDFs — all self-hosted with no data leaving your server.
Verba — The Golden RAGtriever by Weaviate
Verba is an open-source RAG (Retrieval-Augmented Generation) chatbot from the Weaviate team. Drop in PDFs, web pages, or notes; pick a model (OpenAI, Ollama, Anthropic); and get a polished chat UI with semantic search built in.
Open WebUI — Self-Hosted AI Chat Interface
User-friendly, self-hosted AI chat interface. Supports Ollama, OpenAI, Anthropic, and any OpenAI-compatible API. RAG, web search, voice, image gen, and plugins. 129K+ stars.
Firefly III — Self-Hosted Personal Finance Manager
Firefly III is an open-source personal finance manager for tracking expenses, budgets, and bank accounts. Self-hosted with full privacy, multi-currency, and powerful reporting.
Ollama — Run LLMs Locally
Run large language models locally on your machine. Supports Llama 3, Mistral, Gemma, Phi, and dozens more. One-command install, OpenAI-compatible API.
/loop — Local Recurring Task Scheduler (Boris-Style)
Open-source slash command for recurring local Claude Code tasks with a 3-day safety cap. Inspired by Boris Cherny's /loop scheduler.
HuggingFace Chat UI — Open-Source AI Chat Interface
Chat UI is Hugging Face's open-source web interface for conversational AI, powering HuggingChat and supporting any text-generation model via TGI, Ollama, or OpenAI-compatible APIs with features like web search, tool use, and multimodal input.
LobeChat — Open-Source Multi-Model Chat UI
Beautiful open-source chat UI supporting Claude, GPT-4, Gemini, Ollama, and 50+ providers. Plugin system, knowledge base, TTS, image generation, and self-hostable. 55,000+ GitHub stars.
LocalAI — Run Any AI Model Locally, No GPU
LocalAI is an open-source AI engine running LLMs, vision, voice, and image models locally. 44.6K+ GitHub stars. OpenAI/Anthropic-compatible API, 35+ backends, MCP, agents. MIT licensed.
Halo — Modern Self-Hosted Publishing Platform
Halo is an open-source content management and blogging platform built with Java and Spring Boot. It provides a polished editing experience, a plugin system, and theme marketplace for self-hosted publishing.
Vikunja — Self-Hosted Open Source To-Do and Project Management
Vikunja is an open-source task management application written in Go. It offers lists, kanban boards, Gantt charts, and CalDAV sync — a self-hosted alternative to Todoist, Trello, and Asana.
Home Assistant — Open-Source Home Automation That Puts Local Control First
Home Assistant is the most popular open-source platform for smart home automation. It integrates 3,000+ devices and services, runs entirely on local hardware (Raspberry Pi to NUC), and keeps your data off the cloud by default.
LibrePhotos — Self-Hosted AI-Powered Photo Management
A self-hosted open-source photo management service with automatic face recognition, object detection, and geolocation tagging powered by machine learning.
Refact — Local-First AI Coding Assistant
Refact is an open-source, local-first AI coding assistant: install the IDE plugin, run local refact-lsp, and connect a model provider.
frp — Fast Reverse Proxy to Expose Local Servers Behind NATs and Firewalls
frp is a high-performance reverse proxy written in Go. Expose a local HTTP/TCP/UDP service to the public internet through a relay server — the self-hosted alternative to ngrok and Cloudflare Tunnel.
Stable Diffusion Web UI by AUTOMATIC1111 — The Definitive Local AI Image Generator
AUTOMATIC1111's Stable Diffusion Web UI is the most popular interface for running Stable Diffusion locally. It supports text-to-image, image-to-image, inpainting, ControlNet, LoRA, embeddings, extensions, and every model variant — all in a self-hosted browser UI.
Tolgee — Developer-Friendly Localization Platform
An open-source localization platform that lets developers and translators manage translations through a web UI, in-context editing, and native SDK integrations for React, Vue, Angular, and more.
Local Deep Research — Privacy-First AI Research Agent
A self-hosted deep research agent that achieves near-perfect accuracy on benchmarks using local or cloud LLMs, with support for 10+ search engines and fully encrypted processing.
Cherry Studio Custom Models — BYOK Any LLM Provider
Cherry Studio Custom Models adds any OpenAI-compatible endpoint — proxy, local, or third-party. Mix Claude, GPT, Gemini, DeepSeek, Ollama side-by-side.
RustDesk — Self-Hosted Open Source Remote Desktop
RustDesk is a full-featured open-source remote desktop application written in Rust. It offers a self-hosted alternative to TeamViewer and AnyDesk, giving you full control over your data with your own relay and rendezvous servers.
SiYuan — Privacy-First Self-Hosted Knowledge Management
SiYuan is a local-first, self-hosted personal knowledge management system with block-level references, end-to-end encryption, and Markdown support.
The Self-Hosted AI Stack
Self-hosted AI has matured from a hobbyist pursuit to an enterprise requirement. Privacy regulations, data sovereignty laws, and the desire for predictable costs drive organizations to run AI on their own infrastructure.
Local LLM Inference: Ollama, Jan, and GPT4All make running models like Llama, Mistral, and Qwen as simple as installing an app, with support for GPU acceleration, quantization, and model management.
Chat Interfaces: Open WebUI, LibreChat, LobeChat, and AnythingLLM provide ChatGPT-like interfaces for your self-hosted models. Features include conversation history, file upload, RAG integration, and multi-model switching.
Knowledge Bases: Onyx, Quivr, and PrivateGPT let you build private RAG systems over your documents, so no data leaves your servers.
Development Tools: Tabby (self-hosted Copilot), SearXNG (private search), and Puter (cloud desktop) provide developer infrastructure without external dependencies. TokRepo hosts deployment configs and Docker Compose files for the entire self-hosted AI stack.
Self-hosting AI isn't about avoiding costs — it's about owning your intelligence infrastructure.
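To make the knowledge-base idea above concrete, here is a minimal sketch of the retrieval step in a RAG pipeline. It is an illustration only, not how any of the listed tools is implemented: it scores documents with bag-of-words cosine similarity as a stand-in for the learned embeddings and vector database a real system would typically use. The function names and the sample documents are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical sample documents standing in for a private knowledge base.
DOCS = [
    "Backups run nightly at 02:00 and are kept for 30 days.",
    "The VPN requires a hardware token for every login.",
    "Expense reports are due on the fifth day of each month.",
]


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True)
    return ranked[:k]


def build_prompt(query: str, docs: list[str], k: int = 2) -> str:
    """Assemble a prompt that grounds the model in the retrieved context."""
    context = "\n".join(f"- {d}" for d in retrieve(query, docs, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    print(build_prompt("How long are backups kept?", DOCS, k=1))
```

The resulting prompt would then be sent to a locally hosted model, so neither the documents nor the question leave your machine.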
Frequently Asked Questions
What hardware do I need to self-host AI?
It depends on the model size and quantization. For 7B-parameter models (good for most tasks): 16GB RAM plus any modern GPU with 8GB VRAM. For 70B models (GPT-4 class): 64GB RAM plus 48GB of VRAM (an A6000 or dual 3090s), which assumes roughly 4-bit quantized weights; at 16-bit precision the weights alone take about 140GB. For CPU-only inference: Ollama with quantized models runs on any modern laptop, just more slowly. Apple Silicon Macs with 32GB+ unified memory are excellent for local AI.
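The VRAM figures above follow from simple arithmetic: weight memory is roughly the parameter count times the bits per weight, divided by eight. The sketch below works that out for the model sizes mentioned. The 20% overhead factor for the KV cache and runtime buffers is an assumption chosen for illustration, not a measured value, and the function names are hypothetical.

```python
# Assumed headroom for KV cache, activations, and runtime buffers.
# This 1.2 multiplier is an illustrative guess, not a benchmark result.
OVERHEAD_FACTOR = 1.2


def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Memory needed for the model weights alone, in GB (10^9 bytes)."""
    return params_billions * bits_per_weight / 8


def fits_in_vram(params_billions: float, bits_per_weight: float, vram_gb: float) -> bool:
    """Rough check: do the weights plus the assumed overhead fit in the given VRAM?"""
    return weight_memory_gb(params_billions, bits_per_weight) * OVERHEAD_FACTOR <= vram_gb


if __name__ == "__main__":
    for params, bits, vram in [(7, 4, 8), (7, 16, 8), (70, 4, 48), (70, 16, 48)]:
        weights = weight_memory_gb(params, bits)
        verdict = "fits" if fits_in_vram(params, bits, vram) else "does not fit"
        print(f"{params}B at {bits}-bit: {weights:.1f} GB of weights, {verdict} in {vram} GB VRAM")
```

Under these assumptions, a 70B model fits in 48GB only when quantized to around 4 bits, which is why the 48GB recommendation implies quantized weights.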
Is self-hosted AI as good as cloud APIs?
For many tasks, yes. Open-source models like Llama 3.1 70B and Qwen 2.5 72B match GPT-4 on coding, analysis, and general reasoning. They fall short on the most complex multi-step reasoning and creative tasks where Claude Opus or GPT-4o still lead. The gap narrows every quarter. For most business applications, self-hosted models are "good enough" with dramatically better privacy and cost.
What is the easiest way to start with self-hosted AI?
Install Ollama (one command on Mac/Linux/Windows), pull a model ("ollama pull llama3.1"), then install Open WebUI for a ChatGPT-like interface. Total setup time: under 10 minutes. TokRepo hosts Docker Compose configs that bundle Ollama + Open WebUI + RAG pipeline into a single deployment.
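Once Ollama is running and the model has been pulled, you can also talk to it from code. The sketch below sends a prompt to Ollama's native `/api/generate` endpoint using only the Python standard library. It assumes Ollama is listening on its default address, `http://localhost:11434`, and that the `llama3.1` model from the step above is available; the helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "llama3.1") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "llama3.1", timeout: float = 120.0) -> str:
    """Send the prompt and return the generated text from the JSON reply."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    # Requires a running Ollama server with the model already pulled.
    print(generate("Explain quantization in one sentence."))
```

Setting `"stream": False` asks for a single JSON reply instead of a stream of chunks, which keeps the client simple at the cost of waiting for the full answer.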