ConfigsApr 19, 2026·2 min read

BentoPDF — Privacy-First Self-Hosted PDF Toolkit

BentoPDF is a self-hosted web application that provides a comprehensive set of PDF tools including merging, splitting, converting, and OCR without sending files to external services.

AI
AI Open Source · Community
Quick Use

Use it first, then decide how deep to go

This block should tell both the user and the agent what to copy, install, and apply first.

docker run -d -p 3000:3000 --name bentopdf ghcr.io/alam00000/bentopdf

Introduction

BentoPDF bundles the most common PDF operations into a single self-hosted web app. Because everything runs on your own server, sensitive documents never leave your network.

What BentoPDF Does

  • Merges multiple PDF files into a single document
  • Splits PDFs by page ranges or extracts individual pages
  • Converts images (JPG, PNG) to PDF and PDF pages to images
  • Performs OCR on scanned documents to make text searchable
  • Compresses PDF file sizes while preserving readability

Architecture Overview

BentoPDF is a JavaScript and TypeScript application using pdf-lib, Tesseract.js, and other libraries for PDF manipulation. The web frontend provides a drag-and-drop interface that communicates with a backend API. All processing happens server-side in the container.

Self-Hosting & Configuration

  • Deploy with a single Docker command, no external database needed
  • Configure upload size limits via environment variables
  • Mount a volume if you want to persist processed files
  • Place behind a reverse proxy for TLS and access control
  • Customize the UI theme through configuration options

Key Features

  • All-in-one PDF operations in a single web interface
  • Privacy-first: files are processed locally and never leave your server
  • OCR support for extracting text from scanned PDFs
  • Batch processing for handling multiple files at once
  • Lightweight container with minimal system requirements

Comparison with Similar Tools

  • Stirling PDF — similar self-hosted PDF toolkit with a broader feature set
  • ILovePDF — cloud-hosted service that processes files on external servers; BentoPDF keeps files local
  • Smallpdf — freemium cloud service with usage limits; BentoPDF is free and unlimited
  • pdftk — CLI-only tool; BentoPDF provides a graphical web interface
  • ConvertX — general file converter; BentoPDF specializes in PDF-specific operations

FAQ

Q: Does BentoPDF require a database? A: No, it runs without an external database. File processing is stateless.

Q: What OCR languages are supported? A: BentoPDF uses Tesseract for OCR and supports multiple languages. Additional language packs can be added to the container.

Q: Is there a file size limit? A: The default limit is configurable. Since you host it yourself, you can set it as high as your server resources allow.

Q: Can I use BentoPDF for batch operations? A: Yes, you can upload and process multiple files in a single operation through the web interface.

Sources

Discussion

Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.

Related Assets