# Real-ESRGAN — Practical Image and Video Super-Resolution > General-purpose image and video restoration tool that trains on pure synthetic data to handle real-world degradations including blur, noise, JPEG compression, and resize artifacts. ## Install Save the content below to `.claude/skills/` or append to your `CLAUDE.md`: # Real-ESRGAN — Practical Image and Video Super-Resolution ## Quick Use ```bash pip install realesrgan wget https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.5.0/realesr-general-x4v3.pth -P weights python inference_realesrgan.py -n realesr-general-x4v3 -i inputs -o results ``` ## Introduction Real-ESRGAN extends ESRGAN to handle practical real-world image restoration by training exclusively on synthetic degradation pipelines. This eliminates the need for paired real-world training data while covering complex degradation combinations that occur in photographs, video frames, and web images. ## What Real-ESRGAN Does - Upscales images 2x or 4x while removing compression artifacts and noise - Handles complex real-world degradations via high-order degradation modeling - Processes video frame-by-frame with temporal consistency options - Provides specialized anime/illustration models alongside general-purpose ones - Supports face enhancement via integration with GFPGAN ## Architecture Overview Real-ESRGAN uses an ESRGAN generator (RRDB network) trained with a second-order degradation pipeline that synthesizes realistic artifacts by chaining blur, resize, noise, and JPEG compression twice. A U-Net discriminator with spectral normalization provides stable adversarial training. The synthetic pipeline covers degradation combinations that first-order models miss, producing outputs that generalize to unseen real-world inputs. ## Self-Hosting & Configuration - Install via pip: `pip install realesrgan` or clone the repository - Download model weights from GitHub releases (general, anime, or face-specific) - Requires PyTorch and a CUDA GPU; CPU fallback available but slow - Use `--tile` flag for large images that exceed GPU memory - Integrate with ffmpeg for video processing via the provided script ## Key Features - Pure synthetic training: no paired real-world data needed - High-order degradation model covers complex artifact combinations - Multiple pretrained models for photos, anime, and video - Tile-based processing for memory-constrained environments - Python API and command-line interface for batch processing ## Comparison with Similar Tools - **GFPGAN** — specialized for face restoration; Real-ESRGAN handles general images - **SwinIR** — transformer-based restoration with slightly different quality characteristics - **BSRGAN** — similar synthetic degradation approach but first-order pipeline - **waifu2x** — older CNN upscaler, primarily for anime art - **Topaz Video AI** — commercial alternative with proprietary models ## FAQ **Q: What resolution images can Real-ESRGAN process?** A: Any size, using the `--tile` option to split large images into overlapping patches that fit in GPU memory. **Q: Is there a model for anime or illustration art?** A: Yes, the `realesr-animevideov3` model is specifically trained on anime-style content. **Q: How does it compare to commercial upscalers?** A: Real-ESRGAN produces competitive results for general photography; commercial tools may have additional temporal consistency for video. **Q: Can I train my own model?** A: Yes, the repository provides full training scripts with configurable degradation pipelines. ## Sources - https://github.com/xinntao/Real-ESRGAN - https://real-esrgan.readthedocs.io/ --- Source: https://tokrepo.com/en/workflows/real-esrgan-practical-image-video-super-resolution-73d0fc65 Author: AI Open Source