# Real-ESRGAN — Practical Image and Video Super-Resolution

> General-purpose image and video restoration tool that trains on pure synthetic data to handle real-world degradations including blur, noise, JPEG compression, and resize artifacts.

## Install

Save the content below to `.claude/skills/` or append to your `CLAUDE.md`:

# Real-ESRGAN — Practical Image and Video Super-Resolution

## Quick Use
```bash
pip install realesrgan
wget https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.5.0/realesr-general-x4v3.pth -P weights
python inference_realesrgan.py -n realesr-general-x4v3 -i inputs -o results
```

## Introduction
Real-ESRGAN extends ESRGAN to handle practical real-world image restoration by training exclusively on synthetic degradation pipelines. This eliminates the need for paired real-world training data while covering complex degradation combinations that occur in photographs, video frames, and web images.

## What Real-ESRGAN Does
- Upscales images 2x or 4x while removing compression artifacts and noise
- Handles complex real-world degradations via high-order degradation modeling
- Processes video frame-by-frame with temporal consistency options
- Provides specialized anime/illustration models alongside general-purpose ones
- Supports face enhancement via integration with GFPGAN

## Architecture Overview
Real-ESRGAN uses an ESRGAN generator (RRDB network) trained with a second-order degradation pipeline that synthesizes realistic artifacts by chaining blur, resize, noise, and JPEG compression twice. A U-Net discriminator with spectral normalization provides stable adversarial training. The synthetic pipeline covers degradation combinations that first-order models miss, producing outputs that generalize to unseen real-world inputs.

## Self-Hosting & Configuration
- Install via pip: `pip install realesrgan` or clone the repository
- Download model weights from GitHub releases (general, anime, or face-specific)
- Requires PyTorch and a CUDA GPU; CPU fallback available but slow
- Use `--tile` flag for large images that exceed GPU memory
- Integrate with ffmpeg for video processing via the provided script

## Key Features
- Pure synthetic training: no paired real-world data needed
- High-order degradation model covers complex artifact combinations
- Multiple pretrained models for photos, anime, and video
- Tile-based processing for memory-constrained environments
- Python API and command-line interface for batch processing

## Comparison with Similar Tools
- **GFPGAN** — specialized for face restoration; Real-ESRGAN handles general images
- **SwinIR** — transformer-based restoration with slightly different quality characteristics
- **BSRGAN** — similar synthetic degradation approach but first-order pipeline
- **waifu2x** — older CNN upscaler, primarily for anime art
- **Topaz Video AI** — commercial alternative with proprietary models

## FAQ
**Q: What resolution images can Real-ESRGAN process?**
A: Any size, using the `--tile` option to split large images into overlapping patches that fit in GPU memory.

**Q: Is there a model for anime or illustration art?**
A: Yes, the `realesr-animevideov3` model is specifically trained on anime-style content.

**Q: How does it compare to commercial upscalers?**
A: Real-ESRGAN produces competitive results for general photography; commercial tools may have additional temporal consistency for video.

**Q: Can I train my own model?**
A: Yes, the repository provides full training scripts with configurable degradation pipelines.

## Sources
- https://github.com/xinntao/Real-ESRGAN
- https://real-esrgan.readthedocs.io/

---
Source: https://tokrepo.com/en/workflows/real-esrgan-practical-image-video-super-resolution-73d0fc65
Author: AI Open Source