How do I install OpenNMT-py — Open Source Neural Machine Translation in PyTorch?

Visit the asset page on TokRepo and click "Copy for agent" to get the installation instructions. Most assets can be installed with a single command.

OpenNMT-py — Open Source Neural Machine Translation in PyTorch

Introduction

OpenNMT is an open-source neural machine translation ecosystem started by Harvard NLP and SYSTRAN. The PyTorch version (OpenNMT-py) provides a flexible toolkit for training Transformer and RNN-based sequence-to-sequence models, and has expanded to support large language model fine-tuning and inference. It is used in research and production translation systems worldwide.

What OpenNMT-py Does

Train Transformer and RNN encoder-decoder models for translation
Fine-tune and serve large language models with LoRA/quantization support
Multi-GPU and distributed training with mixed-precision support
Beam search, random sampling, and constrained decoding for inference
Vocabulary building, data preprocessing, and tokenization pipelines

Architecture Overview

OpenNMT-py uses a modular architecture with configurable encoders, decoders, attention mechanisms, and generators. Configuration is YAML-driven, making experiments reproducible. The training loop supports gradient accumulation, learning rate scheduling, and checkpoint averaging. The inference server supports REST API translation with batching. Recent versions integrate CTranslate2 for optimized production inference.

Self-Hosting & Configuration

Install via pip: pip install OpenNMT-py
Requires Python 3.8+ and PyTorch
All configuration through YAML files for reproducibility
Multi-GPU training via PyTorch distributed data parallel
Production serving with the built-in REST translation server

Key Features

Full Transformer and RNN architectures with copy mechanism and coverage
LLM support for fine-tuning and inference with quantization
CTranslate2 integration for fast production-grade translation
SentencePiece and BPE tokenization built into the pipeline
Comprehensive documentation with step-by-step tutorials

Comparison with Similar Tools

Hugging Face Transformers — broader model hub; OpenNMT-py is optimized specifically for translation workflows
Fairseq — Meta's sequence modeling toolkit; OpenNMT-py has a simpler YAML-driven interface
MarianMT — lightweight C++ translation engine; OpenNMT-py offers more flexibility for research
Argos Translate — offline translation app; OpenNMT-py is the underlying training and model toolkit

FAQ

Q: Can I use OpenNMT-py for non-translation tasks? A: Yes. OpenNMT-py supports any sequence-to-sequence task including summarization, dialogue, and grammatical error correction.

Q: Does OpenNMT-py support LLM fine-tuning? A: Yes. Recent versions support LoRA adapters and quantization (4-bit, 8-bit) for fine-tuning large language models.

Q: How do I deploy a trained model? A: Use the built-in REST server or export to CTranslate2 format for optimized CPU/GPU inference in production.

Q: What tokenization does OpenNMT-py support? A: SentencePiece (unigram and BPE), BPE with subword-nmt, and character-level tokenization.

OpenNMT-py — Open Source Neural Machine Translation in PyTorch

Ready-to-run agent install

Introduction

What OpenNMT-py Does

Architecture Overview

Self-Hosting & Configuration

Key Features

Comparison with Similar Tools

FAQ

Sources

Discussion

Related Assets

CodeGeeX — Open Multilingual Code Generation Model

Aegis Authenticator — Secure Open-Source 2FA for Android

Open-Sora — Open-Source Text-to-Video Generation

PyTorch — The Deep Learning Framework for Research and Production