Redpanda Connect (Benthos) — Declarative Stream Processing Engine
Redpanda Connect, formerly known as Benthos, is a high-performance declarative stream processor written in Go. It connects, transforms, and routes data between any combination of sources and sinks using simple YAML configuration files.
Installation agent prête
Cet actif peut être installé après choix du runtime, vérification du plan et exécution de la commande adaptée.
npx -y tokrepo@latest install e95cabb2-39eb-11f1-9bc6-00163e2b0d79 --target codexÀ exécuter après confirmation du plan en dry-run.
What it is
Redpanda Connect (formerly Benthos) is a high-performance declarative stream processor written in Go. It connects, transforms, and routes data between 200+ sources and sinks using YAML configuration files. You define inputs, processors, and outputs without writing custom code.
The tool serves data engineers, backend developers, and DevOps teams who need to move and transform data between systems like Kafka, databases, APIs, and file stores.
The project is actively maintained with regular releases and a growing user community. Documentation covers common use cases, and the open-source nature means you can inspect the source code, contribute fixes, and adapt the tool to your specific requirements.
How it saves time or tokens
Instead of writing custom ETL scripts for each data pipeline, Redpanda Connect lets you define the entire flow in YAML. Adding a new source or transformation is a config change, not a code change. The single-binary deployment eliminates dependency management and simplifies CI/CD.
How to use
- Install the binary:
brew install redpanda-data/tap/connector download from GitHub releases. - Create a YAML config defining your input, pipeline processors, and output.
- Run with
rpk connect run config.yml.
Example
# config.yml - Read from Kafka, transform JSON, write to PostgreSQL
input:
kafka:
addresses: ['localhost:9092']
topics: ['events']
consumer_group: 'connect-group'
pipeline:
processors:
- mapping: |
root.user_id = this.user.id
root.event = this.type
root.timestamp = this.created_at
output:
sql_insert:
driver: postgres
dsn: 'postgres://user:pass@localhost:5432/analytics'
table: events
columns: ['user_id', 'event', 'timestamp']
Related on TokRepo
- AI Tools for Automation -- automation and data pipeline tools
- AI Tools for DevOps -- infrastructure and operations tooling
Common pitfalls
- The
mappingprocessor uses Bloblang, a domain-specific language. Read the Bloblang docs before writing complex transformations. - Redpanda Connect buffers messages in memory by default. For high-throughput pipelines, configure explicit buffer limits to avoid OOM.
- The rename from Benthos to Redpanda Connect means older tutorials reference the
benthosbinary. Userpk connectfor current versions.
Before adopting this tool, evaluate whether it fits your team's existing workflow. Read the official documentation thoroughly, and start with a small proof-of-concept rather than a full migration. Community forums, GitHub issues, and Stack Overflow are valuable resources when you encounter edge cases not covered in the documentation.
Questions fréquentes
Benthos was renamed to Redpanda Connect after Redpanda acquired the project. The core functionality is the same. The CLI binary changed from benthos to rpk connect, and new connectors specific to Redpanda have been added.
Redpanda Connect supports 200+ connectors including Kafka, PostgreSQL, MySQL, S3, HTTP, AMQP, NATS, GCP Pub/Sub, and many more. Both source (input) and sink (output) connectors are available.
Yes. Written in Go with minimal allocations, Redpanda Connect handles millions of messages per second on modest hardware. It supports parallel processing and batching for throughput optimization.
No. Redpanda Connect works with any supported source and sink. It is broker-agnostic and connects to Kafka, RabbitMQ, NATS, or any other supported system independently of Redpanda.
Use rpk connect test to run unit tests on your Bloblang mappings. You can also use the stdin/stdout connectors to test transformations locally with sample data before connecting to production systems.
Sources citées (3)
- Redpanda Connect GitHub— Declarative stream processor formerly known as Benthos
- Redpanda Connect Docs— 200+ connectors for sources and sinks
- Bloblang Docs— Bloblang mapping language for data transformations
En lien sur TokRepo
Fil de discussion
Actifs similaires
Redpanda — Kafka-Compatible Streaming Platform Without JVM
Redpanda is a Kafka-compatible streaming data platform written in C++. It is a drop-in replacement for Kafka that is simpler to operate — no JVM, no ZooKeeper, single binary deployment — with lower latency and built-in schema registry.
Babylon.js — Powerful 3D Game and Rendering Engine
Babylon.js is a powerful, beautiful, simple, open 3D game and rendering engine for the web. WebGL + WebGPU, Playground IDE, Node Material Editor, GUI system, physics, and VR/AR support. Microsoft-backed with enterprise polish.
Cadence — Distributed Workflow Execution Engine by Uber
Cadence is a distributed, scalable, fault-tolerant workflow orchestration engine developed by Uber for executing long-running business logic as durable, stateful workflows that survive process and infrastructure failures.
Turf.js — Advanced Geospatial Analysis for JavaScript
A modular geospatial analysis engine written in JavaScript and TypeScript for browsers and Node.js.