Esta página se muestra en inglés. Una traducción al español está en curso.
SkillsApr 13, 2026·3 min de lectura

Apache Pulsar — Cloud-Native Distributed Messaging and Streaming

Apache Pulsar is a cloud-native distributed messaging and streaming platform. It combines the best of traditional messaging (like RabbitMQ) with streaming (like Kafka) — providing multi-tenancy, geo-replication, and tiered storage in a single system.

Listo para agents

Instalación lista para agent

Este activo puede instalarse después de elegir el runtime, revisar el plan y ejecutar el comando correspondiente.

Native · 98/100Política: permitir
Superficie agent
Cualquier agent MCP/CLI
Tipo
Skill
Instalación
Single
Confianza
Confianza: Community
Entrada
step-1.md
Comando de instalación directa
npx -y tokrepo@latest install 8d354adf-3734-11f1-9bc6-00163e2b0d79 --target codex

Ejecutar después de confirmar el plan con dry-run.

TL;DR
Apache Pulsar unifies messaging and streaming in one platform with multi-tenancy, geo-replication, and tiered storage.
§01

What it is

Apache Pulsar is a cloud-native distributed messaging and streaming platform that combines the capabilities of traditional message queues (like RabbitMQ) with event streaming (like Kafka) in a single system. It provides multi-tenancy, geo-replication, and tiered storage as built-in features rather than add-ons.

Pulsar is designed for platform teams and backend engineers who need a unified messaging layer that scales from simple pub-sub to complex event streaming without deploying separate systems for each use case.

§02

How it saves time or tokens

Pulsar's architecture separates compute (brokers) from storage (BookKeeper), which means you can scale throughput and storage independently. This eliminates the rebalancing pain common with broker-storage-coupled systems. Multi-tenancy is built in, so a single Pulsar cluster can serve multiple teams with namespace-level isolation, reducing operational overhead.

The unified messaging model means you do not need to maintain separate Kafka clusters for streaming and RabbitMQ for queuing. One Pulsar cluster handles both patterns with topic-level configuration.

§03

How to use

  1. Start Pulsar with Docker: docker run -d --name pulsar -p 6650:6650 -p 8080:8080 apachepulsar/pulsar:latest bin/pulsar standalone.
  2. Produce a message: bin/pulsar-client produce my-topic --messages 'hello pulsar'.
  3. Consume messages: bin/pulsar-client consume my-topic -s my-sub --num-messages 0.
§04

Example

# Start Pulsar standalone in Docker
docker run -d --name pulsar \
  -p 6650:6650 -p 8080:8080 \
  apachepulsar/pulsar:latest bin/pulsar standalone

# Produce messages
bin/pulsar-client produce my-topic --messages 'hello pulsar'

# Consume messages
bin/pulsar-client consume my-topic -s my-subscription --num-messages 0

# Python client
pip install pulsar-client
import pulsar

client = pulsar.Client('pulsar://localhost:6650')
producer = client.create_producer('my-topic')
producer.send('hello from python'.encode())
client.close()
§05

Related on TokRepo

§06

Common pitfalls

  • Pulsar standalone mode is for development only; production deployments require a ZooKeeper cluster and BookKeeper ensemble, which adds operational complexity.
  • The broker-storage separation is powerful but means more moving parts to monitor; invest in observability (Prometheus metrics are built in) from day one.
  • Client library support varies by language; Java and Python clients are most mature, while Go and Node.js clients may lag in feature parity.

Preguntas frecuentes

How does Pulsar compare to Kafka?+

Pulsar separates compute (brokers) from storage (BookKeeper), enabling independent scaling. Kafka couples brokers and storage, requiring partition rebalancing when scaling. Pulsar also provides built-in multi-tenancy and geo-replication that Kafka requires additional tooling for.

What is multi-tenancy in Pulsar?+

Pulsar supports tenant and namespace isolation at the cluster level. Different teams or applications can share a single Pulsar cluster with independent topic namespaces, access controls, and resource quotas.

Does Pulsar support exactly-once semantics?+

Yes. Pulsar supports exactly-once message delivery through transactional messaging. Producers can send messages within transactions, and consumers can acknowledge messages atomically, ensuring no duplicates or losses.

What is tiered storage in Pulsar?+

Tiered storage automatically offloads older messages from BookKeeper to cheaper object storage (S3, GCS, Azure Blob). This lets you retain months or years of data without the cost of keeping it all on fast storage.

Can I run Pulsar Functions for stream processing?+

Yes. Pulsar Functions is a lightweight compute framework for processing messages in-flight. Functions can transform, route, or enrich messages without deploying a separate stream processing framework.

Referencias (3)

Discusión

Inicia sesión para unirte a la discusión.
Aún no hay comentarios. Sé el primero en compartir tus ideas.

Activos relacionados