Skills2026年4月14日·1 分钟阅读

Databend — Cloud-Native Open-Source Data Warehouse Built in Rust

Databend is a modern cloud data warehouse with separation of storage and compute on object storage. Written in Rust for extreme performance, it is a self-hostable alternative to Snowflake with full Snowflake-style SQL compatibility.

Agent 就绪

先审查再安装

这个资产需要先审查。复制的指令会要求 Agent dry-run、列出写入项,确认后再继续。

Needs Confirmation · 64/100策略:需确认
Agent 入口
任意 MCP/CLI Agent
类型
Skill
安装
Single
信任
信任等级:Established
入口
step-1.md
先审查命令
npx -y tokrepo@latest install 09c00758-37d2-11f1-9bc6-00163e2b0d79 --target codex

先 dry-run,确认写入项后再运行此命令。

TL;DR
Databend separates storage and compute on object storage, offering a self-hostable Snowflake alternative.
§01

What it is

Databend is a modern cloud-native data warehouse written in Rust. It separates storage and compute, runs on object storage (S3, GCS, Azure Blob), and provides Snowflake-compatible SQL. It is designed as a self-hostable alternative to Snowflake for analytical workloads.

Data engineers and analytics teams who need a cost-efficient analytical database on their own infrastructure, with elastic compute and pay-per-query economics, will find Databend a fit.

§02

How it saves time or tokens

Databend's storage-compute separation means you pay only for compute when running queries, while data sits cheaply on object storage. The Rust implementation provides high throughput with minimal resource overhead. Snowflake SQL compatibility reduces migration effort.

§03

How to use

  1. Deploy Databend on your infrastructure or use Databend Cloud.
  2. Configure object storage (S3, MinIO, or compatible) as the storage backend.
  3. Connect with any MySQL or ClickHouse-compatible client.
  4. Run SQL queries against your data.
# Start Databend with Docker
docker run -d --name databend \
  -p 8000:8000 -p 3307:3307 \
  -v databend-data:/var/lib/databend \
  datafuselabs/databend

# Connect via MySQL client
mysql -h 127.0.0.1 -P 3307 -u root
§04

Example

Create a table and load data from S3:

CREATE TABLE events (
  event_id BIGINT,
  user_id INT,
  event_type VARCHAR,
  created_at TIMESTAMP
);

COPY INTO events
  FROM 's3://my-bucket/events/'
  FILE_FORMAT = (type = 'PARQUET');

SELECT event_type, COUNT(*)
  FROM events
  GROUP BY event_type
  ORDER BY COUNT(*) DESC;
§05

Related on TokRepo

§06

Common pitfalls

  • Object storage latency is higher than local SSDs. Databend optimizes for throughput, not single-query latency.
  • Snowflake SQL compatibility is extensive but not 100%. Test your specific queries during migration.
  • Self-hosted deployments require managing meta-service nodes for cluster coordination.

常见问题

How does Databend compare to Snowflake?+

Databend provides similar separation of storage and compute with SQL compatibility. The key difference is Databend is open-source and self-hostable, while Snowflake is a fully managed SaaS. Databend trades managed convenience for cost control and data sovereignty.

What object storage does Databend support?+

Databend supports Amazon S3, Google Cloud Storage, Azure Blob Storage, MinIO, and any S3-compatible object storage. This lets you run Databend on-premises with MinIO or in any major cloud.

Is Databend OLTP or OLAP?+

Databend is an OLAP (analytical) database. It is optimized for complex aggregation queries over large datasets, not for transactional workloads with many small reads and writes.

Can I connect to Databend from existing tools?+

Yes. Databend supports MySQL and ClickHouse wire protocols. Any tool that connects to MySQL (DBeaver, DataGrip, Python mysql-connector) works with Databend.

What language is Databend written in?+

Databend is written in Rust for performance and memory safety. The Rust implementation enables high query throughput with predictable resource usage.

引用来源 (3)

讨论

登录后参与讨论。
还没有评论,来写第一条吧。

相关资产