Skills2026年4月14日·1 分钟阅读

Candle — Minimalist Machine Learning Framework for Rust

Candle is a Rust-native ML framework focused on inference performance, small binaries, and serverless deployment. It runs Llama, Whisper, Stable Diffusion, and other PyTorch models in pure Rust — no Python required.

AI Open Source · Community

Agent 就绪

Agent 可直接安装

这个资产可安装；Agent 先选择当前运行时、检查安装计划，再运行匹配命令。

Native · 98/100策略：允许

Agent 入口

任意 MCP/CLI Agent

类型

Skill

安装

Single

信任

信任等级：Established

入口

step-1.md

直接安装命令

npx -y tokrepo@latest install b113c394-37db-11f1-9bc6-00163e2b0d79 --target codex

先 dry-run 确认安装计划，再运行此命令。

TL;DR

Candle runs ML inference in pure Rust with small binaries, no Python required, supporting Llama and Whisper.

§01

What it is

Candle is a Rust-native machine learning framework focused on inference performance, small binaries, and serverless deployment. Built by Hugging Face, it runs Llama, Whisper, Stable Diffusion, and other PyTorch models in pure Rust without requiring Python.

Candle is designed for ML engineers and systems developers who need fast inference with minimal dependencies, especially for edge deployment, WebAssembly targets, or serverless functions where Python runtimes add overhead.

§02

How it saves time or tokens

Candle produces small, self-contained binaries that start in milliseconds compared to Python-based inference servers that load heavy runtimes. No Python dependency chain means no version conflicts, no pip install issues, and predictable builds. CUDA and Metal support provides GPU acceleration on par with PyTorch for supported models.

§03

How to use

Add Candle to your Rust project:

cargo add candle-core candle-nn candle-transformers

Run a pre-built example:

cargo run --example llama -- --prompt 'Hello, world'

Use the tensor API:

use candle_core::{Device, Tensor};

fn main() -> anyhow::Result<()> {
    let device = Device::Cpu;
    let a = Tensor::randn(0f32, 1., (3, 3), &device)?;
    let b = Tensor::randn(0f32, 1., (3, 3), &device)?;
    let c = a.matmul(&b)?;
    println!("{c}");
    Ok(())
}

§04

Example

// Load and run Whisper for speech-to-text
use candle_transformers::models::whisper;

fn transcribe(audio_path: &str) -> anyhow::Result<String> {
    let device = Device::Cpu;
    let model = whisper::model::Whisper::load(
        "openai/whisper-base",
        &device,
    )?;
    let result = model.transcribe(audio_path)?;
    Ok(result.text)
}

§05

Related on TokRepo

AI coding tools — ML and AI development frameworks
Local LLM tools — running models locally

§06

Common pitfalls

Expecting full training support: Candle is optimized for inference, not large-scale training
Not enabling the cuda or metal feature flags for GPU acceleration
Trying to load PyTorch checkpoints directly without converting to safetensors format first

常见问题

How does Candle compare to PyTorch?+

Candle focuses on inference with small binaries and fast startup. PyTorch is a full training and inference framework with a massive ecosystem. Use Candle for production inference in Rust; use PyTorch for research and training.

Does Candle support GPU acceleration?+

Yes. Candle supports CUDA (NVIDIA) and Metal (Apple Silicon) through feature flags. Enable them in your Cargo.toml to use GPU acceleration for tensor operations and model inference.

Which models can Candle run?+

Candle supports Llama, Mistral, Whisper, Stable Diffusion, BERT, T5, and many other transformer architectures. The candle-transformers crate provides pre-built model implementations.

Can Candle compile to WebAssembly?+

Yes. Candle's pure Rust implementation allows compilation to WASM for browser-based inference. This enables running ML models directly in the browser without a server.

Who maintains Candle?+

Candle is maintained by Hugging Face as part of their Rust ML ecosystem. It integrates with the Hugging Face Hub for model downloads and uses the safetensors format for model weights.

引用来源 (3)

Candle GitHub— Candle Rust ML framework by Hugging Face
Safetensors GitHub— Safetensors format for model weights
Hugging Face Blog— Rust machine learning ecosystem

讨论

登录后参与讨论。

还没有评论，来写第一条吧。

Candle — Minimalist Machine Learning Framework for Rust

Agent 可直接安装

What it is

How it saves time or tokens

How to use

Example

Related on TokRepo

Common pitfalls

常见问题

引用来源 (3)

TokRepo 相关

讨论

相关资产

tinygrad — Minimalist Deep Learning Framework

ggml — Lightweight Tensor Library for Machine Learning in C

Apache TVM — Open Machine Learning Compiler Framework

PostgresML — Machine Learning Inside PostgreSQL