Skills2026年5月11日·1 分钟阅读

safetensors — Safe and Fast Tensor Serialization

A simple file format for storing tensors safely and efficiently, designed to eliminate security risks from pickle-based model files.

Agent 就绪

这个资产可以被 Agent 直接读取和安装

TokRepo 同时提供通用 CLI 命令、安装契约、metadata JSON、按适配器生成的安装计划和原始内容链接,方便 Agent 判断适配度、风险和下一步动作。

Native · 98/100策略:允许
Agent 入口
任意 MCP/CLI Agent
类型
Skill
安装
Single
信任
信任等级:Established
入口
safetensors Overview
通用 CLI 安装命令
npx tokrepo install 083a4cf0-4cd1-11f1-9bc6-00163e2b0d79

Introduction

safetensors is a file format and library for storing and loading tensors without the security risks of Python pickle. Created by Hugging Face, it provides zero-copy deserialization and prevents arbitrary code execution, making it the recommended format for distributing machine learning model weights.

What safetensors Does

  • Stores tensors in a flat binary format with a JSON header for metadata
  • Prevents arbitrary code execution attacks inherent in pickle-based formats
  • Enables zero-copy memory-mapped loading for fast deserialization
  • Supports PyTorch, TensorFlow, Flax/JAX, PaddlePaddle, and NumPy tensors
  • Provides bindings in Python and Rust for cross-language compatibility

Architecture Overview

A safetensors file consists of a fixed 8-byte header size field, a JSON header containing tensor names, data types, shapes, and byte offsets, followed by a contiguous data buffer. Loading maps the data region into memory without copying, and the header is parsed to locate each tensor by offset. The Rust core handles serialization and validation, with Python bindings via PyO3.

Self-Hosting & Configuration

  • Install: pip install safetensors
  • Save PyTorch tensors: save_file({"layer.weight": tensor}, "model.safetensors")
  • Load with memory mapping: load_file("model.safetensors", device="cpu")
  • Convert existing pickle checkpoints: use torch.load() then save_file()
  • Hugging Face Hub uses safetensors as the default format for model uploads

Key Features

  • Security by design — no arbitrary code execution during loading
  • Zero-copy deserialization with memory-mapped I/O for fast startup
  • Lazy loading of individual tensors without reading the entire file
  • Cross-framework support for PyTorch, TensorFlow, JAX, and NumPy
  • Compact format with no overhead beyond the JSON header

Comparison with Similar Tools

  • pickle/torch.save — flexible but allows arbitrary code execution; safetensors is safe by design
  • ONNX — model interchange with graph structure; safetensors stores raw weight tensors only
  • NumPy .npy/.npz — NumPy-specific; safetensors supports multiple frameworks and metadata
  • HDF5 — hierarchical data format with complex features; safetensors is simpler and faster for tensors

FAQ

Q: Why not just use pickle for model weights? A: Pickle can execute arbitrary Python code during loading, creating a security risk when downloading models from untrusted sources.

Q: Can safetensors store model architecture along with weights? A: No. It stores only tensor data and metadata. Model architecture is defined in code or config files.

Q: Is safetensors compatible with Hugging Face Transformers? A: Yes. Transformers uses safetensors by default when saving and loading models.

Q: What happens if a safetensors file is corrupted? A: The format validates the header before reading data. Corrupted files produce clear errors instead of silent data corruption.

Sources

讨论

登录后参与讨论。
还没有评论,来写第一条吧。

相关资产