Cette page est affichée en anglais. Une traduction française est en cours.
ScriptsMar 31, 2026·2 min de lecture

Llama Stack — Meta Official LLM App Framework

Official Meta framework for building LLM applications with Llama models. Inference, safety, RAG, agents, evals, and tool use. Standardized APIs. 8.3K+ stars.

Introduction

Llama Stack is Meta's official framework for building LLM applications with Llama models. It provides standardized APIs for inference, safety (Llama Guard), RAG, agentic workflows, evaluations, tool use, and memory — all designed to work seamlessly with Llama 3, 3.1, and 3.2 models. Deploy locally, in the cloud, or on-device. 8,300+ GitHub stars, MIT licensed.

Best for: Developers building production apps with Meta's Llama models Works with: Llama 3/3.1/3.2, Ollama, Together, Fireworks, AWS Bedrock, NVIDIA NIM


Core APIs

API Description
Inference Chat completion, text generation, embeddings
Safety Content moderation with Llama Guard / Prompt Guard
Agents Multi-step agentic workflows with tool use and memory
RAG Document ingestion, vector search, contextual retrieval
Eval Benchmark and evaluate model quality
Memory Persistent memory banks for agent context
Tool Use Web search, code execution, Wolfram Alpha, custom tools

Distribution Providers

Run anywhere with pluggable backends:

  • Local: Ollama, vLLM, TGI
  • Cloud: Together, Fireworks, AWS Bedrock, NVIDIA NIM
  • On-device: Qualcomm, MediaTek, PyTorch ExecuTorch

FAQ

Q: What is Llama Stack? A: Meta's official framework for building LLM apps with Llama models. Provides standardized APIs for inference, safety, RAG, agents, and evals. 8.3K+ stars, MIT licensed.

Q: Can I use Llama Stack with non-Llama models? A: Llama Stack is designed for Llama models, but inference providers like Ollama and vLLM can serve other models through the same API.


🙏

Source et remerciements

Created by Meta. Licensed under MIT. meta-llama/llama-stack — 8,300+ GitHub stars

Discussion

Connectez-vous pour rejoindre la discussion.
Aucun commentaire pour l'instant. Soyez le premier à partager votre avis.

Actifs similaires