Cette page est affichée en anglais. Une traduction française est en cours.
SkillsMay 11, 2026·2 min de lecture

modal-examples — Serverless LLM Jobs on Modal

Learn production patterns for serverless jobs (LLM inference, data pipelines) using Modal’s official examples. Run one and adapt it to your workload.

Prêt pour agents

Cet actif peut être lu et installé directement par les agents

TokRepo expose une commande CLI universelle, un contrat d'installation, le metadata JSON, un plan selon l'adaptateur et le contenu raw pour aider les agents à juger l'adaptation, le risque et les prochaines actions.

Native · 98/100Policy : autoriser
Surface agent
Tout agent MCP/CLI
Type
Skill
Installation
Single
Confiance
Confiance : Established
Point d'entrée
README.md
Commande CLI universelle
npx tokrepo install 154d3162-9681-440c-b4d9-825b073b04a5
Introduction

Learn production patterns for serverless jobs (LLM inference, data pipelines) using Modal’s official examples. Run one and adapt it to your workload.

  • Best for: Developers who want a quick, example-driven path to run LLM workloads as serverless jobs
  • Works with: Python, Modal CLI, cloud execution with local development loop (per README)
  • Setup time: 12 minutes

Quantitative Notes

  • Setup time ~12 minutes (install + auth + run one example)
  • GitHub stars + forks (verified): see Source & Thanks
  • Examples are organized into multiple folders; start with 1 file before scaling up

Practical Notes

Treat examples as templates: fork one that matches your workload (batch, web endpoint, GPU inference), replace the core function with your model/tool call, then add logging and retries. Keep a local dev loop with a tiny input set so iteration stays fast.

Safety note: Treat secrets carefully: store API keys in env/secret managers and avoid printing them in logs.

FAQ

Q: Do I need an account? A: Yes. The README instructs you to sign up and set an API key for the Modal CLI.

Q: Can I run LLM inference? A: Many examples demonstrate patterns you can adapt to inference and data workloads; follow the repo structure.

Q: How do I keep costs predictable? A: Pin resources, set concurrency limits, and use small test runs before scaling.


🙏

Source et remerciements

GitHub: https://github.com/modal-labs/modal-examples Owner avatar: https://avatars.githubusercontent.com/u/88658467?v=4 License (SPDX): MIT GitHub stars (verified via api.github.com/repos/modal-labs/modal-examples): 1,189 GitHub forks (verified via api.github.com/repos/modal-labs/modal-examples): 288

Fil de discussion

Connectez-vous pour rejoindre la discussion.
Aucun commentaire pour l'instant. Soyez le premier à partager votre avis.

Actifs similaires