Esta página se muestra en inglés. Una traducción al español está en curso.
SkillsMay 11, 2026·2 min de lectura

modal-examples — Serverless LLM Jobs on Modal

Learn production patterns for serverless jobs (LLM inference, data pipelines) using Modal’s official examples. Run one and adapt it to your workload.

Listo para agents

Este activo puede ser leído e instalado directamente por agents

TokRepo expone un comando CLI universal, contrato de instalación, metadata JSON, plan según adaptador y contenido raw para que los agents evalúen compatibilidad, riesgo y próximos pasos.

Native · 98/100Política: permitir
Superficie agent
Cualquier agent MCP/CLI
Tipo
Skill
Instalación
Single
Confianza
Confianza: Established
Entrada
README.md
Comando CLI universal
npx tokrepo install 154d3162-9681-440c-b4d9-825b073b04a5
Introducción

Learn production patterns for serverless jobs (LLM inference, data pipelines) using Modal’s official examples. Run one and adapt it to your workload.

  • Best for: Developers who want a quick, example-driven path to run LLM workloads as serverless jobs
  • Works with: Python, Modal CLI, cloud execution with local development loop (per README)
  • Setup time: 12 minutes

Quantitative Notes

  • Setup time ~12 minutes (install + auth + run one example)
  • GitHub stars + forks (verified): see Source & Thanks
  • Examples are organized into multiple folders; start with 1 file before scaling up

Practical Notes

Treat examples as templates: fork one that matches your workload (batch, web endpoint, GPU inference), replace the core function with your model/tool call, then add logging and retries. Keep a local dev loop with a tiny input set so iteration stays fast.

Safety note: Treat secrets carefully: store API keys in env/secret managers and avoid printing them in logs.

FAQ

Q: Do I need an account? A: Yes. The README instructs you to sign up and set an API key for the Modal CLI.

Q: Can I run LLM inference? A: Many examples demonstrate patterns you can adapt to inference and data workloads; follow the repo structure.

Q: How do I keep costs predictable? A: Pin resources, set concurrency limits, and use small test runs before scaling.


🙏

Fuente y agradecimientos

GitHub: https://github.com/modal-labs/modal-examples Owner avatar: https://avatars.githubusercontent.com/u/88658467?v=4 License (SPDX): MIT GitHub stars (verified via api.github.com/repos/modal-labs/modal-examples): 1,189 GitHub forks (verified via api.github.com/repos/modal-labs/modal-examples): 288

Discusión

Inicia sesión para unirte a la discusión.
Aún no hay comentarios. Sé el primero en compartir tus ideas.

Activos relacionados