Practical Notes
- Quant: start with 1 UI task (login/search/export) and measure success rate over 10 runs.
- Quant: track setup time (Node + provider key) and typical run latency per task.
How to use it safely
Desktop automation is powerful and risky. Treat Agent TARS like a privileged robot:
- Start in a disposable test account and a non-production environment.
- Make the agent produce observable checkpoints (e.g., screenshots, extracted text, saved files) after each major step.
- Keep a narrow allowlist of domains/apps, and require confirmation before any destructive action.
Where it shines
Use it when the UI is the API: internal admin panels, vendor dashboards, and legacy tools that have no stable endpoint.
If you already have an API, prefer API automation first and use UI automation only for gaps.
FAQ
Q: Do I need a specific model provider? A: No. The CLI supports multiple providers; pick one you already use and budget for.
Q: How do I keep runs reproducible? A: Pin the prompt/plan, log each step, and replay the same task 10 times before expanding scope.
Q: When should I avoid UI automation? A: Avoid it for critical, irreversible actions in production; use APIs or add human confirmation gates.