You’ve built an environment with tools and scenarios. Deploy it to the platform and you can run evals at scale—hundreds of parallel runs across models, all traced, all generating training data.Documentation Index
Fetch the complete documentation index at: https://hud-f5fd7c15-parallel-agent-telemetry.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Deploying Environments
Start withhud init (see Environments) to scaffold locally. When ready:
- Go to hud.ai → New → Environment
- Connect your GitHub repo and name your environment
- Push changes and it rebuilds automatically, like Vercel
Running at Scale
Once deployed, create evals on hud.ai from your scenarios. Each eval is a frozen configuration—same prompt, same scoring, every time. Your scenario might take arguments:| Eval Name | Arguments |
|---|---|
checkout-laptop | product_name="Laptop", apply_coupon=False |
checkout-phone-coupon | product_name="Phone", apply_coupon=True |
checkout-headphones | product_name="Headphones", apply_coupon=False |
What’s Next?
With your environment deployed:- Scale: Launch thousands of rollouts. Every run generates traces—prompts, tool calls, rewards.
- Analyze: See which evals agents struggle with. Compare models across your entire benchmark.
- Train: Use runs as training data. Fine-tune on successful completions. Run reinforcement learning to optimize for your specific environment.
Integrations
Connect OpenAI, Anthropic, LangChain, and more.
Sandboxing
Turn production services into safe test environments.