VOLY —
a flock of agents

Control plane for AI coding agents: multi-agent orchestration, a billing fallback chain, and cost control. Cheaper, more reliable, measurable.

GitHub
9+
model providers
7
executors
5
roles in the flock
$0
re-run on cache hit

Orchestration

A lead orchestrator splits the task into roles and assigns each a model and skills. Sub-agents run by dependencies.

Fallback

Billing fallback chain: claude-code → wrangler → opencode → zen. Out of credits — the task doesn't fail.

Telemetry

Tokens, cost, and cache hits for every run. Spend limits, cost policy, and a savings report.

Open source

Apache 2.0, Python 3.10+. Project-agnostic: the target project is passed via --cwd.

How a task flies

One entry point — CLI, Web UI, or CI. From there, the route depends on task complexity.

voly run "implement auth refactor" --cwd /path/to/project
complex · ≥2 capabilities

Multi-agent · A2A

The lead orchestrator assigns each role a model tier and skills. The flock flies by dependencies, passing results down the chain.

simple code task

Executor · fallback chain

claude-code Anthropic ✗ billing
wrangler Workers AI ✗ billing
opencode free models ✗ billing
zen free ✓ running

Out of billing — the next executor in the chain picks up the task. Every one of them writes files into --cwd.

AIGateway.chat() the only exit to models
DLP Cache Rate limit Spend limit Upstream · OmniRoute Provider Telemetry

A live dashboard

Every run unfolds into pipeline stages: routing, memory, skills, the model call — with tokens and cost at each step. Multi-agent assignments show which role ran on which model and what it cost.

VOLY Web UI: task list, pipeline stages, cost, and multi-agent role assignments
Web UI: a task broken into stages; on the right — sub-agents with the model tier and the cost of every role.

Executors

Who runs the task and what happens when billing runs out.

VOLY compresses providers and agents into one control plane
One press for all: providers and agents under VOLY's control.
ExecutorWrites filesBillingIn the chain
claude-codeyesAnthropic1st
wrangleryesCF Workers AI2nd
opencodeyesfree models → your own keys3rd
zenyesfree4th (last resort)
cursoryesCursorstandalone
deepseek / mimono — text onlyAPIoutside the chain

Savings you can see

VOLY counts every token: caching, routing, and fallback turn into concrete numbers, not gut feelings.

Billing fallback chain: paid → free, the task never stops
voly savings
spent today $4.20 / $20.00 limit
cache hits 62% → saved $6.85
model routing premium → cheap: $3.10
task re-runs 8 tasks from cache: $0.00
total this week saved $41.70

* example output — numbers depend on your tasks

Quick start

A full control plane running locally in a couple of minutes. Requires Python 3.10+ and API keys in .env.

git clone https://github.com/voly-codes/voly.git && cd voly
python3 -m venv .venv && source .venv/bin/activate
pip install -e ".[ui,dev]"
voly init && voly ui
# then — the dashboard at localhost:7788

Higher together

Build, run, and scale your AI agents with VOLY.

Learn more