Usage Tracking

Agents mode distributes work across multiple workers, which means token usage is also distributed. Multi gives you visibility into consumption at every level.

Click the usage icon at the top of the window to open the usage panel. In agents mode, the panel breaks token consumption down by agent.

Each subagent’s usage is tracked independently. When viewing a specific subagent, the header displays:

  • Input tokens - Context sent to the subagent
  • Output tokens - Responses generated by the subagent
  • Total tokens - Combined consumption for that worker

This helps you understand which subagents are doing the heavy lifting and where tokens are being spent.

The main usage panel shows aggregate statistics:

  • Supervisor agent - Tokens used by the main coordinating agent
  • All subagents - Combined total across every spawned worker
  • Task total - Complete token consumption for the entire task
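
To make the relationship between these figures concrete, here is a minimal sketch of how the per-agent and aggregate numbers relate. The `Usage` class, the `summarize` function, the worker names, and all token counts are hypothetical and chosen for illustration; they are not part of Multi's API, and the panel computes these values for you.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Usage:
    """Token counts for one agent (hypothetical structure, not Multi's API)."""
    input_tokens: int   # context sent to the agent
    output_tokens: int  # responses generated by the agent

    @property
    def total_tokens(self) -> int:
        # Total tokens = input tokens + output tokens for this worker.
        return self.input_tokens + self.output_tokens


def summarize(supervisor: Usage, subagents: dict[str, Usage]) -> dict[str, int]:
    """Return the three aggregate figures: supervisor, all subagents, task total."""
    all_subagents = sum(u.total_tokens for u in subagents.values())
    return {
        "supervisor": supervisor.total_tokens,
        "all_subagents": all_subagents,
        "task_total": supervisor.total_tokens + all_subagents,
    }


# Illustrative numbers only.
supervisor = Usage(input_tokens=12_000, output_tokens=3_000)
subagents = {
    "worker-1": Usage(input_tokens=40_000, output_tokens=8_000),
    "worker-2": Usage(input_tokens=25_000, output_tokens=5_000),
}

summary = summarize(supervisor, subagents)

# The subagent with the largest total is the one doing the heavy lifting.
heaviest = max(subagents, key=lambda name: subagents[name].total_tokens)
```

With these sample numbers, the task total is the supervisor's 15,000 tokens plus the subagents' combined 78,000 tokens, and `worker-1` is the largest consumer.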

Agent-based tasks typically consume more tokens than single-agent tasks because:

  1. Coordination overhead - The supervisor spends tokens planning and delegating
  2. Context duplication - Each subagent receives relevant context for its assignment
  3. Reporting - Subagents summarize their work back to the supervisor

This overhead is the tradeoff for being able to handle larger, more complex tasks.
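
The three sources of overhead listed above can be put into a rough back-of-the-envelope model. This is only a sketch under simplifying assumptions: the function name, its parameters, and the sample numbers are hypothetical, and real overhead depends on the task and on how the supervisor chooses to delegate.

```python
def estimate_agents_mode_tokens(
    work_tokens: int,
    n_subagents: int,
    planning_tokens: int,
    context_per_subagent: int,
    report_per_subagent: int,
) -> dict[str, int]:
    """Rough model of agents-mode overhead (illustrative, not Multi's accounting)."""
    coordination = planning_tokens                    # supervisor plans and delegates
    duplication = n_subagents * context_per_subagent  # each subagent receives context
    reporting = n_subagents * report_per_subagent     # each subagent reports back
    overhead = coordination + duplication + reporting
    return {"overhead": overhead, "total": work_tokens + overhead}


# Illustrative numbers only: 4 subagents sharing 50,000 tokens of actual work.
estimate = estimate_agents_mode_tokens(
    work_tokens=50_000,
    n_subagents=4,
    planning_tokens=5_000,
    context_per_subagent=3_000,
    report_per_subagent=1_000,
)
```

In this example the overhead is 21,000 tokens on top of 50,000 tokens of work. Note that two of the three terms grow with the number of subagents, which is why small tasks rarely justify delegation.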

A few strategies to keep token consumption reasonable:

Strategy                          | Effect
----------------------------------|--------------------------------------------
Use agents only for complex tasks | Avoid coordination overhead on simple work
Clear, specific prompts           | Reduces back-and-forth and clarification
Appropriate model selection       | Match model capability to task complexity

The usage panel helps you make informed decisions about:

  • Whether a task justified agents mode
  • How to estimate costs for similar future tasks
  • Which types of work benefit most from delegation
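
For cost estimation, one simple approach is to multiply the input and output token counts from the usage panel by your model's per-token prices. The sketch below assumes prices are quoted per million tokens; the function name and the prices shown are placeholders, not actual rates, so substitute the figures from your provider.

```python
def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    price_per_million_input: float,
    price_per_million_output: float,
) -> float:
    """Estimate the cost of a task from its token counts (hypothetical helper)."""
    input_cost = input_tokens / 1_000_000 * price_per_million_input
    output_cost = output_tokens / 1_000_000 * price_per_million_output
    return input_cost + output_cost


# Placeholder prices and token counts, for illustration only.
cost = estimate_cost(
    input_tokens=77_000,
    output_tokens=16_000,
    price_per_million_input=3.00,
    price_per_million_output=15.00,
)
print(f"Estimated cost: {cost:.3f}")
```

Recording the estimate alongside a short description of each task gives you a baseline for judging similar tasks later.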

Over time, you’ll develop intuition for when agents mode delivers value versus when it’s overkill.