Skip to content

Deep Context & Ultra-Economy Settings

ModeToken budgetCompressionNotes
plan32 000standardOne-shot, structured JSON output
agent32 000standard (configurable)Interactive; approval gates
full128 000offFull context — no compression
exec32 000aggressiveHarness loop; maximise iterations

full mode uses a 4× larger budget and disables all compression, trading token cost for maximum fidelity. It is the right choice for complex multi-file tasks where losing context mid-turn would require backtracking.

The default compression level for agent/plan modes is read from ~/.umbra/settings.json:

{
"compression": {
"level": "standard"
}
}

Valid values: off / lite / standard / aggressive / ultra. The exec harness always uses aggressive regardless of this setting.

Set via launch flag --mode when starting Umbra, or switch mid-session using /agent, /full, or /plan in the TUI. The active mode is shown in the session status bar. Persisted across launches via --mode flag or runtime-preferences.json → defaultMode.