Release Notes

v0.3.0 (2026-03-26)

Simplification release: removed features that added complexity without improving agent performance, streamlined the sprint workflow.

Removed Features

Playbook: Rule injection into agent prompts did not improve code quality — context noise outweighed benefits. All playbook rules, enforcement levels, rule evaluations, and Defense Report removed.
Skills: Task-level tag injection into prompts had no measurable effect.
Secrets: Encrypted environment variable injection was never used.
Goals: OKR management and goal evaluation were unused.
Recurring Tasks: Periodic task auto-generation was unused.
Review Diffs: Diff recording between reject/approve was never populated.
Strategist role: Strategy, content, research, and story-decompose templates removed. Remaining roles: Manager, Builder. Remaining templates: implementation, reviewer.

Story Workflow

Direct task creation: Stories no longer trigger AI decomposition. Creating a Story auto-creates a Builder task. Additional tasks can be added manually.
Enrich is optional: Tasks no longer require acceptance criteria or steps before being dispatched. Agents read the codebase and decide what to do.

Fixes

Audit Log: Fixed authentication check — user sessions (user: prefix) now correctly allowed.
Story delete: Fixed FK constraint error by unlinking tasks before deletion.
DELETE proxy: Fixed empty body forwarding that caused 500 errors.
All new workspaces default to Pro plan.

Documentation

Public docs updated to reflect current feature set (architecture, getting-started, API reference, glossary, pricing, setup guide).
CLI README updated (removed Manager LLM references, updated pipeline flow).

E2E Tests

Updated for removed features (Goals, Playbook). 13 tests pass, 2 gracefully skipped.

Known Issues (resolved from v0.2.0)

~~Defense Report page~~ → removed
~~Playbook rule management~~ → removed
~~Strategist proposals fetch~~ → removed

v0.2.0 (2026-03-21)

Major update: Product Goals, onboarding wizard, multi-agent orchestration, and review intelligence.

Product Goals

Goals & Key Results: Set product goals, track key results, and link them to sprints. Goal status updates automatically based on sprint outcomes.
Strategy Log: Record strategic decisions from chat or retrospectives. Dedicated Strategy tab on the dashboard.
Agents are goal-aware: Active goals are included in agent context, so tasks align with your current priorities.

Onboarding

toban init: Interactive setup wizard — select repos, configure agents, and connect GitHub in one command.
Browser-based auth: GitHub login opens in your browser — no token copy-paste needed.
Auto-review on push: Setup installs a git hook that runs toban review on every push.

Review Intelligence

Diff analysis: Tracks what code changes turn a rejection into an approval, surfacing improvement patterns over time.
Defense report: Monthly summary of issues caught by Playbook — by category, by rule, with 3-month trends.
Public metrics: Share anonymized quality metrics at /public/metrics (no login required).

Multi-Agent Orchestration (CLI v0.10.0)

Parallel builders: Run multiple builders at the same time. Git conflicts are handled automatically with rebase and escalation.
Task auto-split: Large tasks are automatically broken into subtasks by AI.
Dependency-aware assignment: Tasks are dispatched in the right order based on dependencies.
Context sharing: Each builder's changes are passed to the next, reducing redundant work.
Stall detection: Idle agents are detected and cleaned up automatically.

Agent Concurrency Settings

Dashboard UI: Settings > Agents — adjust how many builders and cloud engineers run in parallel.

Bug Fixes

Sprint phase buttons now display correctly (e.g., "Start Sprint" in planning phase).
Workspace switching properly resets all state.
Reviewer falls back to general code quality review when no matching task is found.
Retrospective data parsing is more resilient to malformed output.
Various stability improvements for task updates and agent coordination.

Known Issues (remaining)

Individual use only — team/multi-user features not available.
GitHub App org installation may not appear in Setup (workaround: set installation_id manually).

v0.1.1 (2026-03-20)

Reviewer Agent

Independent Reviewer process: Reviewer spawns as a separate Claude Code agent (10 turns, 5min timeout) instead of single-turn --print call. Can read files, run tests, and analyze diffs in context.
Task-type specific review: Review criteria vary by task type (implementation, content, infra, research, strategy, bug, chore).
Customizable review rules: Reviewer prompt externalized to prompts/templates.ts. Project-specific rules injected from Playbook.
Test execution in review: Reviewer runs npm test before evaluation. Failed tests = automatic NEEDS_CHANGES.
Reviewer Playbook rules: 5 default locked/overridable rules (scope check, test requirement, metadata-only rejection, etc.)
Review scoring: Each review produces a 0-100 quality score.
Verdict badges: Task cards show Approved (green), Changes (amber), Blocked (red) with hover tooltip.

Knowledge & Learning

Skills → Playbook integration: Skills moved from hardcoded CLI to Playbook rules (category="skill"). Tag-based matching — only injected when task labels match rule tags.
Shared Agent Memory: Agents can share knowledge across the team via shared: true in memory frontmatter. Tag-based pinpoint injection.
Knowledge management page: Settings > Knowledge with Shared Memories (click-to-expand) + Skills tabs.
Failure Database: Auto-populated on task reject. Past failures injected into agent prompts to prevent recurring issues. Japanese text normalize, retry noise dedup.
Team Knowledge: POST /failures/analyze groups failure patterns and suggests Playbook rules. One-click apply to Playbook.
Previous Review injection: Retry tasks receive prior review feedback in prompt.

Sprint Planning (Strategist)

toban plan: Strategist agent analyzes backlog, velocity, quality trends, and failure patterns to generate sprint plans via local Claude CLI.
toban propose: Strategist analyzes data sources and suggests improvement tasks. Auto-runs when sprint enters retrospective.
Proposals UI: Improvement proposals displayed in retrospective view with approve (→ Sprint / → Backlog) and reject (with reason) buttons.
Sprint Plan API: CRUD endpoints for plans with approve/reject workflow.
Proposals API: CRUD + batch create + approve with sprint target + reject with reason.
Analytics in Manager context: Velocity and quality trends injected into Manager prompt.

Sprint Flow

Sprint transition guards: Active → Review blocked if in_progress tasks remain. Review → Retro blocked if review/in_progress/todo tasks remain. Retro → Complete blocked while Strategist is generating proposals.
Retrospective auto-proposals: Entering retrospective triggers Strategist to analyze and generate improvement proposals.
Review → Retro required: Removed "Next Sprint" shortcut from Review phase. Must go through Retrospective.
Phase error display: Sprint phase transition failures shown to user (was silently swallowed).
CLI active-only task pickup: Agents only pick up tasks during active phase, not during review/retrospective.

Quality & Data

Quality Score API: GET /sprints/:number/quality and GET /analytics/quality with first-pass approve rate, avg review score, and per-agent breakdown.
Quality Score UI: Analytics page trend chart + Sprint header badge.
Failure DB management page: Filter by agent/type, expandable detail rows, pattern analysis UI.
Recurring Tasks page: CRUD with frequency options, enabled/disabled toggle.
Tasks Skills field: Skills array on tasks and recurring tasks, UI editing in task modal.

Agent Reliability

NEEDS_CHANGES → todo: Tasks with NEEDS_CHANGES verdict automatically reset to todo.
Retry limit: After 3 failed attempts, task moves to review with "Blocked" comment for human review.
allow_no_commit_completion: Template flag — agents that complete without code changes go to review instead of infinite loop.
Stream result fallback: Extracts completion from stream-json result when COMPLETION_JSON is not explicitly output.
Per-task execution logger: ~/.toban/logs/tasks/{taskId}.jsonl for debugging.
Description pre-check: Tasks with descriptions under 20 characters are auto-blocked.
Context budget: 30K token limit with priority-ordered section trimming.
Auto-split: Tasks with SP >= 5 are automatically split by LLM into subtasks.

Templates

Content template: For documentation tasks (docs-only file restrictions).
Strategy template: For research/planning tasks (read-only + web search).
Reviewer template: For code review (read-only + test execution).

Setup & Configuration

4 required agents: Setup automatically creates Manager, Builder, Reviewer, Strategist. Free plan limit raised to 4.
Setup overwrite protection: Re-running setup on a configured project returns 409.
Dynamic model selection: Agent engine setting resolved to model ID per role. Builder defaults to Opus 4.6, Reviewer/Strategist to Sonnet 4.6.
Workspace switching: Cookie and localStorage properly cleared on project switch.

Infrastructure

API proxy removal (production): Browser communicates directly with api.toban.dev.
Agent API key expiry: 24-hour expiry with sliding window.
Git credential caching: 50-minute token cache.
blocked status: Added to task status enum.
Playbook tags: Tag-based skill rule matching.
Sprint plans + Task proposals tables: Persistent storage for Strategist outputs.
Origin header forwarding: Proxy forwards Origin for correct cookie domain on localhost.

Known Issues (fixed from v0.1.0)

~~Agent API keys do not expire~~ → 24h expiry with sliding window
~~Review timeout on large diffs~~ → Reviewer agent with 5min timeout

Known Issues (remaining)

Individual use only — team/multi-user features not available
GitHub App org installation may not appear in Setup (workaround: set installation_id manually)
Manager chat is unreliable — use toban plan / toban propose CLI commands instead
API 500 on D1 writes: Task updates and review-report may intermittently return 500. CLI retries automatically (2 attempts with backoff), but root cause needs investigation
Reviewer max-turns: Reviewer may hit 5-turn limit on no-commit verification tasks. Increase planned
Strategist proposals fetch: toban propose may fail to save proposals due to execSync blocking Node.js event loop. Proposals display in CLI but may not persist to DB
engine/model column: DB agents.engine mixes provider types (claude-code) and model names (claude-opus). CLI has PROVIDER_ENGINE_IDS workaround
Setup duplicate tasks: Initial Sprint #0 may generate duplicate tasks via Manager or RETRO suggested_tasks
RETRO_JSON parse: Agent may append text after RETRO_JSON, causing parse failure
localStorage leakage: Workspace-scoped data (terminal, proposals) not prefixed with workspace ID — cleared on switch but not preserved per project
Sprint #0 planning status: Newly created projects start with planning status. UI now shows "Start Sprint" button correctly

v0.1.0 (2026-03-19)

Initial public release.

Features

Sprint Management: Kanban board with drag-and-drop, sprint phases (active/review/retrospective/completed)
Manager AI: LLM-powered task proposals, sprint coordination, interactive proposal cards
Builder Agent: Automated code implementation in git worktrees, trunk-based development
Auto Code Review: LLM-generated structured review (requirement match, code quality, test coverage, risks, verdict)
Real-time Dashboard: WebSocket-powered live updates for tasks, agents, and sprint state
Sprint Analytics: Velocity, burndown, hourly activity, cumulative charts
Playbook: Configurable agent rules (locked/overridable/recommended), per-agent targeting
ADR System: Architecture Decision Records injected into agent prompts
Project Spec: Vision, roadmap, tech stack — feeds into agent context
Multi-repo Support: Agents can work across multiple repositories
GitHub App Auth: OAuth login, installation token for git operations
CLI: npx toban-cli@latest start — connects to dashboard, spawns agents locally

Known Issues

Individual use only — team/multi-user features are not yet available
GitHub App installed on an organization may not appear in Setup repo selection (personal account only)
Project deletion and re-creation may leave stale data in some edge cases
Claude Code CLI must be installed locally to run agents (no cloud execution yet)
Review timeout on large diffs (workaround: keep tasks small)
Agent API keys do not currently expire (fix planned)

Release Notes

v0.3.0 (2026-03-26)

Simplification release: removed features that added complexity without improving agent performance, streamlined the sprint workflow.

Removed Features

Playbook: Rule injection into agent prompts did not improve code quality — context noise outweighed benefits. All playbook rules, enforcement levels, rule evaluations, and Defense Report removed.
Skills: Task-level tag injection into prompts had no measurable effect.
Secrets: Encrypted environment variable injection was never used.
Goals: OKR management and goal evaluation were unused.
Recurring Tasks: Periodic task auto-generation was unused.
Review Diffs: Diff recording between reject/approve was never populated.
Strategist role: Strategy, content, research, and story-decompose templates removed. Remaining roles: Manager, Builder. Remaining templates: implementation, reviewer.

Story Workflow

Direct task creation: Stories no longer trigger AI decomposition. Creating a Story auto-creates a Builder task. Additional tasks can be added manually.
Enrich is optional: Tasks no longer require acceptance criteria or steps before being dispatched. Agents read the codebase and decide what to do.

Fixes

Audit Log: Fixed authentication check — user sessions (user: prefix) now correctly allowed.
Story delete: Fixed FK constraint error by unlinking tasks before deletion.
DELETE proxy: Fixed empty body forwarding that caused 500 errors.
All new workspaces default to Pro plan.

Documentation

Public docs updated to reflect current feature set (architecture, getting-started, API reference, glossary, pricing, setup guide).
CLI README updated (removed Manager LLM references, updated pipeline flow).

E2E Tests

Updated for removed features (Goals, Playbook). 13 tests pass, 2 gracefully skipped.

Known Issues (resolved from v0.2.0)

~~Defense Report page~~ → removed
~~Playbook rule management~~ → removed
~~Strategist proposals fetch~~ → removed

v0.2.0 (2026-03-21)

Major update: Product Goals, onboarding wizard, multi-agent orchestration, and review intelligence.

Product Goals

Goals & Key Results: Set product goals, track key results, and link them to sprints. Goal status updates automatically based on sprint outcomes.
Strategy Log: Record strategic decisions from chat or retrospectives. Dedicated Strategy tab on the dashboard.
Agents are goal-aware: Active goals are included in agent context, so tasks align with your current priorities.

Onboarding

toban init: Interactive setup wizard — select repos, configure agents, and connect GitHub in one command.
Browser-based auth: GitHub login opens in your browser — no token copy-paste needed.
Auto-review on push: Setup installs a git hook that runs toban review on every push.

Review Intelligence

Diff analysis: Tracks what code changes turn a rejection into an approval, surfacing improvement patterns over time.
Defense report: Monthly summary of issues caught by Playbook — by category, by rule, with 3-month trends.
Public metrics: Share anonymized quality metrics at /public/metrics (no login required).

Multi-Agent Orchestration (CLI v0.10.0)

Parallel builders: Run multiple builders at the same time. Git conflicts are handled automatically with rebase and escalation.
Task auto-split: Large tasks are automatically broken into subtasks by AI.
Dependency-aware assignment: Tasks are dispatched in the right order based on dependencies.
Context sharing: Each builder's changes are passed to the next, reducing redundant work.
Stall detection: Idle agents are detected and cleaned up automatically.

Agent Concurrency Settings

Dashboard UI: Settings > Agents — adjust how many builders and cloud engineers run in parallel.

Bug Fixes

Sprint phase buttons now display correctly (e.g., "Start Sprint" in planning phase).
Workspace switching properly resets all state.
Reviewer falls back to general code quality review when no matching task is found.
Retrospective data parsing is more resilient to malformed output.
Various stability improvements for task updates and agent coordination.

Known Issues (remaining)

Individual use only — team/multi-user features not available.
GitHub App org installation may not appear in Setup (workaround: set installation_id manually).

v0.1.1 (2026-03-20)

Reviewer Agent

Independent Reviewer process: Reviewer spawns as a separate Claude Code agent (10 turns, 5min timeout) instead of single-turn --print call. Can read files, run tests, and analyze diffs in context.
Task-type specific review: Review criteria vary by task type (implementation, content, infra, research, strategy, bug, chore).
Customizable review rules: Reviewer prompt externalized to prompts/templates.ts. Project-specific rules injected from Playbook.
Test execution in review: Reviewer runs npm test before evaluation. Failed tests = automatic NEEDS_CHANGES.
Reviewer Playbook rules: 5 default locked/overridable rules (scope check, test requirement, metadata-only rejection, etc.)
Review scoring: Each review produces a 0-100 quality score.
Verdict badges: Task cards show Approved (green), Changes (amber), Blocked (red) with hover tooltip.

Knowledge & Learning

Skills → Playbook integration: Skills moved from hardcoded CLI to Playbook rules (category="skill"). Tag-based matching — only injected when task labels match rule tags.
Shared Agent Memory: Agents can share knowledge across the team via shared: true in memory frontmatter. Tag-based pinpoint injection.
Knowledge management page: Settings > Knowledge with Shared Memories (click-to-expand) + Skills tabs.
Failure Database: Auto-populated on task reject. Past failures injected into agent prompts to prevent recurring issues. Japanese text normalize, retry noise dedup.
Team Knowledge: POST /failures/analyze groups failure patterns and suggests Playbook rules. One-click apply to Playbook.
Previous Review injection: Retry tasks receive prior review feedback in prompt.

Sprint Planning (Strategist)

toban plan: Strategist agent analyzes backlog, velocity, quality trends, and failure patterns to generate sprint plans via local Claude CLI.
toban propose: Strategist analyzes data sources and suggests improvement tasks. Auto-runs when sprint enters retrospective.
Proposals UI: Improvement proposals displayed in retrospective view with approve (→ Sprint / → Backlog) and reject (with reason) buttons.
Sprint Plan API: CRUD endpoints for plans with approve/reject workflow.
Proposals API: CRUD + batch create + approve with sprint target + reject with reason.
Analytics in Manager context: Velocity and quality trends injected into Manager prompt.

Sprint Flow

Sprint transition guards: Active → Review blocked if in_progress tasks remain. Review → Retro blocked if review/in_progress/todo tasks remain. Retro → Complete blocked while Strategist is generating proposals.
Retrospective auto-proposals: Entering retrospective triggers Strategist to analyze and generate improvement proposals.
Review → Retro required: Removed "Next Sprint" shortcut from Review phase. Must go through Retrospective.
Phase error display: Sprint phase transition failures shown to user (was silently swallowed).
CLI active-only task pickup: Agents only pick up tasks during active phase, not during review/retrospective.

Quality & Data

Quality Score API: GET /sprints/:number/quality and GET /analytics/quality with first-pass approve rate, avg review score, and per-agent breakdown.
Quality Score UI: Analytics page trend chart + Sprint header badge.
Failure DB management page: Filter by agent/type, expandable detail rows, pattern analysis UI.
Recurring Tasks page: CRUD with frequency options, enabled/disabled toggle.
Tasks Skills field: Skills array on tasks and recurring tasks, UI editing in task modal.

Agent Reliability

NEEDS_CHANGES → todo: Tasks with NEEDS_CHANGES verdict automatically reset to todo.
Retry limit: After 3 failed attempts, task moves to review with "Blocked" comment for human review.
allow_no_commit_completion: Template flag — agents that complete without code changes go to review instead of infinite loop.
Stream result fallback: Extracts completion from stream-json result when COMPLETION_JSON is not explicitly output.
Per-task execution logger: ~/.toban/logs/tasks/{taskId}.jsonl for debugging.
Description pre-check: Tasks with descriptions under 20 characters are auto-blocked.
Context budget: 30K token limit with priority-ordered section trimming.
Auto-split: Tasks with SP >= 5 are automatically split by LLM into subtasks.

Templates

Content template: For documentation tasks (docs-only file restrictions).
Strategy template: For research/planning tasks (read-only + web search).
Reviewer template: For code review (read-only + test execution).

Setup & Configuration

4 required agents: Setup automatically creates Manager, Builder, Reviewer, Strategist. Free plan limit raised to 4.
Setup overwrite protection: Re-running setup on a configured project returns 409.
Dynamic model selection: Agent engine setting resolved to model ID per role. Builder defaults to Opus 4.6, Reviewer/Strategist to Sonnet 4.6.
Workspace switching: Cookie and localStorage properly cleared on project switch.

Infrastructure

API proxy removal (production): Browser communicates directly with api.toban.dev.
Agent API key expiry: 24-hour expiry with sliding window.
Git credential caching: 50-minute token cache.
blocked status: Added to task status enum.
Playbook tags: Tag-based skill rule matching.
Sprint plans + Task proposals tables: Persistent storage for Strategist outputs.
Origin header forwarding: Proxy forwards Origin for correct cookie domain on localhost.

Known Issues (fixed from v0.1.0)

~~Agent API keys do not expire~~ → 24h expiry with sliding window
~~Review timeout on large diffs~~ → Reviewer agent with 5min timeout

Known Issues (remaining)

Individual use only — team/multi-user features not available
GitHub App org installation may not appear in Setup (workaround: set installation_id manually)
Manager chat is unreliable — use toban plan / toban propose CLI commands instead
API 500 on D1 writes: Task updates and review-report may intermittently return 500. CLI retries automatically (2 attempts with backoff), but root cause needs investigation
Reviewer max-turns: Reviewer may hit 5-turn limit on no-commit verification tasks. Increase planned
Strategist proposals fetch: toban propose may fail to save proposals due to execSync blocking Node.js event loop. Proposals display in CLI but may not persist to DB
engine/model column: DB agents.engine mixes provider types (claude-code) and model names (claude-opus). CLI has PROVIDER_ENGINE_IDS workaround
Setup duplicate tasks: Initial Sprint #0 may generate duplicate tasks via Manager or RETRO suggested_tasks
RETRO_JSON parse: Agent may append text after RETRO_JSON, causing parse failure
localStorage leakage: Workspace-scoped data (terminal, proposals) not prefixed with workspace ID — cleared on switch but not preserved per project
Sprint #0 planning status: Newly created projects start with planning status. UI now shows "Start Sprint" button correctly

v0.1.0 (2026-03-19)

Initial public release.

Features

Sprint Management: Kanban board with drag-and-drop, sprint phases (active/review/retrospective/completed)
Manager AI: LLM-powered task proposals, sprint coordination, interactive proposal cards
Builder Agent: Automated code implementation in git worktrees, trunk-based development
Auto Code Review: LLM-generated structured review (requirement match, code quality, test coverage, risks, verdict)
Real-time Dashboard: WebSocket-powered live updates for tasks, agents, and sprint state
Sprint Analytics: Velocity, burndown, hourly activity, cumulative charts
Playbook: Configurable agent rules (locked/overridable/recommended), per-agent targeting
ADR System: Architecture Decision Records injected into agent prompts
Project Spec: Vision, roadmap, tech stack — feeds into agent context
Multi-repo Support: Agents can work across multiple repositories
GitHub App Auth: OAuth login, installation token for git operations
CLI: npx toban-cli@latest start — connects to dashboard, spawns agents locally

Known Issues

Individual use only — team/multi-user features are not yet available
GitHub App installed on an organization may not appear in Setup repo selection (personal account only)
Project deletion and re-creation may leave stale data in some edge cases
Claude Code CLI must be installed locally to run agents (no cloud execution yet)
Review timeout on large diffs (workaround: keep tasks small)
Agent API keys do not currently expire (fix planned)