Release Notes
v0.3.0 (2026-03-26)
Simplification release: removed features that added complexity without improving agent performance, streamlined the sprint workflow.
Removed Features
- Playbook: Rule injection into agent prompts did not improve code quality — context noise outweighed benefits. All playbook rules, enforcement levels, rule evaluations, and Defense Report removed.
- Skills: Task-level tag injection into prompts had no measurable effect.
- Secrets: Encrypted environment variable injection was never used.
- Goals: OKR management and goal evaluation were unused.
- Recurring Tasks: Periodic task auto-generation was unused.
- Review Diffs: Diff recording between reject/approve was never populated.
- Strategist role: Strategy, content, research, and story-decompose templates removed. Remaining roles: Manager, Builder. Remaining templates: implementation, reviewer.
Story Workflow
- Direct task creation: Stories no longer trigger AI decomposition. Creating a Story auto-creates a Builder task. Additional tasks can be added manually.
- Enrich is optional: Tasks no longer require acceptance criteria or steps before being dispatched. Agents read the codebase and decide what to do.
Fixes
- Audit Log: Fixed authentication check. User sessions (user: prefix) are now correctly allowed.
- Story delete: Fixed FK constraint error by unlinking tasks before deletion.
- DELETE proxy: Fixed empty body forwarding that caused 500 errors.
- All new workspaces default to Pro plan.
Documentation
- Public docs updated to reflect current feature set (architecture, getting-started, API reference, glossary, pricing, setup guide).
- CLI README updated (removed Manager LLM references, updated pipeline flow).
E2E Tests
- Updated for removed features (Goals, Playbook). 13 tests pass, 2 gracefully skipped.
Known Issues (resolved from v0.2.0)
- ~~Defense Report page~~ → removed
- ~~Playbook rule management~~ → removed
- ~~Strategist proposals fetch~~ → removed
v0.2.0 (2026-03-21)
Major update: Product Goals, onboarding wizard, multi-agent orchestration, and review intelligence.
Product Goals
- Goals & Key Results: Set product goals, track key results, and link them to sprints. Goal status updates automatically based on sprint outcomes.
- Strategy Log: Record strategic decisions from chat or retrospectives. Dedicated Strategy tab on the dashboard.
- Agents are goal-aware: Active goals are included in agent context, so tasks align with your current priorities.
Onboarding
- toban init: Interactive setup wizard. Select repos, configure agents, and connect GitHub in one command.
- Browser-based auth: GitHub login opens in your browser, so no token copy-paste is needed.
- Auto-review on push: Setup installs a git hook that runs toban review on every push.
Review Intelligence
- Diff analysis: Tracks what code changes turn a rejection into an approval, surfacing improvement patterns over time.
- Defense report: Monthly summary of issues caught by Playbook — by category, by rule, with 3-month trends.
- Public metrics: Share anonymized quality metrics at /public/metrics (no login required).
Multi-Agent Orchestration (CLI v0.10.0)
- Parallel builders: Run multiple builders at the same time. Git conflicts are handled automatically with rebase and escalation.
- Task auto-split: Large tasks are automatically broken into subtasks by AI.
- Dependency-aware assignment: Tasks are dispatched in the right order based on dependencies.
- Context sharing: Each builder's changes are passed to the next, reducing redundant work.
- Stall detection: Idle agents are detected and cleaned up automatically.
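As an illustration of dependency-aware assignment, the sketch below orders tasks so that each one is dispatched only after the tasks it depends on. It is a minimal example using Kahn's topological sort, not the actual Toban scheduler; the PlannedTask shape and the dispatchOrder function are hypothetical names introduced here.

```typescript
// Hypothetical task shape; the real Toban schema is not shown in these notes.
interface PlannedTask {
  id: string;
  dependsOn: string[]; // ids of tasks that must finish first
}

// Returns task ids in an order where every task appears after its dependencies
// (Kahn's algorithm). Throws if some tasks can never become ready, which
// happens on a dependency cycle or a dependency on an unknown task id.
function dispatchOrder(tasks: PlannedTask[]): string[] {
  const remaining = new Map<string, number>(); // unmet dependency count per task
  const dependents = new Map<string, string[]>(); // task id -> tasks waiting on it

  for (const task of tasks) {
    remaining.set(task.id, task.dependsOn.length);
    for (const dep of task.dependsOn) {
      const waiting = dependents.get(dep) ?? [];
      waiting.push(task.id);
      dependents.set(dep, waiting);
    }
  }

  // Tasks with no dependencies can be dispatched immediately.
  const ready = tasks.filter((t) => t.dependsOn.length === 0).map((t) => t.id);
  const order: string[] = [];

  while (ready.length > 0) {
    const id = ready.shift()!;
    order.push(id);
    // Finishing this task unblocks the tasks that were waiting on it.
    for (const next of dependents.get(id) ?? []) {
      const left = remaining.get(next)! - 1;
      remaining.set(next, left);
      if (left === 0) ready.push(next);
    }
  }

  if (order.length !== tasks.length) {
    throw new Error("unresolvable dependencies (cycle or unknown task id)");
  }
  return order;
}
```

With parallel builders, every task in the ready queue at a given moment could in principle be handed out at once; the sketch serializes them only to keep the ordering easy to follow.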
Agent Concurrency Settings
- Dashboard UI: Settings > Agents — adjust how many builders and cloud engineers run in parallel.
Bug Fixes
- Sprint phase buttons now display correctly (e.g., "Start Sprint" in planning phase).
- Workspace switching properly resets all state.
- Reviewer falls back to general code quality review when no matching task is found.
- Retrospective data parsing is more resilient to malformed output.
- Various stability improvements for task updates and agent coordination.
Known Issues (remaining)
- Individual use only — team/multi-user features not available.
- GitHub App org installation may not appear in Setup (workaround: set installation_id manually).
v0.1.1 (2026-03-20)
Reviewer Agent
- Independent Reviewer process: Reviewer spawns as a separate Claude Code agent (10 turns, 5min timeout) instead of a single-turn --print call. Can read files, run tests, and analyze diffs in context.
- Task-type specific review: Review criteria vary by task type (implementation, content, infra, research, strategy, bug, chore).
- Customizable review rules: Reviewer prompt externalized to prompts/templates.ts. Project-specific rules injected from Playbook.
- Test execution in review: Reviewer runs npm test before evaluation. Failed tests = automatic NEEDS_CHANGES.
- Reviewer Playbook rules: 5 default locked/overridable rules (scope check, test requirement, metadata-only rejection, etc.)
- Review scoring: Each review produces a 0-100 quality score.
- Verdict badges: Task cards show Approved (green), Changes (amber), Blocked (red) with hover tooltip.
Knowledge & Learning
- Skills → Playbook integration: Skills moved from hardcoded CLI to Playbook rules (category="skill"). Tag-based matching — only injected when task labels match rule tags.
- Shared Agent Memory: Agents can share knowledge across the team via shared: true in memory frontmatter. Tag-based pinpoint injection.
- Knowledge management page: Settings > Knowledge with Shared Memories (click-to-expand) + Skills tabs.
- Failure Database: Auto-populated on task reject. Past failures injected into agent prompts to prevent recurring issues. Includes Japanese text normalization and retry-noise deduplication.
- Team Knowledge: POST /failures/analyze groups failure patterns and suggests Playbook rules. One-click apply to Playbook.
- Previous Review injection: Retry tasks receive prior review feedback in prompt.
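The tag-based matching mentioned in this list can be pictured roughly as follows. This is a sketch only: the SkillRule shape and the rulesForTask function are hypothetical, and exact string comparison between tags and labels is an assumption, since the notes do not describe how matching was implemented.

```typescript
// Hypothetical rule shape; field names are assumptions for illustration.
interface SkillRule {
  id: string;
  tags: string[];
  text: string; // the content that would be injected into the prompt
}

// Keeps only the rules that share at least one tag with the task's labels.
// Assumes exact, case-sensitive string equality between tags and labels.
function rulesForTask(rules: SkillRule[], taskLabels: string[]): SkillRule[] {
  const labels = new Set(taskLabels);
  return rules.filter((rule) => rule.tags.some((tag) => labels.has(tag)));
}
```

A rule with no tags never matches under this sketch, which keeps untagged rules out of every prompt.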
Sprint Planning (Strategist)
- toban plan: Strategist agent analyzes backlog, velocity, quality trends, and failure patterns to generate sprint plans via local Claude CLI.
- toban propose: Strategist analyzes data sources and suggests improvement tasks. Auto-runs when sprint enters retrospective.
- Proposals UI: Improvement proposals displayed in retrospective view with approve (→ Sprint / → Backlog) and reject (with reason) buttons.
- Sprint Plan API: CRUD endpoints for plans with approve/reject workflow.
- Proposals API: CRUD + batch create + approve with sprint target + reject with reason.
- Analytics in Manager context: Velocity and quality trends injected into Manager prompt.
Sprint Flow
- Sprint transition guards: Active → Review blocked if in_progress tasks remain. Review → Retro blocked if review/in_progress/todo tasks remain. Retro → Complete blocked while Strategist is generating proposals.
- Retrospective auto-proposals: Entering retrospective triggers Strategist to analyze and generate improvement proposals.
- Review → Retro required: Removed "Next Sprint" shortcut from Review phase. Must go through Retrospective.
- Phase error display: Sprint phase transition failures shown to user (was silently swallowed).
- CLI active-only task pickup: Agents only pick up tasks during active phase, not during review/retrospective.
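The transition guards listed above can be summarized as a small check that returns a reason when a phase change is blocked. This is a sketch under assumptions: the SprintState shape, the "done" status, and the transitionBlocker function are hypothetical, and only the three guarded transitions named in the notes are modeled.

```typescript
type Phase = "active" | "review" | "retrospective" | "completed";
// "done" is an assumed status name; the others appear in these notes.
type TaskStatus = "todo" | "in_progress" | "review" | "done" | "blocked";

// Hypothetical snapshot of what a guard would need to look at.
interface SprintState {
  phase: Phase;
  taskStatuses: TaskStatus[];
  strategistRunning: boolean; // true while proposals are being generated
}

// Returns a human-readable reason if the transition is blocked, or null if allowed.
function transitionBlocker(state: SprintState, target: Phase): string | null {
  const hasAny = (...statuses: TaskStatus[]): boolean =>
    state.taskStatuses.some((s) => statuses.includes(s));

  if (state.phase === "active" && target === "review") {
    return hasAny("in_progress") ? "in_progress tasks remain" : null;
  }
  if (state.phase === "review" && target === "retrospective") {
    return hasAny("review", "in_progress", "todo")
      ? "review/in_progress/todo tasks remain"
      : null;
  }
  if (state.phase === "retrospective" && target === "completed") {
    return state.strategistRunning ? "Strategist is still generating proposals" : null;
  }
  // Anything else (including skipping Retrospective) is rejected in this sketch.
  return `unsupported transition: ${state.phase} -> ${target}`;
}
```

Returning the reason as a string, rather than a bare boolean, fits the phase error display item: the UI has something concrete to show when a transition fails.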
Quality & Data
- Quality Score API: GET /sprints/:number/quality and GET /analytics/quality with first-pass approve rate, avg review score, and per-agent breakdown.
- Quality Score UI: Analytics page trend chart + Sprint header badge.
- Failure DB management page: Filter by agent/type, expandable detail rows, pattern analysis UI.
- Recurring Tasks page: CRUD with frequency options, enabled/disabled toggle.
- Tasks Skills field: Skills array on tasks and recurring tasks, UI editing in task modal.
Agent Reliability
- NEEDS_CHANGES → todo: Tasks with NEEDS_CHANGES verdict automatically reset to todo.
- Retry limit: After 3 failed attempts, task moves to review with "Blocked" comment for human review.
- allow_no_commit_completion: Template flag — agents that complete without code changes go to review instead of infinite loop.
- Stream result fallback: Extracts completion from stream-json result when COMPLETION_JSON is not explicitly output.
- Per-task execution logger: ~/.toban/logs/tasks/{taskId}.jsonl for debugging.
- Description pre-check: Tasks with descriptions under 20 characters are auto-blocked.
- Context budget: 30K token limit with priority-ordered section trimming.
- Auto-split: Tasks with SP >= 5 are automatically split by LLM into subtasks.
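The NEEDS_CHANGES and retry-limit items above combine into a simple rule, sketched here. The ReviewedTask shape, the applyNeedsChanges function, and the comment wording are hypothetical; the sketch also assumes that each NEEDS_CHANGES verdict counts as one failed attempt.

```typescript
const MAX_FAILED_ATTEMPTS = 3; // from the notes: "After 3 failed attempts"

type TaskStatus = "todo" | "in_progress" | "review" | "blocked";

// Hypothetical task record; field names are assumptions for illustration.
interface ReviewedTask {
  id: string;
  status: TaskStatus;
  failedAttempts: number;
  comments: string[];
}

// Applies a NEEDS_CHANGES verdict: the task goes back to todo for another
// attempt, unless the retry limit is reached, in which case it moves to
// review with a "Blocked" comment so a human can take over.
function applyNeedsChanges(task: ReviewedTask): ReviewedTask {
  const failedAttempts = task.failedAttempts + 1;
  if (failedAttempts >= MAX_FAILED_ATTEMPTS) {
    return {
      ...task,
      failedAttempts,
      status: "review",
      comments: [...task.comments, "Blocked: retry limit reached, needs human review"],
    };
  }
  return { ...task, failedAttempts, status: "todo" };
}
```

The function returns a new record instead of mutating its input, which makes the before and after states easy to compare in a log.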
Templates
- Content template: For documentation tasks (docs-only file restrictions).
- Strategy template: For research/planning tasks (read-only + web search).
- Reviewer template: For code review (read-only + test execution).
Setup & Configuration
- 4 required agents: Setup automatically creates Manager, Builder, Reviewer, Strategist. Free plan limit raised to 4.
- Setup overwrite protection: Re-running setup on a configured project returns 409.
- Dynamic model selection: Agent engine setting resolved to model ID per role. Builder defaults to Opus 4.6, Reviewer/Strategist to Sonnet 4.6.
- Workspace switching: Cookie and localStorage properly cleared on project switch.
Infrastructure
- API proxy removal (production): Browser communicates directly with api.toban.dev.
- Agent API key expiry: 24-hour expiry with sliding window.
- Git credential caching: 50-minute token cache.
- blocked status: Added to task status enum.
- Playbook tags: Tag-based skill rule matching.
- Sprint plans + Task proposals tables: Persistent storage for Strategist outputs.
- Origin header forwarding: Proxy forwards Origin for correct cookie domain on localhost.
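One common reading of "24-hour expiry with sliding window" is that every successful use of a key pushes its expiry another 24 hours out, so only keys idle for a full day expire. The sketch below illustrates that reading; it is an assumption about the behavior, and AgentApiKey, issueKey, and useKey are hypothetical names.

```typescript
const KEY_TTL_MS = 24 * 60 * 60 * 1000; // 24 hours, per the notes

// Hypothetical key record; timestamps are epoch milliseconds.
interface AgentApiKey {
  id: string;
  expiresAt: number;
}

// Issues a key that is valid for 24 hours from `now`.
function issueKey(id: string, now: number): AgentApiKey {
  return { id, expiresAt: now + KEY_TTL_MS };
}

// Validates a key at time `now`. An expired key yields null; a valid key is
// returned with its expiry slid forward to 24 hours after this use.
function useKey(key: AgentApiKey, now: number): AgentApiKey | null {
  if (now >= key.expiresAt) return null;
  return { ...key, expiresAt: now + KEY_TTL_MS };
}
```

Passing the current time in as a parameter, rather than calling Date.now() inside, keeps the functions deterministic and easy to exercise with fixed timestamps.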
Known Issues (fixed from v0.1.0)
- ~~Agent API keys do not expire~~ → 24h expiry with sliding window
- ~~Review timeout on large diffs~~ → Reviewer agent with 5min timeout
Known Issues (remaining)
- Individual use only — team/multi-user features not available
- GitHub App org installation may not appear in Setup (workaround: set installation_id manually)
- Manager chat is unreliable; use the toban plan / toban propose CLI commands instead
- API 500 on D1 writes: Task updates and review-report may intermittently return 500. CLI retries automatically (2 attempts with backoff), but root cause needs investigation
- Reviewer max-turns: Reviewer may hit 5-turn limit on no-commit verification tasks. Increase planned
- Strategist proposals fetch: toban propose may fail to save proposals due to execSync blocking Node.js event loop. Proposals display in CLI but may not persist to DB
- engine/model column: DB agents.engine mixes provider types (claude-code) and model names (claude-opus). CLI has PROVIDER_ENGINE_IDS workaround
- Setup duplicate tasks: Initial Sprint #0 may generate duplicate tasks via Manager or RETRO suggested_tasks
- RETRO_JSON parse: Agent may append text after RETRO_JSON, causing parse failure
- localStorage leakage: Workspace-scoped data (terminal, proposals) not prefixed with workspace ID — cleared on switch but not preserved per project
- Sprint #0 planning status: Newly created projects start with planning status. UI now shows "Start Sprint" button correctly
v0.1.0 (2026-03-19)
Initial public release.
Features
- Sprint Management: Kanban board with drag-and-drop, sprint phases (active/review/retrospective/completed)
- Manager AI: LLM-powered task proposals, sprint coordination, interactive proposal cards
- Builder Agent: Automated code implementation in git worktrees, trunk-based development
- Auto Code Review: LLM-generated structured review (requirement match, code quality, test coverage, risks, verdict)
- Real-time Dashboard: WebSocket-powered live updates for tasks, agents, and sprint state
- Sprint Analytics: Velocity, burndown, hourly activity, cumulative charts
- Playbook: Configurable agent rules (locked/overridable/recommended), per-agent targeting
- ADR System: Architecture Decision Records injected into agent prompts
- Project Spec: Vision, roadmap, tech stack — feeds into agent context
- Multi-repo Support: Agents can work across multiple repositories
- GitHub App Auth: OAuth login, installation token for git operations
- CLI: npx toban-cli@latest start (connects to dashboard, spawns agents locally)
Known Issues
- Individual use only — team/multi-user features are not yet available
- GitHub App installed on an organization may not appear in Setup repo selection (personal account only)
- Project deletion and re-creation may leave stale data in some edge cases
- Claude Code CLI must be installed locally to run agents (no cloud execution yet)
- Review timeout on large diffs (workaround: keep tasks small)
- Agent API keys do not currently expire (fix planned)