Work Packages: Enable Plan Mission Runtime Support on 2.x

Feature: 041-enable-plan-mission-runtime-support Branch: 2.x Created: 2026-02-22 Total Work Packages: 5 Total Subtasks: 19


Overview

The plan mission system currently blocks the runtime loop because it lacks: 1. Runtime schema (mission-runtime.yaml) 2. Mission-scoped command templates (4 steps) 3. Comprehensive tests

This feature is organized into 5 focused work packages, organized by phase:

  • Phase 1: Foundation (WP01) - Runtime schema setup
  • Phase 2: Mission Templates (WP02) - Command templates for all 4 steps
  • Phase 3: Support (WP03) - Content templates and test setup
  • Phase 4: Testing (WP04) - Integration, resolution, and regression tests
  • Phase 5: Finalization (WP05) - Dependency parsing and commit

Work Package Breakdown

WP01: Runtime Schema Foundation

Priority: P0 (blocking) Dependencies: None Estimated Size: 4 subtasks, ~280 lines Status: Planned

Goal: Create the runtime schema that enables the runtime bridge to discover and load the plan mission.

Included Subtasks:

  • ✅ T001: Study existing mission-runtime.yaml patterns
  • ✅ T002: Create mission-runtime.yaml with 4-step schema
  • ✅ T003: Validate against runtime bridge expectations
  • ✅ T016: Create directory structure

Implementation Sketch: 1. Examine software-dev and research mission runtime definitions 2. Create plan mission-runtime.yaml following the same schema 3. Define 4 sequential steps (specify → research → plan → review) 4. Ensure directories exist for command templates and content templates

Parallel Opportunities: None (foundational)

Risks:

  • Schema incompatibility with runtime bridge
  • Mitigation: Follow existing patterns exactly

Definition of Done:

  • mission-runtime.yaml created at correct path
  • Schema validates (4 steps, linear sequence, terminal step defined)
  • Directories created and ready for templates
  • No broken references to runtime bridge

WP02: All Command Templates

Priority: P0 Dependencies: WP01 (directories must exist) Estimated Size: 6 subtasks, ~420 lines Status: Planned Parallelization: [P] Templates can be created in parallel after WP01 is done

Goal: Create 4 mission-scoped command templates that guide agents through the planning workflow.

Included Subtasks:

  • ✅ T004: Create specify.md (step 1 - feature definition)
  • ✅ T005: Create research.md (step 2 - research gathering)
  • ✅ T006: Create plan.md (step 3 - design artifacts)
  • ✅ T007: Create review.md (step 4 - final validation)
  • ✅ T008: Validate all templates for completeness
  • ✅ T009: Analyze for content template references

Implementation Sketch: 1. Create specify.md with entry point context, deliverables, instructions 2. Create research.md with research phase guidance 3. Create plan.md with design artifact generation 4. Create review.md with validation criteria 5. Each template follows Markdown + YAML frontmatter structure 6. Scan for any references to content templates (../templates/...)

Parallel Opportunities: [P] All 4 templates can be created in parallel after WP01

Risks:

  • Inconsistent template structure between files
  • Missing content template references detected too late
  • Mitigation: Validate all templates before moving to WP03

Definition of Done:

  • All 4 template files created at correct paths
  • Frontmatter YAML parses correctly
  • All required sections present (Context, Deliverables, Instructions, Success Criteria)
  • No broken references to other files
  • Content template references identified

WP03: Content Templates & Test Setup

Priority: P1 Dependencies: WP02 (need to know what templates are referenced) Estimated Size: 3 subtasks, ~260 lines Status: Planned

Goal: Create any referenced content templates and set up the test framework.

Included Subtasks:

  • ✅ T010: Create any referenced content templates
  • ✅ T011: Create test file (test_plan_mission_runtime.py)
  • ✅ T012: Set up test fixtures and mocks

Implementation Sketch: 1. Review content template references from WP02 T009 2. Create any referenced templates (e.g., research-outline.md, design-checklist.md) 3. Create tests/specify_cli/next/test_plan_mission_runtime.py 4. Set up pytest fixtures: temp_project, plan_feature, mock_runtime_bridge 5. Create reusable mocks for runtime bridge and command resolver

Parallel Opportunities: None (must wait for WP02)

Risks:

  • Content templates missing if references analyzed incorrectly
  • Test fixtures may not work with real runtime bridge
  • Mitigation: Unit test fixtures before writing integration tests

Definition of Done:

  • Content templates created (if any referenced)
  • Test file created with proper structure
  • Fixtures and mocks functional
  • Test file imports correctly (no syntax errors)
  • Ready for test implementation (WP04)

WP04: Integration & Regression Tests

Priority: P0 Dependencies: WP03 (test setup required) Estimated Size: 5 subtasks, ~380 lines Status: Planned

Goal: Comprehensive testing to ensure plan mission works end-to-end and other missions remain unaffected.

Included Subtasks:

  • ✅ T013: Implement mission discovery integration test
  • ✅ T014: Implement command resolution tests (all 4 steps)
  • ✅ T015: Implement regression tests (software-dev, research)
  • □ T016: Verify test coverage and CI compatibility (UPDATED: was T016, renamed)
  • ✅ T017: Run tests locally and verify all pass

Implementation Sketch:

T013 - Mission Discovery Integration:

  • Create feature with mission=plan
  • Call spec-kitty next --feature <slug>
  • Verify return status is not blocked
  • Assert no "Mission 'plan' not found" error

T014 - Command Resolution Tests:

  • Test resolver for each step: specify, research, plan, review
  • Load each template file
  • Verify YAML frontmatter parses
  • Verify all required sections present
  • Check for missing references

T015 - Regression Tests:

  • Run existing software-dev mission tests
  • Run existing research mission tests
  • Verify no broken functionality
  • Ensure backward compatibility

T016 - Coverage & CI:

  • Generate coverage report
  • Verify deterministic test execution (no timing deps, no random data)
  • Test with pytest headless flags
  • Ensure isolation (no cross-test contamination)

T017 - Local Verification:

  • Run full test suite locally: pytest tests/specify_cli/next/test_plan_mission_runtime.py -v
  • Verify all pass (100% pass rate)
  • Check coverage metrics (target: >85%)
  • Verify no external dependencies or network calls

Parallel Opportunities: [P] Integration tests can run in parallel with regression tests

Risks:

  • Tests may have timing-dependent assertions
  • Cross-test contamination in fixtures
  • Regression tests fail due to other changes
  • Mitigation: Use deterministic data, isolated fixtures, mock external calls

Definition of Done:

  • All test classes implemented (Integration, Resolution, Regression)
  • 100% test pass rate locally
  • Coverage >85%
  • No external service dependencies
  • No timing-dependent assertions
  • All tests deterministic and repeatable
  • Ready for finalization

WP05: Finalization & Commit

Priority: P0 (blocking release) Dependencies: WP04 (all tests must pass) Estimated Size: 1 subtask, ~50 lines Status: Planned

Goal: Parse dependencies and commit all work to the 2.x branch.

Included Subtasks:

  • ✅ T018: Run finalize-tasks command and verify commit

Implementation Sketch: 1. Run spec-kitty agent feature finalize-tasks --json 2. Verify JSON output: "commit_created": true 3. Capture commit hash 4. Verify all WP files in tasks/ directory 5. Verify frontmatter has dependencies field populated 6. Verify no cycles in dependency graph 7. Confirm commit was pushed to 2.x branch (if applicable)

Parallel Opportunities: None (final step)

Risks:

  • Dependencies not parsed correctly
  • Commit fails due to untracked files
  • Dependency cycle detected (would block)
  • Mitigation: Verify JSON output carefully

Definition of Done:

  • spec-kitty agent feature finalize-tasks succeeds
  • JSON output shows commit_created: true
  • All WP frontmatter updated with dependencies
  • Git log shows new commit on 2.x
  • Feature ready for implementation (spec-kitty implement WP01)

Phase Dependencies

WP01: Foundation
  ↓ (must complete first)
WP02: Templates
  ↓ (templates must exist)
WP03: Test Setup
  ↓ (test framework ready)
WP04: Tests
  ↓ (tests passing)
WP05: Finalization
  ↓ (commit to 2.x)
Ready for Implementation

MVP Scope Recommendation

Minimum Viable Product: WP01 + WP02 + WP04 (Minimal Tests)

  • Core functionality (schema + templates) working
  • Basic integration test passing
  • No full test suite

Recommended Scope: WP01 + WP02 + WP03 + WP04 (Full)

  • Complete implementation with comprehensive testing
  • Content templates included
  • Regression tests ensure no breakage
  • Recommended for merge to 2.x

Extended Scope: All 5 WPs

  • Full implementation with finalization
  • All tests passing
  • Committed to 2.x branch
  • Ready for agent implementation

Size Distribution

WPSubtasksEst. LinesStatus
WP014280✓ Planned
WP026420✓ Planned
WP033260✓ Planned
WP045380✓ Planned
WP05150✓ Planned
Total19~1390-

Assessment:

  • ✓ All WPs within ideal range (200-500 lines)
  • ✓ All WPs have 1-6 subtasks (target: 3-7, max: 10)
  • ✓ No WP exceeds 700 lines (max safe for agents)
  • ✓ Balanced distribution of work across phases

Parallelization Opportunities

Sequential Critical Path (required order): 1. WP01 (foundation - 4 subtasks) 2. WP02 (templates - 6 subtasks) 3. WP03 (test setup - 3 subtasks) 4. WP04 (tests - 5 subtasks) 5. WP05 (finalization - 1 subtask)

Parallel Opportunities (after critical path):

  • [P] WP02 templates can be created in parallel (once WP01 directories exist)
  • [P] WP04 integration tests can run in parallel with regression tests

Estimated Timeline:

  • Sequential: ~5 agents × 1-2 hours each = 5-10 hours
  • With parallelization: 3-4 agents in parallel = 3-5 hours

Risk Summary

RiskImpactMitigation
Schema incompatibility with runtime bridgeHIGHFollow existing patterns exactly (software-dev, research)
Content templates missing or incompleteMEDIUMAnalyze references thoroughly before creating
Tests fail due to external dependenciesMEDIUMMock all external services, use deterministic data
Regression in other missionsHIGHComprehensive regression tests in WP04
2.x path conflictsMEDIUMKeep all paths local to missions/plan/, no doctrine paths

Notes for Implementers

1. 2.x Branch Only: All changes must be on the 2.x branch. No mainline changes.

2. Reference Patterns: Follow software-dev and research mission structures exactly. Don't invent new patterns.

3. Testing is Critical: WP04 is not optional. Comprehensive testing ensures no regressions.

4. Content Templates: Only create if explicitly referenced by command templates. Don't create generic templates.

5. Deterministic Tests: No timing dependencies, no random data, no external service calls. Tests must pass consistently in CI.

6. Path Safety: All paths must be 2.x-compatible. No doctrine paths, no mainline paths.


Next Steps After Task Generation

1. Review: Verify all WP prompt files generated correctly 2. Estimate: Each agent should review estimated time before starting 3. Implement: Agents implement WPs in sequence: WP01 → WP02 → WP03 → WP04 → WP05 4. Test: After WP04, run full test suite locally 5. Finalize: WP05 commits all work to 2.x branch 6. Review: Code review of all artifacts 7. Merge: Merge to 2.x when tests pass


Prompt File References

Individual work package prompts are generated in tasks/ directory:

  • WP01-runtime-schema-foundation.md - Runtime schema setup
  • WP02-command-templates.md - All 4 step templates
  • WP03-content-test-setup.md - Supporting templates and test framework
  • WP04-integration-regression-tests.md - Comprehensive test suite
  • WP05-finalization-commit.md - Finalization and commit

Each prompt includes detailed implementation guidance, code examples, test cases, and success criteria.

<!-- status-model:start -->

Canonical Status (Generated)

<!-- status-model:end -->

  • WP01: done
  • WP02: done
  • WP03: done
  • WP04: done
  • WP05: done