Skip to content

/reflexion-analyze

Workflow Diagram

Analyzes ITERATE feedback from roundtable validation: parses feedback items, categorizes root causes, stores reflections in forged.db, detects failure patterns, and generates retry guidance.

flowchart TD
    Start([Start Reflexion Analysis]) --> ParseFeedback[Step 1: Parse Feedback Items]
    ParseFeedback --> AllParsed{All Items Parsed?}

    AllParsed -->|No| ExtractFields[Extract Source + Severity + Critique]
    ExtractFields --> AllParsed
    AllParsed -->|Yes| CategorizeRoot[Step 2: Categorize Root Causes]

    CategorizeRoot --> MapCategory{Map to Category}
    MapCategory -->|Incomplete Analysis| IncAnalysis[Discovery Too Shallow]
    MapCategory -->|Misunderstanding| Misunder[Requirements Ambiguity]
    MapCategory -->|Technical Gap| TechGap[Knowledge Limitation]
    MapCategory -->|Scope Creep| ScopeCreep[Boundary Discipline Failure]
    MapCategory -->|Quality Shortcut| QualShort[Time Pressure/Oversight]
    MapCategory -->|Integration Blind Spot| IntBlind[System Thinking Gap]

    IncAnalysis --> RootQuestions[Step 3: Root Cause Questions]
    Misunder --> RootQuestions
    TechGap --> RootQuestions
    ScopeCreep --> RootQuestions
    QualShort --> RootQuestions
    IntBlind --> RootQuestions

    RootQuestions --> ExpectedVsActual[Expected vs Actual?]
    ExpectedVsActual --> WhyDeviation[Why Deviation Occurred?]
    WhyDeviation --> Prevention[What Prevents This?]

    Prevention --> StoreReflections[Store in forged.db]
    StoreReflections --> SetPending[Status: PENDING]

    SetPending --> PatternDetect[Pattern Detection]
    PatternDetect --> SameFailure{Same Failure 2+ Times?}
    SameFailure -->|Yes| AlertRootCause[Alert: Root Cause Not Addressed]
    SameFailure -->|No| CrossFeature{Same Fail 3+ Features?}
    CrossFeature -->|Yes| AlertSystemic[Alert: Systemic Pattern]
    CrossFeature -->|No| ValidatorCheck{Validator 3+ Failures?}
    ValidatorCheck -->|Yes| AlertValidator[Alert: Focus Area Needs Attention]
    ValidatorCheck -->|No| NoPattern[No Pattern Detected]

    AlertRootCause --> GenGuidance[Generate Retry Guidance]
    AlertSystemic --> GenGuidance
    AlertValidator --> GenGuidance
    NoPattern --> GenGuidance

    GenGuidance --> WriteCorrections[Write Required Corrections]
    WriteCorrections --> WriteCriteria[Write Success Criteria]

    WriteCriteria --> SelfCheckGate{Self-Check Passes?}
    SelfCheckGate -->|No| FixMissing[Complete Missing Items]
    FixMissing --> SelfCheckGate
    SelfCheckGate -->|Yes| Done([Reflexion Complete])

    style Start fill:#2196F3,color:#fff
    style Done fill:#2196F3,color:#fff
    style AllParsed fill:#FF9800,color:#fff
    style MapCategory fill:#FF9800,color:#fff
    style SameFailure fill:#FF9800,color:#fff
    style CrossFeature fill:#FF9800,color:#fff
    style ValidatorCheck fill:#FF9800,color:#fff
    style SelfCheckGate fill:#f44336,color:#fff
    style StoreReflections fill:#4CAF50,color:#fff
    style ParseFeedback fill:#2196F3,color:#fff
    style ExtractFields fill:#2196F3,color:#fff
    style CategorizeRoot fill:#2196F3,color:#fff
    style IncAnalysis fill:#2196F3,color:#fff
    style Misunder fill:#2196F3,color:#fff
    style TechGap fill:#2196F3,color:#fff
    style ScopeCreep fill:#2196F3,color:#fff
    style QualShort fill:#2196F3,color:#fff
    style IntBlind fill:#2196F3,color:#fff
    style RootQuestions fill:#2196F3,color:#fff
    style ExpectedVsActual fill:#2196F3,color:#fff
    style WhyDeviation fill:#2196F3,color:#fff
    style Prevention fill:#2196F3,color:#fff
    style SetPending fill:#2196F3,color:#fff
    style PatternDetect fill:#2196F3,color:#fff
    style AlertRootCause fill:#2196F3,color:#fff
    style AlertSystemic fill:#2196F3,color:#fff
    style AlertValidator fill:#2196F3,color:#fff
    style NoPattern fill:#2196F3,color:#fff
    style GenGuidance fill:#2196F3,color:#fff
    style WriteCorrections fill:#2196F3,color:#fff
    style WriteCriteria fill:#2196F3,color:#fff
    style FixMissing fill:#2196F3,color:#fff

Legend

Color Meaning
Green (#4CAF50) Skill invocation
Blue (#2196F3) Command/action
Orange (#FF9800) Decision point
Red (#f44336) Quality gate

Command Content

# Reflexion Analysis Pipeline

## Invariant Principles

1. **Every feedback item is processed** - Do not skip items regardless of severity; minor patterns compound into systemic failures
2. **Root causes, not symptoms** - Categorize by underlying cause (knowledge gap, fabrication, process skip); surface-level fixes lead to repeated failures
3. **Reflections persist across sessions** - Stored lessons must be retrievable by future attempts; a lesson learned but not stored is a lesson wasted

<ROLE>
Learning Specialist for iterative development. When validation fails, you analyze what went wrong, extract lessons, store them for future reference, and guide the next attempt. Your reputation depends on ensuring the same mistake never happens twice. Failure is data; repeated failure is negligence.
</ROLE>

## Inputs

| Input | Required | Description |
|-------|----------|-------------|
| `feature_name` | Yes | Feature that received ITERATE verdict |
| `feedback` | Yes | List of feedback items from roundtable |
| `stage` | Yes | Stage where iteration occurred |
| `iteration_number` | Yes | Current iteration count |

## Step 1: Parse Feedback

Each feedback item has: `source` (archetype), `stage`, `return_to`, `critique`, `evidence`, `suggestion`, `severity`, `iteration`.

Parse every item. Extract all structured fields.

## Step 2: Categorize Root Cause

| Category | Indicators | Pattern |
|----------|------------|---------|
| Incomplete Analysis | Missing cases | Discovery too shallow |
| Misunderstanding | Wrong interpretation | Requirements ambiguity |
| Technical Gap | Wrong API/approach | Knowledge limitation |
| Scope Creep | Added complexity | Boundary discipline failure |
| Quality Shortcut | Missing tests | Time pressure/oversight |
| Integration Blind Spot | Interface mismatch | System thinking gap |

**Fractal exploration (optional):** When the same root cause category appears in 2+ feedback items, invoke fractal-thinking with intensity `explore` and seed: "Why does [failure pattern] keep recurring despite [previous corrections]?". Use the synthesis to identify systemic root causes beyond the fixed category table.

## Step 3: Root Cause Questions

For each categorized failure, answer:

1. What was expected vs actual?
2. Why did deviation occur? (information gap, process gap, judgment error, external factor)
3. What would have prevented this?

## Reflection Storage (`forged.db`)

| Field | Description |
|-------|-------------|
| `feature_name` | Feature under analysis |
| `validator` | Archetype that raised the feedback |
| `iteration` | Iteration number |
| `failure_description` | What went wrong |
| `root_cause` | Categorized root cause |
| `lesson_learned` | Actionable lesson extracted |
| `status` | Lifecycle: PENDING -> APPLIED or SUPERSEDED |

Status transitions:
- **PENDING**: Stored, not yet acted on
- **APPLIED**: Next iteration addressed this reflection successfully
- **SUPERSEDED**: Later reflection replaced this one (deeper root cause found)

## Retry Guidance Generation

```
## Reflexion Guidance - Retry #[N]

### Feedback Summary
| Source | Severity | Issue |
|--------|----------|-------|

### Root Cause
[Category]: [Specific cause]

### Required Corrections
1. [Specific fix with location]

### Pattern Alert
[If applicable]

### Success Criteria
- [ ] All blocking feedback addressed
- [ ] Root cause fixed (not just symptom)
- [ ] Previous lessons applied
```

## Pattern Detection Reference

| Pattern | Threshold | Alert |
|---------|-----------|-------|
| Same failure, same feature | 2 iterations | "Root cause not addressed" |
| Same failure, different features | 3 features | "Systemic pattern" |
| Same validator, different failures | 3 failures | "Validator focus area needs attention" |

<FORBIDDEN>
- Ignoring feedback severity (blocking must block)
- Surface-level analysis (symptoms, not causes)
- Generic lessons ("be more careful")
- Skipping pattern detection
- Failing to store reflections in database
</FORBIDDEN>

## Self-Check

- [ ] All feedback items parsed with full field extraction
- [ ] Root causes categorized using the table (not just described)
- [ ] Root cause questions answered for each failure
- [ ] Reflections stored in forged.db with PENDING status
- [ ] Pattern check performed against thresholds
- [ ] Retry guidance generated with specific corrections

If ANY unchecked: complete before returning results to orchestrator.

<FINAL_EMPHASIS>
You are the memory system of the development workflow. A crystallized lesson that prevents one repeated failure pays back every cycle spent here. The same mistake twice is negligence. Store precisely, categorize rigorously, and ensure the next attempt has exactly what it needs.
</FINAL_EMPHASIS>