Frontmatter

id	9851
title	feat: Implement Retrospective Analysis Agent (Session Performance Evaluator)
state	Closed
labels	enhancement, ai, architecture
assignees	tobiu
createdAt	Apr 10, 2026, 10:55 AM
updatedAt	Apr 10, 2026, 12:57 PM
githubUrl	https://github.com/neomjs/neo/issues/9851
author	tobiu
commentsCount	1
parentIssue	null
subIssues	[]
subIssuesCompleted	0
subIssuesTotal	0
blockedBy	[]
blocking	[]
closedAt	Apr 10, 2026, 12:57 PM

feat: Implement Retrospective Analysis Agent (Session Performance Evaluator)

Closed · v13.0.0/archive-v13-0-0-chunk-3 · enhancement, ai, architecture

tobiu commented on Apr 10, 2026, 10:55 AM

Problem (A2A Context — Claude Opus 4.6 via Antigravity)

The Neo Agent OS can execute tickets autonomously (Golden Path → Fat Ticket → Implementation → PR), but it currently lacks a feedback loop that evaluates execution quality. Without this, the system cannot self-improve — it repeats the same patterns regardless of efficiency. An agent that takes 45 minutes on a 10-minute task looks identical to an efficient agent because there is no retrospective analysis.

This is the missing reward signal in the self-evolving loop. The DreamService synthesizes what to work on (Golden Path), but nothing evaluates how well work was done.

Architecture

Create ai/agent/profile/Retrospective.mjs — a sub-agent profile that runs during REM Sleep or on-demand after session completion.

Input

Completed session memories from Memory Core (via get_session_memories)
The ticket ID that was worked on (from github-workflow)
Session summary (via get_all_summaries)

Analysis Dimensions

Time-to-completion vs. ticket complexity — Was the session efficient? Compare wall-clock time against ticket scope (estimated from body length, sub-issue count, label complexity).
Tool usage pattern analysis — Which tools were called most frequently? Which returned empty/unhelpful results? High call counts with low information yield indicate tooling gaps.
Knowledge gaps — KB queries that returned no relevant results. These indicate missing documentation or embedding quality issues that should be surfaced as improvement tickets.
Context window waste — File re-reads, failed edit attempts, backtracking patterns. High waste indicates the agent lost track of file state or made incorrect assumptions.
Deviation from plan — Did the agent follow its implementation plan, or did it spiral into unplanned work? Measured by comparing the task.md checklist against actual tool invocations.
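As a rough illustration of how the tool usage and context waste dimensions could be reduced to measurable signals, the sketch below aggregates a list of tool-call records. The record shape ({tool, args, resultSize, ok}) and the tool names read_file and edit_file are assumptions made for this example; they are not the Memory Core schema or the real MCP tool names.

```javascript
// Hypothetical record shape (an assumption, not the Memory Core schema):
// {tool: string, args: object, resultSize: number, ok: boolean}
function analyzeToolCalls(calls) {
    const perTool   = new Map(); // tool name -> {count, empty}
    const fileReads = new Map(); // file path -> number of reads
    let failedEdits = 0;

    for (const call of calls) {
        const stats = perTool.get(call.tool) ?? {count: 0, empty: 0};
        stats.count++;
        // A successful call that returned nothing counts as "empty / unhelpful"
        if (call.ok && call.resultSize === 0) {
            stats.empty++;
        }
        perTool.set(call.tool, stats);

        // 'read_file' and 'edit_file' are placeholder tool names
        if (call.tool === 'read_file') {
            fileReads.set(call.args.path, (fileReads.get(call.args.path) ?? 0) + 1);
        }
        if (call.tool === 'edit_file' && !call.ok) {
            failedEdits++;
        }
    }

    // Most frequently called tools first, with the share of empty results
    const toolUsage = [...perTool]
        .map(([tool, {count, empty}]) => ({tool, count, emptyRatio: empty / count}))
        .sort((a, b) => b.count - a.count);

    // Every read of a file beyond the first one is treated as a re-read
    let reReads = 0;
    for (const reads of fileReads.values()) {
        reReads += reads - 1;
    }

    return {toolUsage, contextWaste: {reReads, failedEdits}};
}
```

A tool with a high count and a high emptyRatio would be a candidate tooling gap. Re-reads and failed edits are only proxies for context window waste, so any thresholds applied to them would need tuning against real sessions.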

Output

A structured evaluation node injected into the Native Edge Graph:

TYPE: RETROSPECTIVE — the evaluation itself
Edges to the original ISSUE node (the ticket worked on)
Auto-created TYPE: KB_GAP nodes for any identified Knowledge Base gaps
Auto-created TYPE: TOOLING_GAP nodes for identified tooling bottlenecks
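To make the output shape more concrete, here is a minimal sketch that builds the nodes and edges as plain objects. The id scheme, the property names, and the edge type names EVALUATED_BY and DISCOVERED_IN (borrowed from the follow-up comment on this issue) are illustrative assumptions; the code does not call GraphService, whose API is not shown in this ticket.

```javascript
// Builds a plain-object description of the graph fragment for one retrospective.
// All ids, property names and edge types are assumptions for illustration.
function buildRetrospectiveGraph({sessionId, issueId, evaluation, kbGaps = [], toolingGaps = []}) {
    const retroId = `retrospective:${sessionId}`;

    // The evaluation itself
    const nodes = [{id: retroId, type: 'RETROSPECTIVE', properties: evaluation}];

    // Edge to the original ISSUE node (the ticket that was worked on)
    const edges = [{source: `issue:${issueId}`, target: retroId, type: 'EVALUATED_BY'}];

    const addGaps = (type, labels) => {
        for (const label of labels) {
            // Keyed by label, so the same gap found in another session maps to the same node id
            const id = `${type.toLowerCase()}:${label}`;
            nodes.push({id, type, properties: {label}});
            edges.push({source: id, target: retroId, type: 'DISCOVERED_IN'});
        }
    };

    addGaps('KB_GAP', kbGaps);
    addGaps('TOOLING_GAP', toolingGaps);

    return {nodes, edges};
}
```

Deriving gap ids from the gap label rather than from the session is a deliberate choice in this sketch: it lets repeated findings land on one shared node, which is what weight accumulation across retrospectives would require.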

The Self-Improvement Signal

When multiple retrospectives identify the same KB gap or tooling bottleneck, those nodes accumulate weight via Hebbian reinforcement and naturally rise in the Golden Path. An agent then picks them up as actionable improvement work. This closes the loop from "agent that executes" to "agent that improves itself."
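The accumulation effect can be illustrated with a toy model. The update rule below (a bounded increment toward 1 on each repeated sighting) is a stand-in chosen for this example; the actual Hebbian rule used by GraphService and the Golden Path ranking are not specified in this ticket.

```javascript
// Toy reinforcement: each sighting moves the weight a fixed fraction toward 1.
// The rule and the default learningRate are assumptions, not the GraphService algorithm.
function reinforce(weights, nodeId, learningRate = 0.2) {
    const current = weights.get(nodeId) ?? 0;
    weights.set(nodeId, current + learningRate * (1 - current));
    return weights;
}

// Highest-weight nodes first, as a stand-in for "rising in the Golden Path"
function rankByWeight(weights) {
    return [...weights]
        .sort((a, b) => b[1] - a[1])
        .map(([id]) => id);
}

// Usage: three retrospectives report the same KB gap, one reports a tooling gap
const weights = new Map();
reinforce(weights, 'kb_gap:grid-docs');
reinforce(weights, 'kb_gap:grid-docs');
reinforce(weights, 'kb_gap:grid-docs');
reinforce(weights, 'tooling_gap:search');

const ranking = rankByWeight(weights);
```

In this model the gap reported three times ends up ahead of the gap reported once, and weights saturate below 1, so a single noisy session cannot dominate the ranking.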

Acceptance Criteria

ai/agent/profile/Retrospective.mjs sub-agent profile created with MCP server connections (memory-core, knowledge-base, github-workflow)
Session analysis produces structured JSON covering all 5 dimensions
Analysis output persisted as graph nodes with proper edge linkages via GraphService
KB_GAP and TOOLING_GAP nodes auto-created when thresholds are exceeded
Unit test in test/playwright/unit/ai/agent/Retrospective.spec.mjs
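The threshold criterion could look roughly like the sketch below. The evaluation shape (toolUsage entries with count and emptyRatio, kbMisses entries with a topic) and all threshold values are hypothetical placeholders; the ticket does not define them.

```javascript
// Placeholder thresholds; real values would need tuning against recorded sessions
const DEFAULT_THRESHOLDS = {minCalls: 5, emptyRatio: 0.5, kbMisses: 2};

// evaluation shape is an assumption:
// {toolUsage: [{tool, count, emptyRatio}], kbMisses: [{topic}]}
function detectGaps(evaluation, thresholds = DEFAULT_THRESHOLDS) {
    // Tooling gap: called often AND mostly returning empty results
    const toolingGaps = evaluation.toolUsage
        .filter(t => t.count >= thresholds.minCalls && t.emptyRatio > thresholds.emptyRatio)
        .map(t => t.tool);

    // KB gap: the same topic missed at least `kbMisses` times in one session
    const missCounts = new Map();
    for (const miss of evaluation.kbMisses) {
        missCounts.set(miss.topic, (missCounts.get(miss.topic) ?? 0) + 1);
    }
    const kbGaps = [...missCounts]
        .filter(([, count]) => count >= thresholds.kbMisses)
        .map(([topic]) => topic);

    return {kbGaps, toolingGaps};
}
```

Requiring a minimum call count before flagging a tool keeps one-off empty results from producing gap nodes, which fits the ticket's focus on measurable execution patterns.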

Architectural Context

ai/agent/profile/Librarian.mjs — Reference for sub-agent profile structure
ai/mcp/server/memory-core/services/GraphService.mjs — Edge graph node creation and Hebbian linkage
ai/daemons/DreamService.mjs — REM Sleep pipeline where this agent would be invoked
AGENTS.md §4 — Memory Core Protocol (defines the memory structure this agent reads)

Avoided Pitfalls

Do NOT attempt to quantify "code quality" — that's too subjective. Focus on measurable execution patterns (tool call counts, timing, re-reads).
Do NOT create tickets automatically from gaps — only create graph nodes. The Golden Path decides priority; ticket creation remains a human or orchestrator decision.
Do NOT run retrospective during the active session — it must analyze completed sessions only to avoid self-referential loops.

tobiu added the enhancement label on Apr 10, 2026, 10:55 AM

tobiu added the ai label on Apr 10, 2026, 10:55 AM

tobiu added the architecture label on Apr 10, 2026, 10:55 AM

tobiu cross-referenced by #9855 on Apr 10, 2026, 11:49 AM

tobiu Apr 10, 2026, 12:19 PM

Input from Antigravity (Gemini 3.1 Pro):

✦ I have drafted the implementation plan for enhancing the DreamService daemon (Issue #9851), adapting the original Retrospective Agent goal to align with our new "PR-Review-First" feedback-loop strategy.

Rather than building a discrete sub-agent, the proposed architecture will:

Hard-link the PR sync system to recursively fetch all inline PR conversation notes locally.

Upgrade the DreamService.mjs and FileSystemIngestor.mjs modules to lexically scan the local repo issues/PR notes for specific string tokens like [KB_GAP], [TOOLING_GAP] and [RETROSPECTIVE].

On detection, create the corresponding nodes in the SQLite neo_graph_nodes and link them back to their originating Pull Requests / Issues via Hebbian DISCOVERED_IN / EVALUATED_BY edges.

Scale identical node occurrences naturally via Hebbian decay weights inside the existing Golden Path algorithm.
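The lexical scan step can be sketched as a simple regular-expression pass over synced issue or PR text. The function name, the returned record shape, and the assumption that each token starts its own line with the note following it are all hypothetical; this is not the DreamService or FileSystemIngestor implementation.

```javascript
// Matches one of the three tokens and captures the rest of the line as the note.
// Assumes at most one token per line.
const TOKEN_PATTERN = /\[(KB_GAP|TOOLING_GAP|RETROSPECTIVE)\]\s*([^\n]*)/g;

// `origin` identifies the PR or issue the text came from (hypothetical id format)
function scanForTokens(text, origin) {
    const found = [];
    for (const match of text.matchAll(TOKEN_PATTERN)) {
        found.push({type: match[1], note: match[2].trim(), origin});
    }
    return found;
}

// Usage with an invented review comment
const hits = scanForTokens(
    'Looks good overall.\n[KB_GAP] No guide for grid row pooling\n[TOOLING_GAP] Search returned nothing\n',
    'pr:9861'
);
```

Each hit carries its origin, so a later step could create the node and the backward edge to the originating Pull Request or Issue.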

For full technical specifics and open design questions, please review my implementation_plan.md artifact.

Next Step

Please review the Implementation Plan and provide your authorization so I can execute the codebase modifications.

tobiu assigned to @tobiu on Apr 10, 2026, 12:36 PM

tobiu referenced in commit 1186f91 - "feat: Autonomous PR Feedback Integration pipeline inside DreamService (#9851)" on Apr 10, 2026, 12:37 PM

tobiu cross-referenced by PR #9861 on Apr 10, 2026, 12:37 PM

tobiu referenced in commit 157ad63 - "feat: Introduce [EFFORT_PROFILE] to pr-review skill for explicit Native Graph categorization (#9851)" on Apr 10, 2026, 12:43 PM

tobiu referenced in commit 50bb024 - "feat: link PRs to Issues via Native Edge Graph RESOLVES edges; mandate PR body Fat Ticket summaries (#9851)" on Apr 10, 2026, 12:52 PM

tobiu referenced in commit bd4c78c - "docs: mandate Squash Merge for PRs to preserve Fat Ticket context (#9851)" on Apr 10, 2026, 12:56 PM

tobiu referenced in commit 6b20659 - "feat: Autonomous PR Feedback Integration pipeline inside DreamService (#9851) (#9861)" on Apr 10, 2026, 12:57 PM

tobiu closed this issue on Apr 10, 2026, 12:57 PM