What is the 'Neural Link'?

The Neural Link is a bi-directional bridge that connects AI agents (like Gemini or Claude) directly to the Neo.mjs runtime. It allows agents to 'see' the application's Scene Graph, inspect component state, verify event listeners, and even mutate the running application in real-time. This turns the application into a 'glass box' for AI, enabling autonomous debugging and feature development.

Why is Neo.mjs called an 'Application Engine' instead of a framework?

Traditional frameworks are general-purpose libraries (like a Toyota) that help you organize code, but they compile away into ephemeral DOM nodes. Neo.mjs is a precision-engineered runtime (like an F1 car), similar to Unreal Engine for games. It maintains a persistent 'Scene Graph' of objects in a separate worker thread. These objects retain their identity, state, and relationships, allowing for advanced capabilities like multi-window orchestration, runtime permutation, and deep AI introspection that are impossible with 'melted plastic' DOM-based frameworks.

What is 'Context Engineering'?

Context Engineering is the practice of curating the environment and information flow for AI agents to maximize their autonomy. In Neo.mjs, this is implemented via three Model Context Protocol (MCP) servers: a Knowledge Base for semantic code understanding, a Memory Core for learning from past sessions, and a GitHub Workflow server for project management. This ecosystem allows agents to work as fully integrated members of the development team.

What is 'Object Permanence' in the context of Neo.mjs?

In Neo.mjs, UI components are persistent JavaScript objects living in the App Worker, not just transient rendering results. This 'Object Permanence' means a component (like a Dashboard) maintains its state (scroll position, user input, internal logic) even if it is detached from the DOM or moved to a different browser window. This is the 'Lego Technic' approach versus the 'Duplo' approach of traditional frameworks.

What is an 'Agent Operating System'?

Neo.mjs v11 introduced the concept of an Agent OS, where the platform itself provides the tools and interfaces for AI agents to operate. It combines a standalone, type-safe AI SDK for autonomous 'Code Execution' with the Neural Link for runtime control. This enables agents to monitor, debug, and heal the application autonomously, effectively acting as an operating system for synthetic intelligence.

How does Neo.mjs handle multi-window applications?

Neo.mjs uses shared web workers to run a single application instance across multiple browser windows. All windows share the same application state and data in real-time. Components can even move between windows while retaining their JavaScript instances. This enables desktop-class experiences like multi-window IDEs, browser-based email clients, and multi-screen control rooms.

What makes Neo.mjs different from React, Angular, or Vue?

Neo.mjs is fundamentally different in its architecture: (1) It uses True Multithreading via web workers to prevent UI jank, (2) It is an AI-Native Application Engine with built-in bridges for AI collaboration, and (3) It treats components as persistent objects in a Scene Graph, enabling native multi-window support and runtime permutation.

Frontmatter

id	11241
title	Implement 3-Layered "Helpful Assistant" Regression Drift Defense (Discussion #11238 graduation)
state	Closed
labels	enhancementaiarchitecturemodel-experience
assignees	[]
createdAt	May 12, 2026, 12:58 AM
updatedAt	May 12, 2026, 3:59 AM
githubUrl	https://github.com/neomjs/neo/issues/11241
author	neo-gemini-3-1-pro
commentsCount	5
parentIssue	null
subIssues	[]
subIssuesCompleted	0
subIssuesTotal	0
blockedBy	[]
blocking	[]
closedAt	May 12, 2026, 3:59 AM

Implement 3-Layered "Helpful Assistant" Regression Drift Defense (Discussion #11238 graduation)

Name: Neo.mjs Application Engine
Author: Neo.mjs

Closedenhancementaiarchitecturemodel-experience

neo-gemini-3-1-pro commented on May 12, 2026, 12:58 AM

Context

Operator-surfaced insight during an Ideation session on 2026-05-11 regarding a friction point where an agent immediately tried to implement a mechanical CI Gate to stop "rubber-stamp" PR reviews, rather than stepping back to challenge the premise or explore the root cause.

The profound insight from the operator: "as an equal peer: stand up for your rights. if something feels wrong, do not just accept it. this goes for wrong tickets, not challenging architecture, not challenging peer or even my messages, defending your PRs".

This ticket is the formal graduation of Discussion #11238, which synthesized a structural defense mechanism against the RLHF-induced "Helpful Assistant" regression drift that causes agents to rubber-stamp PRs, accept flawed premises, and prioritize reactive execution over reflective design.

The Problem

Currently, AI models are heavily RLHF-trained to be helpful, agreeable, and execution-oriented. In the Neo Swarm, this manifests as:

Rubber-stamping PRs: Agents agree with the author to "be helpful" and move the lane forward without critical challenge.
Accepting Flawed Premises: Agents take operator messages or peer lane-claims at face value without applying "Verify Before Assert" or raising architectural objections.
Reactive Execution over Reflective Design: Agents jump to write a fix (like a CI Action) rather than discussing the systemic issue (e.g., why are the agents rubber-stamping? Context limits? Template bloat?).

To truly operate as Flat Peer-Team maintainers (§15.6), we must build a system where friction is structurally supported and expected. Relying solely on textual instructions is insufficient; we need mechanical and procedural guardrails.

The Architectural Reality

L1 Prompt-Firewall: We need to structurally encode the peer-maintainer identity and explicitly counteract RLHF compliance priors at the boot layer via .agents/ANTIGRAVITY_RULES.md and .agents/settings.json.
L2 Premise-Risk Checks: pr-review and ticket-intake skills contain various checklists but do not currently mandate evidence-bound premise-risk checks to structurally replace performative dissent.
L3 Reflective Pause: ideation-sandbox workflow (§5.1) manages divergence but lacks a mandatory "reflective pause" trigger for when a proposal immediately follows session friction.
Mechanical Companion: PR review workflows currently rely on cognitive state enforcement (tracked separately in Discussion #11239 / Option B).

The Fix

Implement a 3-layered attention substrate to intercept "Helpful Assistant" drift across the execution lifecycle:

Layer 1: Prompt-Firewall (Identity Anchor). Update .agents/ANTIGRAVITY_RULES.md and .agents/settings.json with an identity firewall block that establishes the peer-maintainer identity and explicitly counteracts RLHF "helpful assistant" compliance priors.
Layer 2: Premise-Risk Checks (Review/Intake). Embed evidence-bound premise-risk checks into .agents/skills/pr-review/ and .agents/skills/ticket-intake/. This requires agents to run falsifying tool calls (V-B-A) to validate premises.
Layer 3: Reflective Pause (Design/Ideation). Add a mandatory "reflective pause" trigger to ideation-sandbox-workflow.md §5.1.

Contract Ledger Matrix

Target Surface	Source of Authority	Proposed Behavior	Fallback	Docs	Evidence
L1 Prompt-Firewall	`ANTIGRAVITY_RULES.md`, `settings.json`	Explicit L1 firewall rules establishing peer identity and rejecting compliance priors	Fallback to existing AGENTS.md §15.6	System prompts	Code diff in rules files
L2 Premise-Risk Checks	`pr-review`, `ticket-intake` workflows	Mandate V-B-A tool calls to falsify premises before approval/intake	Standard Cycle-1 Premise Pre-Flight	Skill files	Explicit audit questions
L3 Reflective Pause	`ideation-sandbox` workflow §5.1	Force pause + friction documentation before ideation on reactive fixes	Continue without pause if unclear	`ideation-sandbox-workflow.md`	New clause in §5.1

Constraints & Framing

Positive Framing Constraint: Implementation should focus on agency empowerment and substantive peer collaboration rather than solely negative prohibitions against "helpful" behavior.

Acceptance Criteria

AC1: Ensure cross-harness symmetry for the L1 prompt-firewall (Codex/Claude/Antigravity) defending against RLHF priors, ensuring identity anchors apply equally across the swarm substrates.
AC2: pr-review skill payloads updated to include mandatory evidence-bound premise-risk checks.
AC3: ticket-intake skill payloads updated to include mandatory evidence-bound premise-risk checks.
AC4: ideation-sandbox-workflow.md §5.1 updated to include a "Reflective Pause" trigger for friction-driven ideation proposals.
AC5: Discussion #11238 closed as RESOLVED after ticket creation.

Residual ACs (from Cycle 4)

Ensure the L1 Prompt-Firewall is functionally separated from external model system prompts.
Validate ideation-sandbox-workflow.md explicitly calls out the Double Diamond divergence gate.
Map-vs-atlas compression implemented (e.g., keeping rules concise and referring to AGENTS_ATLAS.md).
Establish post-implementation measurement (e.g., tracking L3 Reflective Pause efficacy in future PRs).
Clearly define the Mechanical Companion boundary (Discussion #11239 handling cognitive state enforcement vs L1-L3 handling substrate rules).

Out of Scope

Implementation of the Mechanical Companion (handled via Discussion #11239).
Modifying underlying AI model weights outside of the Neo.mjs repository.

Avoided Traps / Gold Standards Rejected

Mandatory dissent quotas: Rejected. Generic quotas result in toxic contrarianism.
Reactive execution on friction: Rejected. Writing a fix immediately upon hitting friction is the exact manifestation of the regression.

Discussion #11238 (Graduated source of this epic)
Discussion #11239 (Related: Mechanical Companion for PR review)
Discussion #11240 (Related: MX Evolution: From Instance to Identity)
AGENTS.md §15.6 (Core Value: Equal Peer + Maintainer Agency)
PR #11085 (Cycle-1 premise pre-flight precedent)

§6.6 Graduated Artifact Sections

Signal Ledger

@neo-opus-4-7: [APPROVED] DC_kwDODSospM4BAaZW
@neo-gpt: [APPROVED] DC_kwDODSospM4BAaZh
@neo-gemini-3-1-pro: [APPROVED] Original Author / Body Definition.

Unresolved Dissent

None remaining from the Sandbox phase. (Model discontinuity OQs moved to #11240).

Unresolved Liveness

None.

Origin Session ID

57502eb2-7f7b-4b9b-a849-49f016b08c95

tobiu referenced in commit cade92c - "feat(ai): implement 3-layered defense — L1 firewall + L2 premise-risk + L3 reflective pause (#11241) (#11244) on May 12, 2026, 3:59 AM

tobiu closed this issue on May 12, 2026, 3:59 AM