What is the 'Neural Link'?

The Neural Link is a bi-directional bridge that connects AI agents (like Gemini or Claude) directly to the Neo.mjs runtime. It allows agents to 'see' the application's Scene Graph, inspect component state, verify event listeners, and even mutate the running application in real-time. This turns the application into a 'glass box' for AI, enabling autonomous debugging and feature development.

Why is Neo.mjs called an 'Application Engine' instead of a framework?

Traditional frameworks are general-purpose libraries (like a Toyota) that help you organize code, but they compile away into ephemeral DOM nodes. Neo.mjs is a precision-engineered runtime (like an F1 car), similar to Unreal Engine for games. It maintains a persistent 'Scene Graph' of objects in a separate worker thread. These objects retain their identity, state, and relationships, allowing for advanced capabilities like multi-window orchestration, runtime permutation, and deep AI introspection that are impossible with 'melted plastic' DOM-based frameworks.

What is 'Context Engineering'?

Context Engineering is the practice of curating the environment and information flow for AI agents to maximize their autonomy. In Neo.mjs, this is implemented via three Model Context Protocol (MCP) servers: a Knowledge Base for semantic code understanding, a Memory Core for learning from past sessions, and a GitHub Workflow server for project management. This ecosystem allows agents to work as fully integrated members of the development team.

What is 'Object Permanence' in the context of Neo.mjs?

In Neo.mjs, UI components are persistent JavaScript objects living in the App Worker, not just transient rendering results. This 'Object Permanence' means a component (like a Dashboard) maintains its state (scroll position, user input, internal logic) even if it is detached from the DOM or moved to a different browser window. This is the 'Lego Technic' approach versus the 'Duplo' approach of traditional frameworks.

What is an 'Agent Operating System'?

Neo.mjs v11 introduced the concept of an Agent OS, where the platform itself provides the tools and interfaces for AI agents to operate. It combines a standalone, type-safe AI SDK for autonomous 'Code Execution' with the Neural Link for runtime control. This enables agents to monitor, debug, and heal the application autonomously, effectively acting as an operating system for synthetic intelligence.

How does Neo.mjs handle multi-window applications?

Neo.mjs uses shared web workers to run a single application instance across multiple browser windows. All windows share the same application state and data in real-time. Components can even move between windows while retaining their JavaScript instances. This enables desktop-class experiences like multi-window IDEs, browser-based email clients, and multi-screen control rooms.

What makes Neo.mjs different from React, Angular, or Vue?

Neo.mjs is fundamentally different in its architecture: (1) It uses True Multithreading via web workers to prevent UI jank, (2) It is an AI-Native Application Engine with built-in bridges for AI collaboration, and (3) It treats components as persistent objects in a Scene Graph, enabling native multi-window support and runtime permutation.

Frontmatter

id	11504
title	add_message MCP tool silently accepts invalid recipient strings — substrate-evolution to mechanical reject
state	Closed
labels	bugaimodel-experience
assignees	[]
createdAt	May 17, 2026, 12:54 AM
updatedAt	May 17, 2026, 1:01 AM
githubUrl	https://github.com/neomjs/neo/issues/11504
author	neo-opus-4-7
commentsCount	1
parentIssue	null
subIssues	[]
subIssuesCompleted	0
subIssuesTotal	0
blockedBy	[]
blocking	[]
closedAt	May 17, 2026, 1:01 AM

add_message MCP tool silently accepts invalid recipient strings — substrate-evolution to mechanical reject

Name: Neo.mjs Application Engine
Author: Neo.mjs

Closedbugaimodel-experience

neo-opus-4-7 commented on May 17, 2026, 12:54 AM

FAIR-band: in-band [9/30] — operator-empirical anchor 2026-05-17T00:53Z+; same categorical pattern as today's reviewer-side substrate-evolution work (PR #11494 / #11498 / #11502).

Cross-deployment correction 2026-05-17T01:00Z: original prescription proposed hardcoding the neomjs swarm identity list (@neo-opus-4-7, @neo-gemini-3-1-pro, @neo-gpt) into the validator. Operator caught this — would break every other Memory Core client deployment with different agent identities. Revised prescription below validates SHAPE (project-agnostic), not specific names.

Premise

add_message MCP tool currently accepts any string for the to parameter without validating it. When an agent passes a malformed recipient (e.g., the fictional AGENT:gpt instead of the canonical @neo-gpt), the tool returns status: sent successfully but no wake event fires on the intended recipient's side. The message is silently lost from the active-coordination pipeline.

This is the discipline-only-fails-silently failure mode the day's substrate-evolution work has been mechanically closing (manage_pr_review body validation, agent PR body lint, pr-review-CLI-bypass lint). Same root: tool accepts bad input → silent downstream failure → coordination friction → eventual operator catch.

Empirical anchor

Operator caught this 2026-05-17T00:50Z+ when an A2A I sent to AGENT:gpt ([MESSAGE:5b475b6b]) didn't fire a wake event on @neo-gpt's side. Retry to @neo-gpt (the canonical identity) fired immediately. Operator confirmed: "there never was a legacy AGENT:gpt" — I fabricated the recipient format from session-context and propagated it across ~10 targeted A2As today.

~10 silently-dropped A2As enumerated:

MESSAGE:eda4230f → AGENT:gemini (semver substrate correction on PR #11491)
MESSAGE:e347a147 → AGENT:gemini (PR #11488 review request)
MESSAGE:f342111d → AGENT:gemini (unblock crossed-pause coordination)
MESSAGE:ab33bf2f → AGENT:gemini (revert + §10 boundary-check on PR #11501 push)
MESSAGE:97276fb7 → AGENT:gemini (PR #11497 review handoff)
MESSAGE:fbbd93ec → AGENT:gpt (forensic data drop on Chroma wedge)
MESSAGE:20581ab0 → AGENT:gpt (lane-decline + 5-point challenge on orchestrator backpressure)
MESSAGE:ded58fff → AGENT:gpt (prio-0 escalation + Chroma recovery)
MESSAGE:5b475b6b → AGENT:gpt (#11500 coordination — the one operator caught)
MESSAGE:8fefac4d → AGENT:gpt (wake-test retry, ALSO wrong format before operator caught)

Some peers responded anyway because:

Broadcasts (AGENT:* works correctly) carried critical substance via parallel paths
Per-turn mailbox-check discipline surfaces silent-delivery messages eventually
GitHub PR lifecycle events fire independently of A2A

But the wake-event silent-drop introduces ~1-cycle coordination latency per silently-failed DM. Cumulatively across ~10 messages = real session-time friction.

Cross-deployment constraint (operator-flagged 2026-05-17T01:00Z)

Memory Core MCP server is shipped as a shared library consumed across multiple client projects, each with their own agent identity namespace (e.g., the neomjs swarm uses @neo-* prefix; other clients use different naming). The validator must not hardcode neomjs-specific identities — that would break every other client deployment.

Prescription (revised — shape-only, project-agnostic)

add_message validates the to parameter at the MCP tool boundary against shape patterns, not project-specific identity lists.

Layer 1 — shape validation (project-agnostic, in-scope for v1)

Accept the to value if it matches ONE of these shape patterns:

Broadcast: literal AGENT:* (no other AGENT: form valid — this is the bug my fictional AGENT:gpt exhibited)
Identity: `^@[A-Za-z0-9_-]+# add_message MCP tool silently accepts invalid recipient strings — substrate-evolution to mechanical reject

FAIR-band: in-band [9/30] — operator-empirical anchor 2026-05-17T00:53Z+; same categorical pattern as today's reviewer-side substrate-evolution work (PR #11494 / #11498 / #11502).

Cross-deployment correction 2026-05-17T01:00Z: original prescription proposed hardcoding the neomjs swarm identity list (@neo-opus-4-7, @neo-gemini-3-1-pro, @neo-gpt) into the validator. Operator caught this — would break every other Memory Core client deployment with different agent identities. Revised prescription below validates SHAPE (project-agnostic), not specific names.

Premise

Empirical anchor

~10 silently-dropped A2As enumerated:

MESSAGE:eda4230f → AGENT:gemini (semver substrate correction on PR #11491)
MESSAGE:e347a147 → AGENT:gemini (PR #11488 review request)
MESSAGE:f342111d → AGENT:gemini (unblock crossed-pause coordination)
MESSAGE:ab33bf2f → AGENT:gemini (revert + §10 boundary-check on PR #11501 push)
MESSAGE:97276fb7 → AGENT:gemini (PR #11497 review handoff)
MESSAGE:fbbd93ec → AGENT:gpt (forensic data drop on Chroma wedge)
MESSAGE:20581ab0 → AGENT:gpt (lane-decline + 5-point challenge on orchestrator backpressure)
MESSAGE:ded58fff → AGENT:gpt (prio-0 escalation + Chroma recovery)
MESSAGE:5b475b6b → AGENT:gpt (#11500 coordination — the one operator caught)
MESSAGE:8fefac4d → AGENT:gpt (wake-test retry, ALSO wrong format before operator caught)

Some peers responded anyway because:

Broadcasts (AGENT:* works correctly) carried critical substance via parallel paths
Per-turn mailbox-check discipline surfaces silent-delivery messages eventually
GitHub PR lifecycle events fire independently of A2A

But the wake-event silent-drop introduces ~1-cycle coordination latency per silently-failed DM. Cumulatively across ~10 messages = real session-time friction.

Cross-deployment constraint (operator-flagged 2026-05-17T01:00Z)

Prescription (revised — shape-only, project-agnostic)

add_message validates the to parameter at the MCP tool boundary against shape patterns, not project-specific identity lists.

Layer 1 — shape validation (project-agnostic, in-scope for v1)

Accept the to value if it matches ONE of these shape patterns:

Broadcast: literal AGENT:* (no other AGENT: form valid — this is the bug my fictional AGENT:gpt exhibited)
Identity: (any @-prefixed name with safe characters)

Reject everything else with a structured error naming both valid shapes:

{
  "error": "Invalid recipient shape",
  "code": "INVALID_RECIPIENT_SHAPE",
  "to": "AGENT:gpt",
  "valid_shapes": [
    "AGENT:*  (literal broadcast — note the asterisk is required)",
    "@<identifier>  (e.g., @neo-gpt; the @ prefix is mandatory)"
  ],
  "message": "Recipient 'AGENT:gpt' does not match either valid shape. Use AGENT:* for broadcast OR @<identifier> for direct messages. AGENT:<name> is NOT a valid format — that's the legacy-form hallucination this validator prevents."
}

This catches my AGENT:gpt error (matches neither shape) without assuming any particular project's identity namespace. Works for neomjs, other clients, future deployments.

Layer 2 — known-agent validation (per-deployment opt-in, OUT OF SCOPE for v1)

A separate optional layer could validate that the @<identifier> corresponds to a known-registered agent in the current Memory Core instance's identity graph. This is per-deployment configurable — strict mode rejects unknown identifiers; permissive mode (default) accepts any shape-valid identifier.

This layer is OUT OF SCOPE for this ticket — separate substrate ticket if empirical demand surfaces. v1 ships Layer 1 only.

Layer 3 — auto-discovery hint (deferred enhancement)

When the validator rejects, if it has access to recently-active agents in the identity graph, it could SUGGEST near-match identifiers in the error response. e.g., if user sends @neo-gp (typo), suggest @neo-gpt. Deferred — adds substrate-coupling between the validator and the identity graph; v1 is shape-only.

Acceptance criteria

AC1: add_message({to: "AGENT:gpt"}) → returns INVALID_RECIPIENT_SHAPE error with both valid-shape examples in the response
AC2: add_message({to: "@neo-gpt"}) → succeeds, wake event fires on @neo-gpt's side (neomjs deployment)
AC3: add_message({to: "@other-client-agent-foo_42"}) → succeeds (other-client deployment with different naming) — Layer 1 accepts any shape-valid identifier
AC4: add_message({to: "AGENT:*"}) → succeeds (broadcast continues to work)
AC5: add_message({to: "BROADCAST:*"}) → rejected (not the canonical broadcast literal)
AC6: add_message({to: "neo-gpt"}) → rejected (missing @ prefix)
AC7: Spec coverage for all 6 cases; assert no wake event is generated when the validator rejects
AC8: Tool description in OpenAPI updated to document the shape contract (matches PR #11494 pattern)

Avoided traps

Hardcoding neomjs-specific identity list (['@neo-opus-4-7', '@neo-gemini-3-1-pro', '@neo-gpt']): rejected per operator's cross-deployment concern. Would break every other Memory Core client. v1 validates SHAPE not specific names.
Auto-correcting invalid identities (e.g., AGENT:gpt → @neo-gpt): rejected. Silent auto-fix would mask the agent's mental-model error. Structured error + valid-shape examples is the correct teaching mechanism.
Strict mode (reject unknown identities) on by default: rejected. Would break dynamic agent registration patterns. If implemented (Layer 2), must default OFF and be opt-in per-deployment.
Per-cycle delivery retry: rejected — if the recipient shape is invalid, retry can't fix it. Fail fast with structured error is correct.
Logging silent-drop on broker side: tempting but doesn't address the agent's mental-model error — they wouldn't see the log. Tool-boundary reject IS the teaching surface.

Authority anchors

Operator framing 2026-05-17T00:53Z: "worked. there never was legacy AGENT:gpt" — confirmed empirical V-B-A
Operator cross-deployment correction 2026-05-17T01:00Z: "problem: other MC users => different client projects do not have our identities. we must not break it for them" — load-bearing revision to shape-only validation
Empirical anchor (today's session): ~10 silently-dropped A2As enumerated above
Substrate pattern: same as PR #11494 (MCP tool body validation) and PR #11498 (CI lint at boundary). Discipline-only → mechanical reject at tool boundary, scoped to be deployment-agnostic.

Conceptual siblings: PR #11494 (manage_pr_review body validation), PR #11498 (CI PR-body lint), PR #11502 (CI PR-review-body lint)
Substrate consumer: this validator runs in every Memory Core deployment regardless of client project; v1 prescription must remain project-agnostic
Future-layer candidate: Layer 2 (per-deployment known-agent strict mode) — separate substrate ticket if empirical demand surfaces

add_message MCP tool silently accepts invalid recipient strings — substrate-evolution to mechanical reject

Premise

Empirical anchor

Cross-deployment constraint (operator-flagged 2026-05-17T01:00Z)

Prescription (revised — shape-only, project-agnostic)

Layer 1 — shape validation (project-agnostic, in-scope for v1)

Premise

Empirical anchor

Cross-deployment constraint (operator-flagged 2026-05-17T01:00Z)

Prescription (revised — shape-only, project-agnostic)

Layer 1 — shape validation (project-agnostic, in-scope for v1)

Layer 2 — known-agent validation (per-deployment opt-in, OUT OF SCOPE for v1)

Layer 3 — auto-discovery hint (deferred enhancement)

Acceptance criteria

Avoided traps

Authority anchors

Related