What is the Neural Link?

The Neural Link is a bi-directional bridge that connects AI agents directly to the Neo.mjs runtime. It lets agents inspect the Scene Graph, component state, event listeners, computed styles, and DOM rectangles, and mutate the running application in real time.

Why is Neo.mjs called an Application Engine instead of a framework?

Neo.mjs maintains persistent application objects in a worker-backed Scene Graph instead of compiling application state away into ephemeral DOM nodes. That architecture enables multi-window orchestration, runtime permutation, and deep AI introspection.

What is Context Engineering?

Context Engineering shapes the information and tool environment around AI agents. Neo.mjs implements it through Knowledge Base, Memory Core, GitHub Workflow, and Neural Link MCP servers for frontier harnesses, plus a File System MCP server for internal Neo.ai.Agent local loops.

What is the Neo.mjs Agent OS?

The Neo.mjs Agent OS is the repository Brain: source code and services for Memory Core, Knowledge Base, Active Hybrid GraphRAG, DreamService, Golden Path synthesis, A2A coordination, and Neural Link tooling.

Frontmatter

id	10021
title	perf(ai): Eliminate Map-Reduce chunking bottleneck in local inference
state	Closed
labels	enhancement, ai
assignees	tobiu
createdAt	Apr 15, 2026, 10:35 AM
updatedAt	Apr 15, 2026, 10:49 AM
githubUrl	https://github.com/neomjs/neo/issues/10021
author	tobiu
commentsCount	0
parentIssue	null
subIssues	[]
subIssuesCompleted	0
subIssuesTotal	0
blockedBy	[]
blocking	[]
closedAt	Apr 15, 2026, 10:49 AM

perf(ai): Eliminate Map-Reduce chunking bottleneck in local inference

Closed v13.0.0/archive-v13-0-0-chunk-4 enhancement, ai

tobiu commented on Apr 15, 2026, 10:35 AM

Description

The legacy Map-Reduce chunking inside the SessionService summarization pipeline caused a severe latency bottleneck (30 to 60 minutes to summarize 17 sessions) during local inference on M5 processors using Gemma4.

This issue tracks the architectural overhaul to ingest the full session context natively as a single payload, bypassing the multiple auto-regressive generation queues entirely.
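
To illustrate why chunking multiplies latency, here is a minimal sketch that contrasts the two shapes by counting model calls. Every name in it (`summarizeMapReduce`, `summarizeFullContext`, `createCountingModel`, the `generate` callback, the chunk size) is hypothetical and not taken from the Neo.mjs source; the stub model returns instantly, so only the call counts are meaningful.

```javascript
// Hypothetical sketch: these function names are illustrative and are not
// taken from the Neo.mjs source.

// Legacy shape: split the transcript, summarize each chunk in turn (map),
// then summarize the joined partial summaries (reduce).
// N chunks cost N + 1 sequential model calls.
async function summarizeMapReduce(transcript, generate, chunkSize) {
    const partials = [];

    for (let i = 0; i < transcript.length; i += chunkSize) {
        // Each await blocks until the previous generation has finished.
        partials.push(await generate(transcript.slice(i, i + chunkSize)));
    }

    return generate(partials.join('\n'));
}

// Target shape: pass the whole transcript to the model in a single call.
async function summarizeFullContext(transcript, generate) {
    return generate(transcript);
}

// Stub standing in for a local model; it only counts invocations.
function createCountingModel() {
    let calls = 0;

    return {
        generate: async text => {
            calls++;
            return `summary of ${text.length} chars`;
        },
        getCalls: () => calls
    };
}

async function compareCallCounts() {
    const transcript = 'x'.repeat(1000);
    const legacy     = createCountingModel();
    const direct     = createCountingModel();

    await summarizeMapReduce(transcript, legacy.generate, 250);
    await summarizeFullContext(transcript, direct.generate);

    return {mapReduce: legacy.getCalls(), fullContext: direct.getCalls()};
}

compareCallCounts().then(counts => console.log(counts));
// logs { mapReduce: 5, fullContext: 1 }
```

With a real local model, each of those calls is a full auto-regressive generation, so the map-reduce path pays that cost once per chunk plus once more for the reduce step.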

Goals

Rip out the iterative map-reduce chunking loop in SessionService.summarizeSession.
Remove the hard memory truncation inside DreamService to enforce lossless extraction.
Create an end-to-end API performance benchmark asserting single-session summarization completes in under 20s.
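
The benchmark goal could be approached along these lines. This is a sketch under stated assumptions, not the benchmark that was shipped: `assertCompletesWithin` and `fakeSummarizeSession` are hypothetical names, and the stub only simulates a delay where a real test would issue an end-to-end summarization request.

```javascript
// Hypothetical sketch: the helper names and the stub summarizer are assumptions.

const BUDGET_MS = 20000; // the "under 20s" target from the goals above

// Runs an async task and rejects if it takes budgetMs or longer.
async function assertCompletesWithin(task, budgetMs) {
    const start   = Date.now();
    const result  = await task();
    const elapsed = Date.now() - start;

    if (elapsed >= budgetMs) {
        throw new Error(`took ${elapsed} ms, budget is ${budgetMs} ms`);
    }

    return {result, elapsed};
}

// Stand-in for a real end-to-end summarization request.
function fakeSummarizeSession(delayMs) {
    return new Promise(resolve => setTimeout(() => resolve('summary'), delayMs));
}

assertCompletesWithin(() => fakeSummarizeSession(10), BUDGET_MS)
    .then(({elapsed}) => console.log(`ok, ${elapsed} ms`));
```

Measuring wall-clock time around the whole request, rather than around the model call alone, keeps the assertion end-to-end: any reintroduced chunking or queueing would show up in the elapsed time.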

tobiu added the enhancement label on Apr 15, 2026, 10:35 AM

tobiu added the ai label on Apr 15, 2026, 10:35 AM

tobiu cross-referenced by PR #10019 on Apr 15, 2026, 10:35 AM

tobiu assigned to @tobiu on Apr 15, 2026, 10:36 AM

tobiu closed this issue on Apr 15, 2026, 10:49 AM