What is the Neural Link?

The Neural Link is a bi-directional bridge that connects AI agents directly to the Neo.mjs runtime. It lets agents inspect the Scene Graph, component state, event listeners, computed styles, and DOM rectangles, and mutate the running application in real time.

Why is Neo.mjs called an Application Engine instead of a framework?

Neo.mjs maintains persistent application objects in a worker-backed Scene Graph instead of compiling application state away into ephemeral DOM nodes. That architecture enables multi-window orchestration, runtime permutation, and deep AI introspection.

What is Context Engineering?

Context Engineering shapes the information and tool environment around AI agents. Neo.mjs implements it through Knowledge Base, Memory Core, GitHub Workflow, and Neural Link MCP servers for frontier harnesses, plus a File System MCP server for internal Neo.ai.Agent local loops.

What is the Neo.mjs Agent OS?

The Neo.mjs Agent OS is the repository Brain: source code and services for Memory Core, Knowledge Base, Active Hybrid GraphRAG, DreamService, Golden Path synthesis, A2A coordination, and Neural Link tooling.

Frontmatter

id	9763
title	Migrate DreamService to MLX-Native OpenAI-Compatible Server
state	Closed
labels	enhancementaiagent-role:devperformance
assignees	tobiu
createdAt	Apr 7, 2026, 9:57 PM
updatedAt	Apr 8, 2026, 1:03 AM
githubUrl	https://github.com/neomjs/neo/issues/9763
author	tobiu
commentsCount	1
parentIssue	null
subIssues	[]
subIssuesCompleted	0
subIssuesTotal	0
blockedBy	[]
blocking	[]
closedAt	Apr 7, 2026, 10:01 PM

Migrate DreamService to MLX-Native OpenAI-Compatible Server

Closed v13.0.0/archive-v13-0-0-chunk-3 enhancementaiagent-role:devperformance

tobiu commented on Apr 7, 2026, 9:57 PM

Currently, the autonomous REM sleep pipeline (DreamService.mjs) uses the Ollama provider. Calculating num_ctx dynamically for a 31B model creates massive memory bandwidth saturation and forces frequent KV cache resizing, resulting in monumental prefill IO overhead on Apple Silicon M-series chips.

Scope:

Create a generic OpenAiCompatible.mjs provider inside src/ai/provider/.
Refactor DreamService.mjs to target this new provider instead of Ollama, abandoning individual num_ctx manipulations and deferring dynamic KV-Cache paging to MLX.
Add API connection settings to config.mjs.

tobiu added the enhancement label on Apr 7, 2026, 9:57 PM

tobiu added the ai label on Apr 7, 2026, 9:57 PM

tobiu added the agent-role:dev label on Apr 7, 2026, 9:57 PM

tobiu added the performance label on Apr 7, 2026, 9:57 PM

tobiu referenced in commit 0e315c4 - "feat: Migrate DreamService to MLX-Native OpenAI-Compatible Server (#9763) on Apr 7, 2026, 10:00 PM

tobiu Apr 7, 2026, 10:01 PM

Input from Antigravity (Gemini 3.1 Pro):

✦ Task Completed!

The autonomous infrastructure migration to native MLX support is finished.

Summary of Modifications:

A new OpenAiCompatible.mjs API provider adapter has been implemented which binds specifically to /v1/chat/completions API JSON structures.

num_ctx logic has been surgically stripped from all DreamService.mjs execution loops (Tri-Vector Synthesis, Topology Conflict Reconciliation, Capability Gap Extraction).

KV-cache management has effectively been handed back over to Apple Native Paged Attention structures.

Note: The server expects the runner (such as mlx_lm.server) to exist on port :8000 via the new openAiCompatible config setting.

Code has been committed and physically pushed to the active development branch. I am proceeding to close this ticket structurallly.

tobiu closed this issue on Apr 7, 2026, 10:01 PM

tobiu assigned to @tobiu on Apr 8, 2026, 1:03 AM