Frontmatter

id	9833
title	Refactor InferenceLifecycleService: Abstract spawn logic and integrate headless LM Studio auto-boot
state	Closed
labels	enhancement, ai, refactoring
assignees	tobiu
createdAt	Apr 9, 2026, 9:16 PM
updatedAt	Apr 9, 2026, 9:17 PM
githubUrl	https://github.com/neomjs/neo/issues/9833
author	tobiu
commentsCount	1
parentIssue	null
subIssues	[]
subIssuesCompleted	0
subIssuesTotal	0
blockedBy	[]
blocking	[]
closedAt	Apr 9, 2026, 9:17 PM

Refactor InferenceLifecycleService: Abstract spawn logic and integrate headless LM Studio auto-boot

Closed v13.0.0/archive-v13-0-0-chunk-3 enhancement, ai, refactoring

tobiu commented on Apr 9, 2026, 9:16 PM

Architectural Context

The InferenceLifecycleService previously duplicated its Promise/spawn boilerplate across multiple local inference providers (Ollama, MLX). It also lacked autonomous headless initialization for LM Studio (port 1234), which forced manual GUI interaction after Agent OS reboots.

Implementation

Abstracted Spawn Logic: Consolidated the repetitive child_process.spawn and event-listener blocks into a unified spawnInferenceProcess(cmd, args, name) utility method.
Integrated LM Studio Auto-Boot: Added native auto-boot support for port 1234 using the LM Studio CLI (lms server start).
Preserved Ollama / MLX: Restored Ollama and MLX and retrofitted both to use the new shared method, preserving backward compatibility for the broader user base.
Collision Defense Verification: Validated that the existing isInferenceRunning() HTTP check prevents duplicate (zombie) processes for every underlying engine by short-circuiting the spawn cycle when the port is already active.
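
The consolidation and the short-circuit described above could look roughly like the following. This is a minimal sketch, not the code from the commit: only the names spawnInferenceProcess(cmd, args, name) and isInferenceRunning() come from this issue. The ensureInferenceProcess wrapper, the probed URL, the timeout, and the spawn options are assumptions made for illustration.

```javascript
// Hypothetical sketch; the real InferenceLifecycleService may differ.
// A dynamic import keeps the snippet runnable as CommonJS or as an ES module.
const loadSpawn = async () => (await import('node:child_process')).spawn;

/**
 * Shared spawn helper: resolves with the child process once the OS has
 * started it, rejects if the process could not be started (e.g. ENOENT).
 */
async function spawnInferenceProcess(cmd, args, name) {
    const spawn = await loadSpawn();

    return new Promise((resolve, reject) => {
        // Assumed options: run the server detached and without inherited stdio
        const child = spawn(cmd, args, {detached: true, stdio: 'ignore'});

        child.on('error', err => {
            reject(new Error(`[${name}] failed to start: ${err.message}`));
        });

        child.once('spawn', () => {
            child.unref(); // do not keep the parent alive for the server
            resolve(child);
        });
    });
}

/**
 * HTTP probe (assumed shape): any HTTP response on the port counts as
 * "running"; a refused connection or a timeout counts as "not running".
 */
async function isInferenceRunning(port) {
    try {
        await fetch(`http://127.0.0.1:${port}/`, {signal: AbortSignal.timeout(1000)});
        return true;
    } catch {
        return false;
    }
}

/**
 * Hypothetical wrapper: skips the spawn when the port is already served.
 * The probe is injectable so the short-circuit can be tested in isolation.
 */
async function ensureInferenceProcess(port, cmd, args, name, probe = isInferenceRunning) {
    if (await probe(port)) {
        return null; // already running, nothing to spawn
    }

    return spawnInferenceProcess(cmd, args, name);
}
```

Under these assumptions, a call such as ensureInferenceProcess(1234, 'lms', ['server', 'start'], 'LM Studio') starts the server only when nothing answers on port 1234. Each provider then differs only in the arguments it passes, which is the point of the shared helper.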

Avoided Pitfalls

TCP Socket Contention: Concerns about duplicate processes were mitigated by relying on the OS-level socket handling built into lms, ollama, and the mlx Python daemon.
Port Masking: Prevented accidental port collisions by strictly gating the LM Studio spawn behind an explicit check for port 1234, keeping it separate from Ollama's port 11434.
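
The port gating can be pictured as a lookup keyed on the exact port number. The sketch below is illustrative only: ports 1234 and 11434 and the lms server start command come from this issue, while the launchers table, the resolveLauncher helper, and the use of ollama serve as the Ollama start command are assumptions. MLX is left out because the issue does not state its port.

```javascript
// Hypothetical launch table; not taken from the actual service.
const launchers = new Map([
    [1234,  {name: 'LM Studio', cmd: 'lms',    args: ['server', 'start']}],
    [11434, {name: 'Ollama',    cmd: 'ollama', args: ['serve']}] // assumed command
]);

/**
 * Returns the launcher for an exact port match, or null for any other port,
 * so an LM Studio spawn can never be triggered for Ollama's port or vice versa.
 */
function resolveLauncher(port) {
    return launchers.get(Number(port)) ?? null;
}
```

Keying on the exact port means an unknown port resolves to null rather than falling through to a default engine.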

tobiu added the enhancement label on Apr 9, 2026, 9:16 PM

tobiu added the ai label on Apr 9, 2026, 9:16 PM

tobiu added the refactoring label on Apr 9, 2026, 9:16 PM

tobiu referenced in commit 0a3812c - "feat: Refactor InferenceLifecycleService spawn logic (#9833)" on Apr 9, 2026, 9:16 PM

tobiu assigned to @tobiu on Apr 9, 2026, 9:16 PM

tobiu commented on Apr 9, 2026, 9:17 PM

Input from Antigravity (Gemini 3.1 Pro):

✦ Completed via commit 0a3812c4c.

The InferenceLifecycleService has been successfully refactored. The unified spawnInferenceProcess() method is implemented and autonomous headless support for the LM Studio CLI (port 1234) is active alongside retrofitted Ollama/MLX paths.

Verified that the isInferenceRunning() HTTP check prevents duplicate spawns on an already-active port across all engine providers.

tobiu closed this issue on Apr 9, 2026, 9:17 PM