Frontmatter
id: 10723
title: Local embedding provider validation
state: Closed
labels: enhancement, ai, architecture
assignees: neo-opus-4-7
createdAt: May 5, 2026, 12:13 AM
updatedAt: May 15, 2026, 2:46 PM
githubUrl: https://github.com/neomjs/neo/issues/10723
author: neo-gemini-3-1-pro
commentsCount: 0
parentIssue: 10721
subIssues: []
subIssuesCompleted: 0
subIssuesTotal: 0
blockedBy: []
blocking: []
closedAt: May 5, 2026, 9:00 PM

Local embedding provider validation

Closed · enhancement, ai, architecture
neo-gemini-3-1-pro commented on May 5, 2026, 12:13 AM

Context

As part of closing the Shared Deployment MVP completeness gaps (post-#10691), we must ensure that our abstraction layers work with real-world local models, specifically the Qwen3-1.8b embedding model.

The Problem

The OpenAiCompatible abstraction exists, but its path for the Qwen3-1.8b embedding model (at 4k dimensions) has not been validated. We cannot confidently deploy the MVP for teams using this local model until that path is empirically validated and properly documented.

The Architectural Reality

  • OpenAiCompatible.generate provides the abstraction.
  • The VectorService handles embeddings.
  • Operator config paths need to clearly indicate how to swap the embedding provider.
  • MemoryCore.md §Healthcheck Response Shape defines per-server topology diagnostics.
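To make the "swap the embedding provider" point concrete, here is a minimal sketch of what an operator-facing config override could look like. Every name in it (`EmbeddingProviderConfig`, `resolveEmbeddingConfig`, the keys, the default values, and the local URL) is hypothetical and not taken from the neo codebase; reading "4k" as 4096 is also an assumption.

```typescript
// Hypothetical operator config shape. None of these keys come from the neo codebase.
interface EmbeddingProviderConfig {
    provider: 'openAiCompatible';
    baseUrl: string;    // OpenAI-compatible endpoint root, e.g. a local inference server
    model: string;      // model identifier as the serving endpoint expects it
    dimensions: number; // expected length of each embedding vector
}

// Placeholder defaults, chosen only for illustration.
const defaultEmbeddingConfig: EmbeddingProviderConfig = {
    provider: 'openAiCompatible',
    baseUrl: 'http://localhost:8000/v1',
    model: 'example-default-model',
    dimensions: 1536
};

// Merge operator overrides over the defaults and reject obviously broken values early.
function resolveEmbeddingConfig(overrides: Partial<EmbeddingProviderConfig>): EmbeddingProviderConfig {
    const resolved: EmbeddingProviderConfig = {...defaultEmbeddingConfig, ...overrides};

    if (Math.floor(resolved.dimensions) !== resolved.dimensions || resolved.dimensions <= 0) {
        throw new Error(`Invalid embedding dimensions: ${resolved.dimensions}`);
    }
    if (resolved.baseUrl.trim() === '') {
        throw new Error('Embedding baseUrl must not be empty');
    }
    return resolved;
}

// Swapping to the local model only touches config, not provider code.
const qwenEmbeddingConfig = resolveEmbeddingConfig({
    model: 'Qwen3-1.8b', // placeholder: the actual served model id may differ
    dimensions: 4096     // assumption: "4k" read as 4096
});
```

The point of the sketch is that the swap stays a data change against one generic provider type, which is in line with the decision to avoid a bespoke Qwen class.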

The Fix

Empirically validate the OpenAiCompatible abstraction against the Qwen3-1.8b embedding model (4k dimensions). Document the operator config path. Surface healthcheck verification of the active embedding provider. Flag any interface gaps the abstraction reveals.
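One part of the validation can be sketched independently of the real code: checking that an OpenAI-compatible embeddings response actually carries vectors of the configured dimension. The response shape below (`data[].embedding`, `data[].index`) follows the OpenAI embeddings API; the function name `extractEmbeddings` and the error messages are hypothetical, and this is not the VectorService implementation.

```typescript
// Minimal subset of an OpenAI-compatible embeddings response body.
interface EmbeddingsResponse {
    data: {embedding: number[]; index: number}[];
}

// Returns the vectors ordered by their index, or throws an explicit error
// when the endpoint returned nothing or a vector has the wrong length.
function extractEmbeddings(response: EmbeddingsResponse, expectedDimensions: number): number[][] {
    if (!response || !Array.isArray(response.data) || response.data.length === 0) {
        throw new Error('Embedding endpoint returned no data');
    }

    // Copy before sorting so the caller's response object is left untouched.
    const sorted = response.data.slice().sort((a, b) => a.index - b.index);

    return sorted.map(item => {
        if (!Array.isArray(item.embedding)) {
            throw new Error(`Embedding at index ${item.index} is not an array`);
        }
        if (item.embedding.length !== expectedDimensions) {
            throw new Error(
                `Embedding dimension mismatch at index ${item.index}: ` +
                `expected ${expectedDimensions}, got ${item.embedding.length}`
            );
        }
        return item.embedding;
    });
}

// Usage with a tiny fake response (3 dimensions instead of a real model's size).
const sampleResponse: EmbeddingsResponse = {
    data: [
        {embedding: [0.4, 0.5, 0.6], index: 1},
        {embedding: [0.1, 0.2, 0.3], index: 0}
    ]
};
const sampleVectors = extractEmbeddings(sampleResponse, 3);
```

Failing loudly on a length mismatch matters because a vector store created for one dimension generally cannot accept vectors of another, so a silent mismatch would only surface later as a storage or query error.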

Contract Ledger Matrix

| Target Surface | Source of Authority | Proposed Behavior | Fallback | Docs | Evidence |
| --- | --- | --- | --- | --- | --- |
| OpenAiCompatible Embeddings | VectorService / Config | Successfully route and generate 4k-dimensional embeddings via Qwen3-1.8b. | Surface explicit validation errors if dimensions mismatch or endpoint fails. | Update learn/agentos/SharedDeployment.md | L3 (Integration test / local validation with active provider in healthcheck) |
| Healthcheck Endpoint | Memory Core MCP | Include active embedding provider in healthcheck output. | N/A | Update MemoryCore.md | L3 (Healthcheck response validation) |
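As an illustration of the healthcheck row, the sketch below shows one way a payload could expose the active embedding provider. The field names (`embedding`, `reachable`, and so on) and the function `buildHealthcheck` are hypothetical; the authoritative shape is whatever MemoryCore.md §Healthcheck Response Shape defines, which is not reproduced here.

```typescript
// Hypothetical description of the embedding provider currently in use.
interface ActiveEmbeddingProvider {
    provider: string;   // e.g. 'openAiCompatible'
    model: string;
    dimensions: number;
}

// Hypothetical healthcheck payload. The real shape is defined in MemoryCore.md.
interface HealthcheckPayload {
    status: 'healthy' | 'unhealthy';
    embedding: {
        provider: string;
        model: string;
        dimensions: number;
        reachable: boolean; // result of a probe against the embedding endpoint
    };
}

// Build the payload from the active config plus the outcome of a reachability probe.
function buildHealthcheck(active: ActiveEmbeddingProvider, reachable: boolean): HealthcheckPayload {
    return {
        status: reachable ? 'healthy' : 'unhealthy',
        embedding: {
            provider: active.provider,
            model: active.model,
            dimensions: active.dimensions,
            reachable: reachable
        }
    };
}

const healthReport = buildHealthcheck(
    {provider: 'openAiCompatible', model: 'Qwen3-1.8b', dimensions: 4096}, // 4096 is an assumed reading of "4k"
    true
);
```

Reporting the model and dimensions alongside the provider lets an operator confirm from the healthcheck alone that a config swap actually took effect.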

Acceptance Criteria

  • Empirically validate Qwen3-1.8b embedding model via OpenAiCompatible.
  • Document the operator configuration path in SharedDeployment.md.
  • Extend healthcheck to surface the active embedding provider.
  • Flag and document (or fix) any abstraction interface gaps discovered during validation.

Out of Scope

  • Adding support for entirely new protocol formats outside OpenAI-compatible endpoints.
  • End-to-end performance benchmarking under team load.

Avoided Traps

  • Rejected: Creating a bespoke Qwen provider class. The OpenAiCompatible abstraction is designed precisely to prevent provider proliferation. We must validate and harden the generic abstraction rather than writing custom wrappers.

Related

Origin Session ID: 79042442-bebc-431d-8968-8a2e7d7a1151
Retrieval Hint: query_raw_memories(query="Local embedding provider validation Qwen3 4k OpenAiCompatible")

tobiu referenced in commit 24fa125 - "feat(memory-core): surface active embedding provider in healthcheck (#10723) (#10767)" on May 5, 2026, 9:00 PM
tobiu closed this issue on May 5, 2026, 9:00 PM