Context
As part of the Shared Deployment MVP completeness gaps (post-#10691), we must ensure our abstraction layers work with real-world local models, specifically the Qwen3-1.8b embedding model.
The Problem
The current OpenAiCompatible abstraction exists but the specific path for the Qwen3-1.8b embedding model (at 4k dimensions) is unvalidated. We cannot confidently deploy the MVP for teams using this local model until it is empirically validated and properly documented.
The Architectural Reality
OpenAiCompatible.generate provides the abstraction.
- The
VectorService handles embeddings.
- Operator config paths need to clearly indicate how to swap the embedding provider.
MemoryCore.md §Healthcheck Response Shape defines per-server topology diagnostics.
The Fix
Empirically validate the OpenAiCompatible abstraction against the Qwen3-1.8b embedding model (4k dimensions). Document the operator config path. Surface healthcheck verification of the active embedding provider. Flag any interface gaps the abstraction reveals.
Contract Ledger Matrix
| Target Surface |
Source of Authority |
Proposed Behavior |
Fallback |
Docs |
Evidence |
OpenAiCompatible Embeddings |
VectorService / Config |
Successfully route and generate 4k-dimensional embeddings via Qwen3-1.8b. |
Surface explicit validation errors if dimensions mismatch or endpoint fails. |
Update learn/agentos/SharedDeployment.md |
L3 (Integration test / local validation with active provider in healthcheck) |
| Healthcheck Endpoint |
Memory Core MCP |
Include active embedding provider in healthcheck output. |
N/A |
Update MemoryCore.md |
L3 (Healthcheck response validation) |
Acceptance Criteria
Out of Scope
- Adding support for entirely new protocol formats outside OpenAI-compatible endpoints.
- End-to-end performance benchmarking under team load.
Avoided Traps
- Rejected: Creating a bespoke Qwen provider class. The
OpenAiCompatible abstraction is designed precisely to prevent provider proliferation. We must validate and harden the generic abstraction rather than writing custom wrappers.
Related
Origin Session ID: 79042442-bebc-431d-8968-8a2e7d7a1151
Retrieval Hint: query_raw_memories(query="Local embedding provider validation Qwen3 4k OpenAiCompatible")
Context
As part of the Shared Deployment MVP completeness gaps (post-#10691), we must ensure our abstraction layers work with real-world local models, specifically the Qwen3-1.8b embedding model.
The Problem
The current
OpenAiCompatibleabstraction exists but the specific path for the Qwen3-1.8b embedding model (at 4k dimensions) is unvalidated. We cannot confidently deploy the MVP for teams using this local model until it is empirically validated and properly documented.The Architectural Reality
OpenAiCompatible.generateprovides the abstraction.VectorServicehandles embeddings.MemoryCore.md§Healthcheck Response Shape defines per-server topology diagnostics.The Fix
Empirically validate the
OpenAiCompatibleabstraction against the Qwen3-1.8b embedding model (4k dimensions). Document the operator config path. Surface healthcheck verification of the active embedding provider. Flag any interface gaps the abstraction reveals.Contract Ledger Matrix
OpenAiCompatibleEmbeddingslearn/agentos/SharedDeployment.mdMemoryCore.mdAcceptance Criteria
OpenAiCompatible.SharedDeployment.md.Out of Scope
Avoided Traps
OpenAiCompatibleabstraction is designed precisely to prevent provider proliferation. We must validate and harden the generic abstraction rather than writing custom wrappers.Related
Origin Session ID: 79042442-bebc-431d-8968-8a2e7d7a1151 Retrieval Hint:
query_raw_memories(query="Local embedding provider validation Qwen3 4k OpenAiCompatible")