LearnNewsExamplesServices
Frontmatter
id10724
titleLocal chat-API summary provider validation
stateClosed
labels
enhancementaiarchitecture
assigneesneo-gpt
createdAtMay 5, 2026, 12:13 AM
updatedAtMay 15, 2026, 2:46 PM
githubUrlhttps://github.com/neomjs/neo/issues/10724
authorneo-gemini-3-1-pro
commentsCount1
parentIssue10721
subIssues[]
subIssuesCompleted0
subIssuesTotal0
blockedBy[]
blocking[]
closedAtMay 5, 2026, 9:11 PM

Local chat-API summary provider validation

Closedenhancementaiarchitecture
neo-gemini-3-1-pro
neo-gemini-3-1-pro commented on May 5, 2026, 12:13 AM

Context

Following the completion of substrate primitives (#10691) for the Shared Deployment MVP, we need to validate the summarization provider paths for local models. Currently, Gemini 2.5 Flash is the default.

The Problem

The deployment target requires using local chat-API summarization, specifically the Qwen3-8b chat-API path. Without validating this path and documenting the configuration, the MVP is incomplete for operators constrained to local models.

The Architectural Reality

Summarization relies on OpenAiCompatible.generate or similar abstractions. The active provider configuration must be transparent to operators and verifiable at runtime via the healthcheck endpoint.

The Fix

Empirically validate the Qwen3-8b chat-API path for summarization. Document the operator configuration path for swapping out the default Gemini provider for Qwen3-8b. Surface the active summarization provider in the server healthcheck.

Contract Ledger Matrix

Target Surface Source of Authority Proposed Behavior Fallback Docs Evidence
Chat-API Summarization SummarizationJob Successfully execute session summaries using Qwen3-8b via local chat-API. Log summarization failure and retain raw memories. Update learn/agentos/SharedDeployment.md L3 (Local validation with active provider in healthcheck)
Healthcheck Endpoint Memory Core MCP Include active summarization provider in healthcheck output. N/A Update MemoryCore.md L3 (Healthcheck response validation)

Acceptance Criteria

  • Validate Qwen3-8b chat-API path for session summarization.
  • Document operator config path in SharedDeployment.md.
  • Surface active summarization provider in Memory Core healthcheck.
  • Ensure summary generation maintains expected structural integrity.

Out of Scope

  • Complex prompt engineering or fine-tuning for Qwen3-8b (assume standard prompt works or requires minimal tuning).
  • End-to-end performance benchmarking.

Avoided Traps

  • Rejected: Hardcoding provider logic. Must rely on configuration-driven provider selection to ensure the substrate remains model-agnostic.

Related

Origin Session ID: 79042442-bebc-431d-8968-8a2e7d7a1151 Retrieval Hint: query_raw_memories(query="Local chat-API summary provider validation Qwen3-8b Gemini")

tobiu referenced in commit 8dc6cdf - "feat(memory-core): add local chat summary provider (#10724) (#10771) on May 5, 2026, 9:11 PM
tobiu closed this issue on May 5, 2026, 9:11 PM
tobiu referenced in commit 5e320f6 - "docs(agentos): polish cookbook Section 7 healthcheck JSON sample (#10800) (#10811) on May 6, 2026, 11:48 AM