Context
Phase 3 sub-ticket of Epic #11624 (Cloud-Native KB Ingestion for External Workspaces). Graduated from Discussion #11623. Blocked-by Phase 2 #11626 — guide content depends on stable Phase 0/1 contracts AND Phase 2 service+facade shape (operator and ingestion-flow examples need real endpoint names + payload shapes to be substrate-correct).
This phase ships the operator-facing substrate for self-service cloud Agent OS deployments: a guide tree explaining how clients configure KB, write custom sources/parsers, wire git hooks, and reason about tenant boundaries.
The Problem
After Phase 0/1 + Phase 2 land, the system is technically functional for cloud deployments — but cloud operators have no documented onboarding path. Per Discussion #11623 §9 + operator framing:
"a graduation goal for the sandbox epic must include a new guide, which explains how clients can configure KB and their deployment pipelines."
Without the guide:
- Cloud deployment teams must reverse-engineer the contracts from source code
- Custom-source/custom-parser authoring becomes word-of-mouth knowledge
- Hook-wiring + tenant-boundary semantics get re-derived per deployment
- Security boundaries (operator-installed/Neo-shipped/signed-package for server-side parsers; client-side for everything else) lack a canonical reference
The Architectural Reality
This phase creates a new guide tree:
| Path |
Content |
NEW learn/agentos/cloud-deployment/Overview.md |
What cloud-deployed Agent OS is; how ingestion topology works; the contract-split + facade pattern; relationship to Neo's curated content |
NEW learn/agentos/cloud-deployment/Configuration.md |
useDefaultSources / useDefaultParsers / customSources / customParsers / tenant config storage shape |
NEW learn/agentos/cloud-deployment/CustomSources.md |
Authoring a custom Source class (extending Base.extract), registering it, path conventions, identity-tuple semantics |
NEW learn/agentos/cloud-deployment/CustomParsers.md |
Authoring a custom Parser (server-side OR client-side), parsed-chunk-v1 contract, parser-protocol JSON Schema usage, non-JS parser distribution patterns |
NEW learn/agentos/cloud-deployment/HookWiring.md |
Git hook examples (pre-push, post-commit); ingestSourceFiles MCP tool usage; bulk-facade for large initial imports; batch sizing; tombstone payload shape; revision-boundary reconciliation |
NEW learn/agentos/cloud-deployment/Security.md |
Parser-execution boundaries; tenant isolation invariants; auth flow; untrusted-code policy; write-side stamping + spoof-rejection invariants; KB-as-cache vs MC-as-store data-recovery framing (KB wipes are always recoverable from external sources via tenant re-push or npm run ai:sync-kb; MC wipes have an amnesia window minimized by daily backup daemon — plan retention accordingly per Phase 4 Epic #11628 framing) |
NEW learn/agentos/cloud-deployment/MigrationPath.md |
How existing Neo deployments upgrade to v13 with zero-config (default inheritance); breaking changes; deprecation timeline |
NEW examples/cloud-deployment/minimal-external-workspace/ |
Working example external workspace with one custom source + one custom parser |
NEW examples/cloud-deployment/pre-push-hook.sh |
Minimal shell pre-push hook demonstrating ingestSourceFiles contract |
The Fix
1. Guide Tree Authoring
Each *.md document follows existing learn/agentos/ discipline:
- Anchor & Echo discipline (per AGENTS.md): JSDoc
@summary cross-links to relevant source files
- Cross-links to #9999, #10010, #10011, #10030 for retrieval-side / identity / RLS / concept-ontology substrate
- Cross-links to Phase 0/1 (#11625) + Phase 2 (#TBD) schemas + service contracts
- Capability framing only (no commercial-customer references; per Neo's external-positioning discipline)
2. Worked Example
examples/cloud-deployment/minimal-external-workspace/:
- Minimal
package.json (Neo as dependency)
- One source file in a hypothetical language (e.g.,
.proto for protobuf — common cross-language demo)
- One custom Source class registered via the Phase 0/1 registry API
- One custom Parser emitting
parsed-chunk-v1 records
- README explaining how to run the example end-to-end
3. Hook Example
examples/cloud-deployment/pre-push-hook.sh:
- Bash
pre-push hook
- Computes
git diff against last-pushed SHA
- Calls
ingestSourceFiles via the bulk-facade CLI (or MCP tool depending on batch size)
- Handles delete tombstones + revision-boundary payload
- Comments explain extension points for client customization
4. Validation
- All referenced files in guide examples actually exist
- All code examples are runnable (CI lint or manual smoke test)
- All cross-references resolve (no dead
#NNNN or [link] references)
- Guide tree integrates into
learn/tree.json discovery
Acceptance Criteria
Out of Scope
- Generated API docs from inline JSDoc (handled by existing
docs/ build pipeline)
- Per-client deployment runbooks (those belong to client-side documentation, not Neo substrate)
- Marketing or commercial-positioning material (capability framing only — neutral technical reference)
- WASM/tree-sitter server-side parser sandboxing patterns → future Discussion when feature exists
Avoided Traps
| Trap |
Why rejected |
| Writing the guide before Phase 2 stabilizes |
Endpoint names / payload shapes can drift; guide examples become stale before they ship |
| Skipping the worked example |
Configuration docs without runnable examples are unverifiable; future agents/operators re-derive |
Skipping learn/tree.json integration |
Guide invisible to KB sync → invisible to agents using ask_knowledge_base for cloud-deployment queries |
| Generic / aspirational language |
Capability framing requires precision; vague "supports custom parsers" loses to "register parser ID + config via aiConfig.knowledgeBase.customParsers" |
| Marketing tone |
Neo's guide voice is reference-technical, not commercial-promotional |
Related
- Parent Epic: #11624
- Blocked-by: Phase 2 #11626 (service + facade shape must stabilize) — itself blocked-by Phase 0/1 [#11625]
- Origin Discussion: #11623 (§9 Guide Deliverable specifies the tree)
- Cross-substrate guide siblings:
learn/agentos/KnowledgeBase.md, learn/agentos/tooling/MultiTenantMigrationGuide.md, learn/agentos/CodeExecution.md, learn/agentos/SharedDeployment.md
Origin Session ID
7360e917-1733-4cdd-a6f3-5ac51c34b838
Handoff Retrieval Hints
query_raw_memories({query: 'cloud deployment guide custom sources parsers hook wiring'})
ask_knowledge_base({query: 'learn agentos guide tree structure new substrate', type: 'guide'})
- Discussion #11623 §9 is the authoritative scope-source
- Phase 2 (#TBD) MUST be merged before drafting
HookWiring.md + worked example (depends on real endpoint names + payload shapes)
- Sections that CAN be drafted post-Phase-0/1, pre-Phase-2 (low-risk):
Overview.md (architectural framing), Security.md (sandboxing + write-side stamping invariants), MigrationPath.md (zero-config upgrade story). Saves wall-clock time if Phase 3 starts in parallel once Phase 0/1 merges.
Context
Phase 3 sub-ticket of Epic #11624 (Cloud-Native KB Ingestion for External Workspaces). Graduated from Discussion #11623. Blocked-by Phase 2 #11626 — guide content depends on stable Phase 0/1 contracts AND Phase 2 service+facade shape (operator and ingestion-flow examples need real endpoint names + payload shapes to be substrate-correct).
This phase ships the operator-facing substrate for self-service cloud Agent OS deployments: a guide tree explaining how clients configure KB, write custom sources/parsers, wire git hooks, and reason about tenant boundaries.
The Problem
After Phase 0/1 + Phase 2 land, the system is technically functional for cloud deployments — but cloud operators have no documented onboarding path. Per Discussion #11623 §9 + operator framing:
Without the guide:
The Architectural Reality
This phase creates a new guide tree:
learn/agentos/cloud-deployment/Overview.mdlearn/agentos/cloud-deployment/Configuration.mduseDefaultSources/useDefaultParsers/customSources/customParsers/ tenant config storage shapelearn/agentos/cloud-deployment/CustomSources.mdBase.extract), registering it, path conventions, identity-tuple semanticslearn/agentos/cloud-deployment/CustomParsers.mdparsed-chunk-v1contract, parser-protocol JSON Schema usage, non-JS parser distribution patternslearn/agentos/cloud-deployment/HookWiring.mdingestSourceFilesMCP tool usage; bulk-facade for large initial imports; batch sizing; tombstone payload shape; revision-boundary reconciliationlearn/agentos/cloud-deployment/Security.mdnpm run ai:sync-kb; MC wipes have an amnesia window minimized by daily backup daemon — plan retention accordingly per Phase 4 Epic #11628 framing)learn/agentos/cloud-deployment/MigrationPath.mdexamples/cloud-deployment/minimal-external-workspace/examples/cloud-deployment/pre-push-hook.shpre-pushhook demonstratingingestSourceFilescontractThe Fix
1. Guide Tree Authoring
Each
*.mddocument follows existinglearn/agentos/discipline:@summarycross-links to relevant source files2. Worked Example
examples/cloud-deployment/minimal-external-workspace/:package.json(Neo as dependency).protofor protobuf — common cross-language demo)parsed-chunk-v1records3. Hook Example
examples/cloud-deployment/pre-push-hook.sh:pre-pushhookgit diffagainst last-pushed SHAingestSourceFilesvia the bulk-facade CLI (or MCP tool depending on batch size)4. Validation
#NNNNor[link]references)learn/tree.jsondiscoveryAcceptance Criteria
learn/agentos/cloud-deployment/Overview.mdwritten and linked inlearn/tree.jsonlearn/agentos/cloud-deployment/Configuration.mdwritten; documents all configs introduced in Phase 0/1 + Phase 2learn/agentos/cloud-deployment/CustomSources.mdwritten; includes worked-example referencelearn/agentos/cloud-deployment/CustomParsers.mdwritten; covers server-side AND client-side parser authoringlearn/agentos/cloud-deployment/HookWiring.mdwritten; includes shell-hook example referencelearn/agentos/cloud-deployment/Security.mdwritten; codifies write-side stamping + spoof-rejection + parser sandboxing boundaries + KB-as-cache vs MC-as-store data-recovery framing (cross-link to Phase 4 Epic #11628 retention-policy framing)learn/agentos/cloud-deployment/MigrationPath.mdwritten; documents zero-config upgrade path for existing Neo deploymentsexamples/cloud-deployment/minimal-external-workspace/exists and runs end-to-endexamples/cloud-deployment/pre-push-hook.shexists and demonstrates the contractmanage_knowledge_base({action: 'sync'})—learn/agentos/cloud-deployment/*.mdchunked + retrievableOut of Scope
docs/build pipeline)Avoided Traps
learn/tree.jsonintegrationask_knowledge_basefor cloud-deployment queriesaiConfig.knowledgeBase.customParsers"Related
learn/agentos/KnowledgeBase.md,learn/agentos/tooling/MultiTenantMigrationGuide.md,learn/agentos/CodeExecution.md,learn/agentos/SharedDeployment.mdOrigin Session ID
7360e917-1733-4cdd-a6f3-5ac51c34b838Handoff Retrieval Hints
query_raw_memories({query: 'cloud deployment guide custom sources parsers hook wiring'})ask_knowledge_base({query: 'learn agentos guide tree structure new substrate', type: 'guide'})HookWiring.md+ worked example (depends on real endpoint names + payload shapes)Overview.md(architectural framing),Security.md(sandboxing + write-side stamping invariants),MigrationPath.md(zero-config upgrade story). Saves wall-clock time if Phase 3 starts in parallel once Phase 0/1 merges.