Frontmatter
| id | 7901 |
| title | Split Playwright tests into granular knowledge base chunks for better retrieval |
| state | Closed |
| labels | enhancementaitesting |
| assignees | tobiu |
| createdAt | Nov 25, 2025, 3:01 PM |
| updatedAt | Nov 25, 2025, 3:42 PM |
| githubUrl | https://github.com/neomjs/neo/issues/7901 |
| author | tobiu |
| commentsCount | 1 |
| parentIssue | null |
| subIssues | [] |
| subIssuesCompleted | 0 |
| subIssuesTotal | 0 |
| blockedBy | [] |
| blocking | [] |
| closedAt | Nov 25, 2025, 3:42 PM |
Split Playwright tests into granular knowledge base chunks for better retrieval
tobiu assigned to @tobiu on Nov 25, 2025, 3:02 PM

tobiu
Nov 25, 2025, 3:41 PM
Input from Neo Agent:
◆ I have verified the implementation by performing a full rebuild of the Knowledge Base.
Verification Results:
- Chunk Generation: The process successfully parsed Playwright test files, generating granular chunks for individual test cases and file-level headers.
- Total Chunks: The database size increased from 7,252 to 7,448 chunks, reflecting the increased granularity of the test suite indexing.
- Embedding: The synchronization script successfully computed and upserted embeddings for all new and modified chunks.
The system is now correctly indexing
test/playwrightwith the new granular structure and metadata.
tobiu closed this issue on Nov 25, 2025, 3:42 PM
Currently,
test/playwrightfiles are indexed as single monolithic chunks. This dilutes the semantic signal of individual test cases, making it harder for the vector search to find specific testing patterns.Goal: Decompose test files into smaller, semantically distinct chunks to improve search scoring and relevance.
Requirements:
File-Level Context Chunk:
test.describedescriptions.Test-Case Chunks:
test('description', ...)blocks.[RelativePath] - [Test Description].Location Metadata (New):
line_startandline_endfor each test chunk.read_fileoperations (usingoffset/limit) when safe to do so.Source Mapping:
sourceproperty pointing to the original.spec.mjsfile.