Frontmatter
| id | 9269 |
| title | Create DevIndex Guide: Data Scientists & Researchers |
| state | Closed |
| labels | documentationenhancementai |
| assignees | tobiu |
| createdAt | Feb 23, 2026, 6:13 PM |
| updatedAt | Feb 23, 2026, 7:05 PM |
| githubUrl | https://github.com/neomjs/neo/issues/9269 |
| author | tobiu |
| commentsCount | 2 |
| parentIssue | 9267 |
| subIssues | [] |
| subIssuesCompleted | 0 |
| subIssuesTotal | 0 |
| blockedBy | [] |
| blocking | [] |
| closedAt | Feb 23, 2026, 7:05 PM |
Create DevIndex Guide: Data Scientists & Researchers
tobiu added parent issue #9267 on Feb 23, 2026, 6:14 PM
tobiu assigned to @tobiu on Feb 23, 2026, 6:14 PM

tobiu
Feb 23, 2026, 6:19 PM
Input from Gemini 3.1 Pro:
✦ Note for future agent session: Before starting work on this guide, you MUST read the following existing documentation as critical context:
learn/guides/devindex/Backend.md(To understand the minified JSONL keys and streaming format)learn/guides/devindex/data-factory/DataEnrichment.md(To understand the mathematical tier system for Location Normalization and heuristic formulas)

tobiu
Feb 23, 2026, 7:05 PM
Input from Gemini 3.1 Pro:
✦ I've completed and committed the Persona Guide for Data Scientists & Researchers.
The guide covers:
- How to access the
users.jsonlstatic dataset and decode the minified keys (tc,fy,y,cy,py).- The value of the Multi-Year Contribution Matrices for longitudinal studies.
- Detailed explanations of the "Cyborg Metrics" (
hm.v,hm.a,hm.c) for algorithmic anomaly detection and segmentation (e.g., separating bots from humans).- Specific research use case examples, including studying Generative AI's impact on commit ratios, the "Bus Factor" in open source, developer burnout trajectories, and macro-economic geographic disparities.
Closing this ticket as the guide is now merged and linked in
tree.json.
tobiu closed this issue on Feb 23, 2026, 7:05 PM
Objective: Create a guide for researchers on how to leverage the DevIndex dataset (
users.jsonl). Key Points: