LearnNewsExamplesServices
Frontmatter
id9032
titleFeat: DevRank Spider Random Walk Strategy
stateClosed
labels
enhancementai
assigneestobiu
createdAtFeb 7, 2026, 7:44 PM
updatedAtFeb 7, 2026, 9:57 PM
githubUrlhttps://github.com/neomjs/neo/issues/9032
authortobiu
commentsCount1
parentIssue8930
subIssues[]
subIssuesCompleted0
subIssuesTotal0
blockedBy[]
blocking[]
closedAtFeb 7, 2026, 9:57 PM

Feat: DevRank Spider Random Walk Strategy

Closed v12.0.0 enhancementai
tobiu
tobiu commented on Feb 7, 2026, 7:44 PM

Enhance the Spider's discovery algorithm to avoid "Filter Bubbles" (repeatedly scanning top repos) and increase coverage.

Strategies to Explore:

  1. Temporal Slicing: Search repositories by random creation date ranges (e.g., created:2022-05-01..2022-05-07) instead of just stars:>1000.
  2. Stargazer Leap: Pick a random indexed user -> fetch their starred repos -> scan those repos.
  3. Dictionary Attack: Search for random keywords/prefixes.

Goal: Ensure diverse discovery of "hidden gem" developers.

tobiu added the enhancement label on Feb 7, 2026, 7:44 PM
tobiu added the ai label on Feb 7, 2026, 7:44 PM
tobiu added parent issue #8930 on Feb 7, 2026, 7:44 PM
tobiu assigned to @tobiu on Feb 7, 2026, 7:44 PM
tobiu referenced in commit 40236b2 - "chore: Exclude DevRank Data from NPM Package (#9032) on Feb 7, 2026, 7:45 PM
tobiu referenced in commit 968bd0e - "feat: Implement DevRank Spider Random Walk Strategy (#9032)" on Feb 7, 2026, 9:57 PM
tobiu
tobiu Feb 7, 2026, 9:57 PM

Input from Gemini 3 Pro:

✦ Implemented the Random Walk Strategy to diversify candidate discovery and break out of "filter bubbles".

Implemented Strategies

  1. Core: High Stars (40%): The classic top-down approach (stars:>1000).
  2. Discovery: Keyword (30%): Dictionary attack using 60+ dev-related keywords (e.g., "wasm", "compiler", "neural") to find niche experts.
  3. Discovery: Temporal (20%): Slices the last 10 years into random 1-week windows to find "hidden gems" regardless of total star count.
  4. Discovery: Stargazer Leap (10%): Traverses the graph by picking a random indexed user and scanning their starred repositories.

Verification

Ran a local devrank:spider session which randomly selected the Keyword strategy (topic:machine-learning) and successfully discovered ~728 new candidates in a single 3-page run.

Code committed in apps/devrank/services/Spider.mjs.

tobiu closed this issue on Feb 7, 2026, 9:57 PM