data: ground-truth PhD roster fixture for 3 pilot chairs#57
Open
ValentinJSchmidt wants to merge 4 commits into
Open
data: ground-truth PhD roster fixture for 3 pilot chairs#57ValentinJSchmidt wants to merge 4 commits into
ValentinJSchmidt wants to merge 4 commits into
Conversation
Captures current PhD rosters for the three pilot chairs as the recall benchmark for PhD discovery (#47): - Autonomous Learning / Distributed Intelligence (Georg Martius) - Theory of Machine Learning (Ulrike von Luxburg) - Autonomous Vision Group (Andreas Geiger) Rosters were captured from each chair's official team page on 2026-06-22 via automated fetch. NOT final: entries still need human verification against the live pages before this is treated as authoritative ground truth. status values active/incoming/associated/ former are documented in the file header; only 'active' counts as a recall target. Postdocs, research engineers and admin staff excluded. Refs #46 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Ground-truth PhD fixture for the 3 pilot chairs drafted and committed as WIP; pending human verification. Refs #46 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Former PhD alumni are removed from the ground-truth roster entirely. People listed under a chair's 'Researcher' role are now treated as active PhDs (Martius: Kloss, Kolev, Geist), while research engineers remain excluded. Martius roster: 20 active, 0 former. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Add a 'postdoc' status counted as a recall target. von Luxburg gains 5 postdocs (Bhattacharjee, Bordt, König, Thiessen, Waller) confirmed from the live team page. Martius and Geiger have no current postdoc section, so none added there. Also backfill profile URLs for the three Martius Researchers (Kloss, Kolev, Geist) found on the team page. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Collaborator
Author
|
This is now also manually approved |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds a hand-curated ground-truth PhD roster fixture for the 3 pilot chairs and marks Step 2 (#46) as in progress in STATUS. This is the recall benchmark that makes "none forgotten" measurable for PhD discovery (#47).
Motivation
PhD discovery (#47) has no objective target to measure recall against — OpenAlex exposes no supervisor→PhD edge. This fixture captures the official team-page rosters so any future discovery method can be scored against a known set.
What Changed
skills/tests/fixtures/ground_truth_phds.json— new fixture covering:status(active/incoming/associated/former),role_text, profile/evidence URLs, and averification_note. Postdocs, group leaders, and staff are intentionally excluded with rationale in each chair'snotes. Onlyactivecounts as a recall target.STATUS.md— Step 2 (data: build hand-verified ground-truth roster for 3 pilot chairs #46) → 🟨 in progress, plus a dated log line.Known Issues / Not Yet Done
notes.Part of #46.