Paco Nathan  

Rich Context is a research effort within the Coleridge Initiative at NYU Wagner that is developing a knowledge graph built from metadata about the use of curated datasets and related research. Its foundation is the Administrative Data Research Facility (ADRF), a platform used across 15 US government agencies for social science research with sensitive data. ADRF promotes evidence-based policymaking and supports data stewardship practices among the agencies. Rich Context, in turn, represents the several kinds of entities involved: datasets, data providers, researchers, research publications, subject headings, etc. ML applications leverage this graph to perform entity linking, e.g., identifying dataset attribution within open access research publications. This talk covers Rich Context, its ML competition, and the related new features for data governance and collaboration that are going into JupyterLab.