LIS Education and Data Science for the National Digital Platform (LEADS-4-NDP)

LEADS Home People LEADS Fellows  Fellows Blog

LIS Education and Data Science for the National Digital Platform (LEADS-4-NDP) is preparing next generation LIS faculty and early career information professionals to meaningfully integrate data science into LIS education and their research and work endeavors.

Supported by the Institute of Library and Museum Services (IMLS) Laura Bush 21st Century Librarian Program, LEADS-4-NDP is building a cohort of talented Fellows, ready to address and advance data science challenges in the workplace and education.

In 2018, LEADS-4-NDP supported 10 LEADS Fellows from ALA accredited programs across the United States. The 2019 the LEADS class includes 11 more Fellows and may expand to early career professionals who elect to bring a workplace challenge.

All student Fellows receive a summer stipend and complete the following: 1) an online preparatory curriculum, 2) an intensive 3-day data science boot camp at Drexel University (June 6-8), and 3) a ten-week virtual data science internship over the summer with one of our LEADS partners.

For inquiries about this unique educative program, please contact the Metadata Research Center (, or Jane Greenberg directly.

View the PR Flyer
View 2018 LEADS fellow final lightning talks here

Applicants have been selected for the 2019 year.

LEADS Fellowship Projects at NDP Sites

LEADS is a virtual fellowship program; students located near their selected site may have more on site interaction.

Project Partner/Site

Project title

(link to full description)

Project outcome


1. California Digital Library, University of California,
Office of the President
Making a Metadata Meritocracy Refined workflows for gaining acceptance of proposed metadata terms
2. Digital Curation Innovation Center (DCIC), University of Maryland’s iSchool Automating the Detection of Personally Identifiable Information (PII) The project will partner with in Seattle to work on processing and releasing these records to the public using community-vetted and approved access policies
3. Digital Public Library of America (DPLA) DPLA Resources and Vocabulary Enrichment for Analytics A method and an approach for identifying term variations and applying terminology more consistent terminology to support analytics.
4. Digital Research Services, University of Pennsylvania Libraries Semi-automatically assigning keywords to medieval manuscripts on OPenn Will feed into an ongoing project to improve the search providing access to OPenn, and will also make visualization of the data possible in ways not possible now.
5. Historical Society of Pennsylvania Enhancing access to historic biographical data through visualization tools Improved access to historic data for scholars and other user groups via user-friendly visualization tools. The ability to visualize connections between people, places, and institutions.
6. OCLC Automatic Identification of Publisher Entities to Support Discovery and Navigation Seeks to advance our understanding of the publisher entity in library bibliographic data.
7. Academy of Natural Sciences This project seeks to augment existing biodiversity specimen collection data by linking related descriptions found in the rich tradition of natural history literature
8. Digital Scholarship Center, Temple University Automating keyword assignments for the Nineteenth-Century Knowledge Project Will contribute to an ongoing project involving the automatic indexing of four digitized historical Encyclopedia Britannicas. This aspect of the project seeks to convert digitized historical controlled vocabularies into SKOS format.
9. RAMP, Montana State University Library Analyzing the RAMP dataset to better understand content, use, and performance of institutional repositories Exploring potential of the RAMP dataset, including: analyzing the scholarly record across institutional repositories (IR); demonstrating level of IR use; evaluating incentives to improving click-through rates; comparing RAMP download reports with vendor and IRUS download reports; automatically predicting disambiguated structured and ontological metadata, etc.