LIS Education and Data Science for the National Digital Platform (LEADS-4-NDP)

LEADS Home | People | LEADS Fellows | Fellows Blog | LEADS Forum


*See 2020 LEADS Forum page for information about the upcoming event*

LIS Education and Data Science for the National Digital Platform (LEADS-4-NDP) is preparing next generation LIS faculty and early career information professionals to meaningfully integrate data science into their research and work endeavors. LEADS-4-NDP is supported by the Institute of Library and Museum Services (IMLS) Laura Bush 21st Century Librarian Program, and includes a cohort of 21 talented LEADS Fellows ready to address and advance data science challenges in the workplace and education.

LEADS Fellows have received a summer stipend and completed the following: 1) an online preparatory curriculum, 2) an intensive 3-day data science boot camp at Drexel University and 3) a ten-week virtual data science internship over the summer with one of our LEADS partners.

For inquiries about this unique educative program, please contact the Metadata Research Center (mrc.metadata@drexel.edu), or Jane Greenberg directly.

View the PR Flyer
View 2018 LEADS fellow final lightning talks here

2019 LEADS Fellows have completed internships and are preparing final deliverables


LEADS Fellowship Projects at NDP Sites

LEADS is a virtual fellowship program; students located near their selected site may have more on site interaction.

Project Partner/Site

Project title

(link to full description)

Project outcome

1. California Digital Library, University of California,
Office of the President
Making a Metadata Meritocracy Refined workflows for gaining acceptance of proposed metadata terms
2. Digital Curation Innovation Center (DCIC), University of Maryland’s iSchool Automating the Detection of Personally Identifiable Information (PII) The project will partner with Densho.org in Seattle to work on processing and releasing these records to the public using community-vetted and approved access policies
3. Digital Public Library of America (DPLA) DPLA Resources and Vocabulary Enrichment for Analytics A method and an approach for identifying term variations and applying terminology more consistent terminology to support analytics.
4. Digital Research Services, University of Pennsylvania Libraries Semi-automatically assigning keywords to medieval manuscripts on OPenn Will feed into an ongoing project to improve the search providing access to OPenn, and will also make visualization of the data possible in ways not possible now.
5. Historical Society of Pennsylvania Enhancing access to historic biographical data through visualization tools Improved access to historic data for scholars and other user groups via user-friendly visualization tools. The ability to visualize connections between people, places, and institutions.
6. OCLC Automatic Identification of Publisher Entities to Support Discovery and Navigation Seeks to advance our understanding of the publisher entity in library bibliographic data.
7. Academy of Natural Sciences This project seeks to augment existing biodiversity specimen collection data by linking related descriptions found in the rich tradition of natural history literature
8. Digital Scholarship Center, Temple University Automating keyword assignments for the Nineteenth-Century Knowledge Project Will contribute to an ongoing project involving the automatic indexing of four digitized historical Encyclopedia Britannicas. This aspect of the project seeks to convert digitized historical controlled vocabularies into SKOS format.
9. RAMP, Montana State University Library Analyzing the RAMP dataset to better understand content, use, and performance of institutional repositories Exploring potential of the RAMP dataset, including: analyzing the scholarly record across institutional repositories (IR); demonstrating level of IR use; evaluating incentives to improving click-through rates; comparing RAMP download reports with vendor and IRUS download reports; automatically predicting disambiguated structured and ontological metadata, etc.