News & Events

LEADS Forum: January 24th, 2020

After two successful years of the LEADS-4-NDP program, the Metadata Research Center and Drexel CCI will host a LEADS forum on Friday, January 24th, here at Drexel University.

2018 LEADS cohort
2019 LEADS cohort

This event is an opportunity for LEADS advisory board members, mentors, and fellows from both cohorts of the LEADS program to get together. The forum will include a panel of project mentors, student presentations, breakout groups, and a discussion of different models for continuing the LEADS program.

What: LEADS-4-NDP Forum
Date: January 24th, 2020
Time: 10am – 3pm
Where: 3675 Market Street, Quorum (floor 2)
Drexel University
Philadelphia, PA
Forum agenda: TBA.

Sam Grabus exploring the canals in Utrecht

MRC’s Sam Grabus presents at Digital Humanities 2019, in Utrecht

MRC PhD student Sam Grabus and Temple University’s Peter Logan presented their paper at Digital Humanities 2019 in Utrecht, Netherlands, on Thursday, July 11th.

Sam Grabus presenting at DH 2019, in Utrecht, demonstrating how the HIVE tool maps naturally-extracted keywords to controlled vocabulary terms.

The presentation, entitled “Knowledge Representation: Old, New, and Automated Indexing,” shared comparative topic relevance results from automatically indexing 19th-century Encyclopedia Britannica entries with two controlled vocabularies: a historical knowledge organization system developed by Ephraim Chambers and the contemporary Library of Congress Subject Headings.


CCI Presents at ICHI 2019 in Xi’an, China

Drexel CCI participated in the 7th IEEE International Conference on Healthcare Informatics (ICHI 2019) in Xi’an, China, from June 10-13. CCI professor Chris Yang served as a general co-chair and panelist for the conference.

PhD students Ou Stella Liang and Michal Monselise presented their full paper, “Identifying Important Risk Factors Associated with Vehicle Injuries using Driving Behavior Data and Predictive Analytics,” co-authored with Chris Yang. Ou Stella also presented a data analytics challenges paper, co-authored with Ali Jazayeri and Chris Yang, entitled “Interpatient Similarity-based Imputation of Missing Data in Electronic Health Records.”

Ou Stella participated in the doctoral consortium with her presentation, “Determining Safe Prescription Practices for Pregnant Women.”



MRC Hosts NASKO 2019

The Metadata Research Center hosted the North American Symposium on Knowledge Organization (NASKO 2019) from June 13-14.

NASKO 2019 Participants: MRC’s Sam Grabus, Jane Greenberg, Sonia Pascua, and Deborah Garwood.

MRC PhD student Sam Grabus presented her paper, “Representing Aboutness: Automatically Indexing 19th-Century Encyclopedia Britannica Entries.” The presentation discussed topic relevance evaluation for automatic indexing results, comparing which of three keyword extraction algorithms produces the most relevant results for the digital collection.

Sam Grabus presenting at NASKO 2019

MRC Co-Sponsors NASKO this week: NASKO Highlights

The Metadata Research Center is co-hosting the North American Symposium on Knowledge Organization (NASKO 2019) from June 13-14 at the College of Computing and Informatics.

Howard White: “On Patrick Wilson”

Professor Emeritus and Visiting Research Professor Howard White will deliver a special presentation at NASKO, titled “On Patrick Wilson.” Read more about Howard here.


Metadata Mixer: “Metadata Madness”

TOPIC: Metadata Madness – accomplishments for the year, and/or goals for the summer.
Presenters: CCI PhD students, Cecilia Preston
Date: Wednesday, June 12th
Time: 12:30-1:30 PM
Location: 3675 Market Street,
University City Science Center,
CCI’s new location
Room: Dean’s conference room, #1039 (10th floor)

ADDED FUN: A visit to the Metadata Research Center, now residing on the 11th floor of 3675 Market Street, joining AI (artificial intelligence) and data science [This is for guests outside CCI who may attend].


LEADS-4-NDP 2019 Data Science Boot Camp

The LEADS-4-NDP 2019 fellowship program kicked off this week with a 3-day data science boot camp at Drexel University’s College of Computing and Informatics. Eleven fellows from iSchools across the U.S. are paired with nine National Digital Platform partner sites for 10-week remote internships to address data science challenges.

The 2019 LEADS cohort, joined by CCI’s Dr. Il-Yeol Song, Dr. Jane Greenberg, OCLC’s Jean Godby, and Project Manager Sam Grabus

Boot camp sessions covered big data management, metadata, data pre-processing, data visualization, data mining and machine learning, large-scale and parallel computing, and automated data analytics tools. As part of the boot camp, LEADS mentors Jean Godby of OCLC and Richard Marciano of DCIC spoke about data science opportunities at their institutions, and LEADS mentors Steven Dilliplane of the Academy of Natural Sciences and Peter Logan of Temple University’s Digital Scholarship Center participated in boot camp activities.

Read more about the LEADS program HERE.


Jeremy Leipzig Proposal Defense

Date: Thursday, 5/16
Time: 3:30pm
Location: Room 1005

Committee:
  • Dr. Jane Greenberg, chair
  • Dr. Erjia Yan
  • Dr. Xia Lin
  • Dr. Gail Rosen, ECE
  • Dr. Karthik Ram, UC Berkeley

Title: Reproducible Computational Research in Bioinformatics: A Study of Tools and Metadata to Bind the Analytic Stack

Abstract: Reproducible computational research (RCR) and the “reproducibility crisis” continue to attract attention in a number of scientific disciplines. In this proposal, I represent reproducibility in terms of cohesiveness in the “analytic stack” comprising raw input data, tools, workflows, analyses, and publications. I review a number of existing major case study types: reproduction, replication, refactor, robustness test, survey, census, and case narrative. Of particular interest are refactors, in which an existing analysis with abstract methods is reimplemented by a third party. This proposal will identify three studies to be refactored, the state-of-the-art tools and standards to be applied, and how these attempts will be evaluated by external reviewers. The process of the refactor can be used to evaluate the limitations of reproducibility using conventional tools. From the refactor and survey, I will identify persistent gaps in the “analytic stack” and describe features of metadata solutions that can be used to address these deficiencies.