CCI Senior Design

Clinical AI

Project Category

Research

Project Description

Evaluating data set formats and how they affect the results of different ML approaches

Abstract

This paper is about machine learning in the clinical domain. Patient data is very hard to obtain because hospitals are reluctant to make it available, and the main publicly available data set is MIMIC. MIMIC is limited because it covers only patients admitted to intensive care units. It contains information about patients' prescriptions, laboratory tests, admissions, diagnoses, etc.
The paper presents ways to shape features from the data set for use in machine learning and shows how these choices produce different results. It also shows how ML approaches such as logistic regression, SVM, and XGBoost perform with different data set formats.
The conclusion is that for logistic regression and SVM certain data set formats perform better across multiple diseases, while for XGBoost other formats perform better.
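As a rough illustration of what "shaping features into different formats" can mean, the sketch below turns the same prescription events into two per-patient formats: binary flags and occurrence counts. The toy records, the `build_features` helper, and the choice of these two formats are assumptions made for illustration only; the abstract does not state which formats the paper actually compares, and no real MIMIC data is used.

```python
from collections import Counter

# Hypothetical toy records: (patient_id, prescribed drug). Not real MIMIC data.
records = [
    (1, "aspirin"),
    (1, "aspirin"),
    (1, "heparin"),
    (2, "insulin"),
    (2, "heparin"),
]


def build_features(records, binary):
    """Turn event records into one fixed-length feature vector per patient.

    binary=True  -> 1 if the patient ever received the drug, else 0
    binary=False -> number of times the patient received the drug
    """
    # Fixed column order shared by every patient.
    vocabulary = sorted({drug for _, drug in records})

    # Count how often each drug appears for each patient.
    counts = {}
    for patient_id, drug in records:
        counts.setdefault(patient_id, Counter())[drug] += 1

    features = {}
    for patient_id, drug_counts in counts.items():
        row = [drug_counts[drug] for drug in vocabulary]
        if binary:
            row = [1 if value > 0 else 0 for value in row]
        features[patient_id] = row
    return vocabulary, features


vocabulary, binary_features = build_features(records, binary=True)
_, count_features = build_features(records, binary=False)

print(vocabulary)
print(binary_features)
print(count_features)
```

Either feature table could then be passed to a classifier such as logistic regression, SVM, or XGBoost, so that the same model is evaluated on each format and the results compared.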

Video Presentation

https://1513041.mediaspace.kaltura.com/media/Presentation/1_qr55mxl9

Team Members

Name: Laura-Amira Talaat-Hamid

Email: lt486@drexel.edu

Behind The Scenes

Name: Prof. Jeff Salvage

Email: jks29@drexel.edu

Name: Hegler Tissot

Email: hc848@drexel.edu