X-Rays
Screen Shot 2020-09-23 at 6.55.52 PM.png

Detection of COVID-19 induced Pneumonia using Chest X-rays - A Deep Learning Implementation.

An approach to distinguish between NORMAL, PNEUMONIA, and COVID-19 positive patients using Convolutional Neural Networks for Image classification on Chest X-rays.

The links to the pre-processed data sets are provided below:

Download COVID-19 Dataset

Download NORMAL and PNEUMONIA Dataset

  • Exploratory Data Analysis and Data Extraction (Notebook)


*CXR - Chest X-rays

*PA - Posteroanterior (view of X-ray images)

Screen Shot 2020-09-23 at 4.44.58 PM.png
Screen Shot 2020-09-23 at 7.19.06 PM.png
Screen Shot 2020-09-23 at 7.17.38 PM.png
Screen Shot 2020-09-23 at 7.24.41 PM.png

Big Data Wrangling with Google Book Ngrams

(Load, filter, and visualize a large dataset in AWS cloud environment)

  • Work with real-world data using Hadoop, Spark, and the AWS S3 filesystem.

  • Pull data from (public) S3 bucket to HDFS (Hadoop)

  • Analyze and filter data using Spark

  • Perform Data Analysis and visualizations.

Hotel Desk Check-In
Screen Shot 2020-09-23 at 7.13.31 PM.png
Screen Shot 2020-09-23 at 7.13.56 PM.png

Hotel Reviews - Booking.com

(Natural Language Processing/ Sentiment Analysis)

Download the Dataset

  • Exploratory Data Analysis and Data Wrangling

  • Employ various ML models and comparing their performance (Logistic Regression with PCA and Cross-Validation, KNN, Decision Trees and Random Forests)

Screen Shot 2020-09-23 at 7.13.31 PM.png
Screen Shot 2020-09-23 at 7.13.56 PM.png

US Presidential Elections

(EDA, Data Cleaning, Visualization, Statistical modeling, and inference)

  • EDA and Data cleaning on previous election data.

  • Statistical modeling, Model selection, and Inference

BIXI Montréal, bike-sharing system (Data Analysis and Data Visualization)

  • Data Analysis of the real-world data using SQL queries on MySQL workbench.

  • Data Visualization and Dashboards on Tableau with key insights into revenue and business growth.

* Due to confidentiality reasons and some of these projects being deliverables for the BrainStation Data Science program, the information is not     publicly hosted on Github. Please reach out to me over LinkedIn and I will be happy to provide with more information.