Technical Skills
My Harvard coursework focused on Python but also included C/Unix and Java. Data Science work included extensive work with the popular python packages for Machine/Deep Learning and data management.
The projects linked below highlight many of the skills I learned at Harvard.
Languages, Frameworks, Libraries and Methodologies
  • Python
  • Open CV
  • Pandas
  • Numpy
  • SciKit-Learn / Statsmodels
  • PyGam / PyMC3
  • Keras / Tensorflow
  • Bokeh
  • Django
  • Flask
  • Docker
  • Agile
  • UML
  • Flask
  • C / Unix
  • Java (incl. Maven)
  • Springboot
  • Microservices
  • RaspberryPi
ML, Deep Learning and Statistical Modeling
  • Linear and Logistical Regression
  • Generalized Additive Models (GAMs)
  • Basic Reinforcement Learning
  • Bayesian Modeling
  • Splines
  • Clustering
  • Non-parametric Models
  • Ensemble Methods
  • NLP (Embeddings, RNN, LSTM)
  • CNN / Mask RCNN
  • Autoencoders
  • PCA
  • Generative Adversarial Networks (GANs)
  • Transfer Learning

Traffic Monitor

Track Detected Objects in Video Stream

By: Mark McDonald
The Traffic Monitor application will detect objects in a video feed and log, report and chart instances of objects that are recognized. A variety of objects can be selected for tracking as well as the source of the video stream.
Fashion Recommendation

Identification of Fashion Objects in an Image

By: Mark McDonald, An Hoang, Vivek Bhatia
Fashion-related items are segmented from images using Mask RCNN. Segmented objects are identified based on a large dataset provided in a Kaggle competition. An existing implementation of the Mask RCNN model is modified to apply to this project. This project was the result of a group effort for my course in Computer Vision.
EU Green Deal Reporting

EU Climate Reporting for EU Commissioners

By: Mark McDonald, Alan Martinson, Carly Gloge, Hemant Bajpai, Pritam Dey, Taylor Meyer
Harvard Software Engineering degree capstone group project. Our team was tasked by an EU Commission department to build a template which consolidated complex and diverse sources of data into an easy-to-understand front-end designed for EU Commissioner staff members. The effort focused on air pollution.
Mark's Covid Tracker

Daily US COVID-19 Trends

By: Mark McDonald
Various interactive charts are presented to the user show pertinent country-wide, state and county-level trends in the spread of COVID-19 in the US. This project uses Bokeh to build visualizations, Django to host the web pages, and JavaScript for user interactivity.
NYC Capital Projects

Predicting outcomes for NYC capital projects

By: Mark McDonald, Michael Sedelmeyer, An Hoang
Final group project for Advanced Data Science class at Harvard. Data available at the beginning of various large NYC capital projects is used to determine how likely the initial project budget and timeline will be after 3 years. This project compares various prediction methodologies as well as data engineering to find a latent reference class for the projects.
Chatterbox

Barebones web-chat with no fuss

By: Mark McDonald
A simple chat application demonstrating basic HTML, CSS, Flask and Internet messaging using sockets.
Pathfinder

Off-Road Path Navigation

By: Mark McDonald
Using Keras and OpenCV, Pathfinder will highlight a navigable path on a video of off-road terrain. The model was trained on a small set of publicly available data that was augmented to produce a useful set of training data.