I'm a Computer Science student at the University of Waterloo. I previously worked at Software Analysis and Intelligence Lab (SAIL), where I applied traditional machine learning techniques to provide insights to academic software engineering. I love programming unique and novel solutions to a variety of domains - take a look at the work and projects I've done throughout the years.
Led project on data mining of Medium. Built a robust web crawler to mine a network of 6+ million articles totalling 109GB in size. Applied machine learning models (logistic regression, support vector machines) using Python (pandas and scikit-learn) to study 6 million Software Engineering articles. Implemented multiprocessing techniques to increase feature extraction performance by 8-fold. Wrote SQL queries on Google BigQuery to assess Medium's usage on Stack Overflow.
Used R and Python to preprocess and extract 100+ features from 19.3 million messages to study the Stack Overflow chat platform. Trained logistic regression models on 86K chat rooms, evaluated with established evaluation metrics such as AUC/ROC, AIC, Wald statistics.
I'm a maker at heart. Over the years, I've worked on a variety of projects in my spare time. Check out my Github to see more.