I am a Master of Science in Biostatistics graduate from the Harvard T.H. Chan School of Public Health. I currently work full-time at Siemens Healthineers as a Biostatistician. In May of 2017, I graduated with a B.S. in Public Health with a concentration in Biostatistical Studies and a Minor in Mathematics from the University of Massachusetts, Amherst. Most of my experience revolves around statistical programming and analysis: I am an advanced programmer in R and SAS statistical programming languages, with intermediate skills in SQL, Tableau, HTML, CSS, and STATA. I am highly skilled at developing interactive R Shiny web applications and tools, as well as polished R Markdown reports. Statistical methodology, clinical trials, data science, data visualization, and machine learning are also my areas of expertise.
Nathan Hall
24 Colgate Rd
Apartment 3
Roslindale, MA 02131 US
(978) 799-1560
https://nathanh36.github.io/nehall_biostatistician/
Master of Science in Biostatistics • March 2019
GPA: 3.81/4.00
Bachelor of Science in Public Health • May 2017
Minor in Mathematics with a concentration in Biostatistical Studies.
GPA: 3.81/4.00
Summer Institute for Training in Biostatistics - Scholarship • June 2016 - July 2016
Biostatistician • January 2019 - Present
Biostatistics Intern • May 2018 - December 2018
Data Analyst • May 2017 - January 2018
In addition to the programming languages below, I have advanced data visualization capabilities and an innate aptitude to learn new skills very quickly. I am very familiar with a number of statistical and data science techniques, including (but not limited to): regression modeling, data science, machine learning (random forests, gradient boosting machines, neural networks, etc.), clinical trials, statistical process control & Six Sigma, statistical methodology, and missing data methodology/multiple imputation.
Full-text of my Masters Thesis from Harvard University, titled: "Using Random Forests to Multiply Impute Data in an Online Patient-Centered Support Platform". The thesis is aimed at exploring a unique method for the handling of extremely large proportions of missing data in a mental health-oriented, online observational observational setting.
Machine Learning, Multiple Imputation, StatisticsPublication I am a co-author of in the Journal of Affective Disorders (JAD), in collaboration with MGH Bipolar Clinic and MGH Biostatistics Center staff. This paper outlines the Pilot study of a lifestyle intervention for bipolar disorder: Nutrition Exercise Wellness Treatment (NEW Tx). This study was aimed at investigating the effectiveness of a new and creative approach to managing both mental and physical health outcomes.
Mental Health, Statistics, Pilot StudyFinal project from a health data science course I took during my time at Harvard University. My team and I used sentiment and network analysis, in addition to developing an interactive R Shiny web application, to quantify various outcomes during the Murfreesboro & Shelbyville riots of 2017. The data is mined directly from Twitter during the time window that the riots where taking place.
Sentiment Analysis, Network Analysis, Data Science, R ShinyApart from my desire to continue working in the biotech field as either a biostatistician, data scientist, interactive web application developer (or a combination of all 3), I hope to improve upon my web development & UI design skills even further. Although I just started working with HTML & CSS, I have picked up on these skills rather quickly and I certainly enjoy designing webpages and functional user interfaces. Leaning javascript in the near future is also a goal of mine, to make my web applications even better and implement improved graphics capabilities. Overall, it is extremely important to me that I continue to leverage my current skills, in addition to developing new ones, in order to improve health outcomes on a global scale.