Computer scientist & biologist. Portfolio showcasing projects, research, and skills.
Computer Scientist & Biologist
My foundation in Biology from James Madison University gives me the domain expertise to understand complex life science challenges, while my Computer Science degree from the University of Colorado Boulder provides the technical skills to build robust, data-driven solutions. I build projects at the intersection of software, machine learning, and health sciences.
As a Software Engineer Intern at AstraZeneca, I designed and developed a C#/.NET application to automate the ingestion and normalization of experimental data into SQL Server, directly supporting downstream machine learning workflows. I am passionate about leveraging AI to create scalable and maintainable tools that advance healthcare.
Career Goals
I aim to pursue a career building AI and data-driven tools that improve healthcare outcomes. My long-term goal is to develop innovative software and machine learning solutions for biotech, health tech, or pharmaceutical organizations, while continuing to grow as a researcher and engineer.
| View Projects | Download Resume |
B.S. Computer Science — University of Colorado Boulder, GPA 4.0, Dec 2025
Awards: Dean’s List Student, CSPB Orientation Leader
B.S. Biology — James Madison University, Jul 2021
Awards: Dean’s List Student
📧 matthewmartin117@gmail.com
LinkedIn | GitHub
Python, C#, Java, SQL, C++
TensorFlow, PyTorch, Keras, Pandas, NumPy, Scikit-learn
Flask, React, Spring Boot, JavaScript
Git, Docker, AWS, Maven, Postman, SFML
Clinical & lab workflows, data preprocessing, ETL pipelines, exploratory data analysis
Java Spring Boot API + ML Classifier for HIPAA Protected Health Data
Developing a secure, full-stack application to classify protected health information (PHI). The project involves building a robust Java Spring Boot API to handle sensitive data securely and integrating a machine learning model for real-time classification.
Developed a model to classify pneumonia from chest X-rays, overcoming a highly imbalanced dataset. The high recall is critical for minimizing false negatives in a medical context.
Impact: Achieved 86% accuracy and a **recall of 0.92** using a CNN with a VGG16 base and class weighting.
Tech: TensorFlow, Keras, Python
GitHub
Analyzed a large-scale CDC survey dataset of over 400K responses to predict heart disease risk. Trained and evaluated multiple models to identify key health indicators.
Impact: Gradient Boosting models achieved ~90% recall, vital for identifying at-risk individuals.
Tech: Scikit-learn, Pandas, NumPy
GitHub
Used unsupervised learning to uncover distinct patient subtypes from complex, high-dimensional gene data (54K+ features), demonstrating patterns not visible through simple analysis.
Impact: Applied **PCA for dimensionality reduction** and **K-Means clustering** to identify meaningful biological subgroups.
Tech: Scikit-learn, PCA, K-Means
GitHub
Engineered a high-performance C++ application to process and visualize high-frequency telemetry data streams in real-time without data loss.
Impact: Successfully parsed complex data packets and rendered dynamic, live-updating charts using the **SFML graphics library**.
Tech: C++, SFML
GitHub
Built a full-stack web application that simulates infectious disease spread using the SIRD model. Users can adjust parameters and visualize outcomes via interactive dashboards.
Impact: Provides an accessible tool for understanding epidemiological concepts and the effects of variables like transmission rate.
Tech: Python, Flask, JavaScript
GitHub
Engineered a full-stack social media application from the ground up, allowing users to connect based on shared interests through posts, comments, and a recommendation system.
Impact: A complete MERN-like stack application featuring user authentication, a RESTful API, and a dynamic front-end.
Tech: Python, Flask, React, PostgreSQL
GitHubSoftware Engineering Intern | AstraZeneca — May 2025–Aug 2025
Sampling Coordinator | United States Pharmacopeia — Mar 2022–Jan 2023
Research Associate | Curative Health — Aug 2021–Dec 2021