Project Data
Deployed infrastructure, predictive modeling, and live web architecture.
Personal ML & Full-Stack
NBA Finals NLP Predictor
Live DeployAn end-to-end NLP pipeline that successfully predicted the exact series length and outright winner of the 2026 NBA Finals. I built a Retrieval-Augmented Generation (RAG) architecture utilizing ChromaDB and semantic chunking to manage and query an empirical dataset of over 1,000 post-game transcript snippets.
To extract sports intelligence, the system processes text through a custom RoBERTa pipeline, mapping 7-dimensional emotional feature vectors to isolate subtle psychological signals—such as shifts in team momentum and the anxiety profile of Head Coach Mike Brown.
To maintain compute efficiency and prevent overfitting on the tabular dataset, the final predictions are driven by a lightweight Random Forest algorithm, ensuring the mathematical logic behind every prediction remains strictly explainable.
Optichash
Live DeployI refactored this system from a monolithic web app into a hardware-aware, distributed computer vision pipeline focused on minimizing computational waste. The architecture uses a Java Spring Boot API Gateway to route multi-part image streams across a multi-stage Docker network.
To intercept duplicate queries before they hit the machine learning layer, a native C++ microservice evaluates structural geometry and generates O(1) Perceptual Hashes (pHash) via a Discrete Cosine Transform. For novel images, the pipeline relies on an edge-focused PyTorch MobileNetV3 model optimized with INT8 dynamic quantization, safely reducing the VRAM footprint by over 75% while achieving an inference baseline of exactly 58.6M FLOPs.
To bypass free-tier cloud constraints, I also engineered the vanilla JavaScript frontend into a stateful simulator, utilizing the HTML5 <canvas> API to run physical pixel heuristics locally in the browser.
Academic & Computational
NFL Combine Predictive Analysis
Python / MLThis started as an academic project and grew into a supervised machine learning suite. I used Random Forest and SVM ensembles to predict NFL rookie trajectories, evaluating draft classes for the NFC North and a few other key teams.
ISS Telemetry Analytics
Computational ModelingBuilt as an extension of my CMSE coursework, this is an unsupervised learning and spatial analysis pipeline. I mapped orbital bias and altitude decay by processing telemetry data streams to calculate structural limits and optimize trajectory models.