Accomplished Lead Data Scientist with extensive experience in machine learning and reliability engineering. Proven history of leading high-impact data science projects from ideation to deployment, generating over $1.2 billion in business value. Specializes in steering data-driven strategies and managing complete project lifecycles, aligning with iterative principles to address complex business challenges and facilitate cost reduction. Excels in both technical leadership and mentoring, translating complex technical findings to non-technical stakeholders to ensure successful project outcomes.
- A Visual History of Nobel Prize Winners
- Cleaning Bank Marketing Campaign Data
- Dr. Semmelweis and the Discovery of Handwashing
- Exploring NYC Public School Test Result Scores
- Exploring the Bitcoin Cryptocurrency Market
- Investigating Netflix Movies and Guest Stars in The Office
- Mobile Games AB Testing with Cookie Cats
- Name Game: Gender Prediction using Sound
- Predicting Credit Card Approvals
- Predicting High Traffic for a Website
- The Android App Market on Google Play
- The GitHub History of the Scala Language
Twice winner of machine learning competitions at Amazon for Regression and Binary Classification data science competitions.
Shared machine learning methodology to predict diesel engine life in large off-highway trucks. Life was extended by 33%.
Spoke at Caterpillar large mining truck conference to discuss Empire-Cat’s truck rebuild program. Empire will completely rebuild a mining haul trucks at 60% of the cost of a new machine and operates with 2 to 4% more physical availability above trucks from the factory.
Discussed Empire's reliability engineering program which involves a focus on "find it" using RCA, critical component life analysis, inspections, telematics, and condition monitoring, followed by identifying key factors using machine learning, and then implementing "fix it" solutions through interim corrective actions and working with Caterpillar's FMEA process for permanent solutions.
DataCamp provides assessments for a range of topics within data science, including Python, R, SQL, machine learning, statistics, and more.
| Assessment | Percentile |
|---|---|
| Data Manipulation with Python | 95th |
| Data Visualization with Python | 95th |
| Importing and Cleaning Data with Python | 99th |
| Machine Learning Fundamentals in Python | 99th |
| Programming for Data Engineering | 99th |
| Python Programming | 98th |
| Statistics Fundamentals with Python | 97th |
| Assessment | Percentile |
|---|---|
| Data Manipulation with R | 98th |
| Data Visualization with R | 96th |
| Importing and Cleaning Data with R | 96th |
| Machine Learning Fundamentals in R | 98th |
| R Programming | 98th |
| Statistics Fundamentals with R | 97th |
| Assessment | Percentile |
|---|---|
| Data Analysis in SQL (PostgresSQL) | 99th |
| Data Management in SQL (PostgresSQL) | 95th |
| Assessment | Percentile |
|---|---|
| AI Fundamentals | 99th |
| Analytic Fundamentals | 97th |
| Data Management Theory | 97th |
| Data Storytelling | 95th |
| Data Visualization Theory | 99th |
| Exploratory Analysis Theory | 95th |
| Statistical Experimentation Theory | 99th |
| Understanding and Interpreting Data | 98th |
| Assessment | Percentile |
|---|---|
| Data Manipulation with Python | 95th |
| Data Visualization with Python | 95th |
| Importing and Cleaning Data with Python | 99th |
| Machine Learning Fundamentals in Python | 99th |
| Programming for Data Engineering | 99th |
| Python Programming | 98th |
| Statistics Fundamentals with Python | 97th |
| Assessment | Percentile |
|---|---|
| Data Manipulation with R | 98th |
| Data Visualization with R | 96th |
| Importing and Cleaning Data with R | 96th |
| Machine Learning Fundamentals in R | 98th |
| R Programming | 98th |
| Statistics Fundamentals with R | 97th |
| Assessment | Percentile |
|---|---|
| Data Analysis in SQL (PostgresSQL) | 99th |
| Data Management in SQL (PostgresSQL) | 95th |

