Data/Analytics Engineer β’ Data Analyst β’ BI Developer β’ ML Engineer (Healthcare & Product Analytics)
π MS Computer Science @ Clemson University (GPA 3.87)
π RaleighβDurhamβCaryβRTP (NC)
π‘ Specializing in Healthcare Analytics, ML Pipelines, BI Systems & Data Engineering
Iβm a data/analytics engineering professional with experience building end-to-end clinical analytics platforms, enterprise-grade Power BI dashboards, and NLP-driven product insights systems. I thrive in turning messy real-world datasets into clean, validated, explainable insights that guide business decisions.
- Built ML + NLP pipelines analyzing 368K+ records
- Developed a clinical risk prediction model (ROC-AUC: 0.79)
- Designed BI dashboards for senior leadership (KPI, RLS, automated refresh)
- Implemented dbt + DuckDB star-schema warehouses for healthcare datasets
- Fine-tuned BERT & RoBERTa models (F1: 0.84)
- Built ML pipelines analyzing 368K+ records
- Designed clinical risk prediction model (ROC-AUC: 0.79)
- Delivered Power BI dashboards for senior leadership decision-making
- dbt + DuckDB star-schema modeling for healthcare data
- NLP modeling with BERT & RoBERTa (F1: 0.84)
End-to-end cardiovascular risk analytics system
β HIPAA-style de-ID β dbt warehouse β ML β Power BI clinical dashboard
ROC-AUC: 0.79
π Live Power BI Dashboard β’
π GithubLink
NLP pipeline using BERT/RoBERTa to classify user complaints, praise, and feature requests
F1 Score: 0.84
π Coming Soon
|
Python, SQL EDA, KPI Development, A/B Testing, Statistical Analysis, Visualization Logistic Regression, Tree Models, BERT/RoBERTa, Feature Engineering, Model Evaluation |
dbt (models, tests, documentation), ETL/ELT, Dimensional Modeling Power BI (DAX, M, Star Schema, RLS), Tableau, Matplotlib, Seaborn AWS (S3, Glue, Redshift), EC2, IAM |
Research Assistant β HAIE Lab | Clemson University
β’ Analyzed 368K+ Product Hunt comments using ML/NLP
β’ Built automated data pipelines (reduced processing time 60%)
β’ Developed BERT multi-label classifier (F1: 0.84)
Graduate Assistant β Data Analytics | Clemson Graduate School
β’ Designed enterprise Power BI dashboards
β’ Implemented RLS and automated refresh schedules
β’ Supported VPs/Deans with KPI tracking
Data Science Intern β Data Visualization Lab, Clemson Library
β’ Built forecasting models (85% accuracy)
β’ Created operational dashboards (Tableau/Power BI)
β’ ETL across 50K+ records
Senior Lecturer β PCIU (Study Leave)
β’ Taught DBMS, DS, Algorithms
β’ Supervised ML/AI research projects
π§ Email: shahnajfujaila@gmail.com
π LinkedIn: linkedin.com/Fujaila-Shahnaj
π Portfolio: https://fshahnaj.github.io
