Skip to content
View Fshahnaj's full-sized avatar

Block or report Fshahnaj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
fshahnaj/README.md

πŸ‘‹ Hi, I'm Fujaila Shahnaj

Data/Analytics Engineer β€’ Data Analyst β€’ BI Developer β€’ ML Engineer (Healthcare & Product Analytics)

πŸŽ“ MS Computer Science @ Clemson University (GPA 3.87)
πŸ“ Raleigh–Durham–Cary–RTP (NC)
πŸ’‘ Specializing in Healthcare Analytics, ML Pipelines, BI Systems & Data Engineering


🌟 About Me

I’m a data/analytics engineering professional with experience building end-to-end clinical analytics platforms, enterprise-grade Power BI dashboards, and NLP-driven product insights systems. I thrive in turning messy real-world datasets into clean, validated, explainable insights that guide business decisions.

  • Built ML + NLP pipelines analyzing 368K+ records
  • Developed a clinical risk prediction model (ROC-AUC: 0.79)
  • Designed BI dashboards for senior leadership (KPI, RLS, automated refresh)
  • Implemented dbt + DuckDB star-schema warehouses for healthcare datasets
  • Fine-tuned BERT & RoBERTa models (F1: 0.84)

  • Built ML pipelines analyzing 368K+ records
  • Designed clinical risk prediction model (ROC-AUC: 0.79)
  • Delivered Power BI dashboards for senior leadership decision-making
  • dbt + DuckDB star-schema modeling for healthcare data
  • NLP modeling with BERT & RoBERTa (F1: 0.84)

πŸ“Š Featured Projects

πŸ”Ή CardioInsight-AI β€” Healthcare Analytics Platform

End-to-end cardiovascular risk analytics system
β€” HIPAA-style de-ID β†’ dbt warehouse β†’ ML β†’ Power BI clinical dashboard
ROC-AUC: 0.79

πŸ“ˆ Live Power BI Dashboard β€’ πŸ“‚ GithubLink


πŸ”Ή Product Hunt Community Insights (368K+ records)

NLP pipeline using BERT/RoBERTa to classify user complaints, praise, and feature requests
F1 Score: 0.84 πŸ“˜ Coming Soon


πŸ› οΈ Technical Skills

πŸ”§ Languages

Python, SQL

πŸ“Š Data Analytics

EDA, KPI Development, A/B Testing, Statistical Analysis, Visualization

🧠 Machine Learning

Logistic Regression, Tree Models, BERT/RoBERTa, Feature Engineering, Model Evaluation

πŸ—οΈ Data Engineering

dbt (models, tests, documentation), ETL/ELT, Dimensional Modeling
DuckDB, MySQL, Oracle, Spark

πŸ“ˆ BI & Visualization

Power BI (DAX, M, Star Schema, RLS), Tableau, Matplotlib, Seaborn

☁️ Cloud

AWS (S3, Glue, Redshift), EC2, IAM


πŸ’Ό Experience

Research Assistant β€” HAIE Lab | Clemson University
β€’ Analyzed 368K+ Product Hunt comments using ML/NLP
β€’ Built automated data pipelines (reduced processing time 60%)
β€’ Developed BERT multi-label classifier (F1: 0.84)


Graduate Assistant β€” Data Analytics | Clemson Graduate School
β€’ Designed enterprise Power BI dashboards
β€’ Implemented RLS and automated refresh schedules
β€’ Supported VPs/Deans with KPI tracking


Data Science Intern β€” Data Visualization Lab, Clemson Library
β€’ Built forecasting models (85% accuracy)
β€’ Created operational dashboards (Tableau/Power BI)
β€’ ETL across 50K+ records


Senior Lecturer β€” PCIU (Study Leave)
β€’ Taught DBMS, DS, Algorithms
β€’ Supervised ML/AI research projects


πŸ“¬ Contact

πŸ“§ Email: shahnajfujaila@gmail.com
πŸ”— LinkedIn: linkedin.com/Fujaila-Shahnaj
🌐 Portfolio: https://fshahnaj.github.io


⭐ Thanks for visiting! ⭐

Pinned Loading

  1. CardioInsight-AI CardioInsight-AI Public

    Production-grade ETL pipeline for cardiovascular risk analytics with HIPAA-compliant data processing, automated quality validation, and enterprise BI dashboards.

    Jupyter Notebook

  2. data-engineer-handbook data-engineer-handbook Public

    Forked from DataExpert-io-Community/data-engineer-handbook

    This is a repo with links to everything you'd ever want to learn about data engineering

    Jupyter Notebook

  3. Deep-Learning-Projects Deep-Learning-Projects Public

    Jupyter Notebook

  4. fork-commit-merge fork-commit-merge Public

    Forked from fork-commit-merge/fork-commit-merge

    Fork, Commit, Merge. A project designed to help you familiarize yourself with the open source contribution workflow on GitHub!

    JavaScript

  5. SoftwareFoundations SoftwareFoundations Public

    Forked from abastola0/SoftwareFoundations

    JavaScript

  6. FShahnaj.github.io FShahnaj.github.io Public

    HTML