Skip to content

youssefHosni/Data-Science-Portofolio

Repository files navigation

Data Science Portfolio

Substack Medium Kaggle YouTube

End to End Projects

  • Utilized Python programming language and Bash scripting to preprocess and extract features from 4D complex brain imaging data amounting to 0.25 TB, applying a 3D multiresolution optical flow technique.
  • Employed signal processing techniques with Python programming language to modulate respiratory signals with calculated brain signal speed.
  • Conducted A/B testing to identify significant differences in the modulation of brain cardiovascular pulse with respiration between control subjects and individuals with Alzheimer's disease.
  • Visualized and presented research findings to neurological researchers, contributing to a greater understanding of Alzheimer's disease. Additionally, authored and submitted a paper to JCBFM.
  • Paper: Elabasy, A., Suhonen, M., Rajna, Z. et al. Respiratory brain impulse propagation in focal epilepsy. Sci Rep 13, 5222 (2023). https://doi.org/10.1038/s41598-023-32271-7
  • Trained multiple classifiers that improved the classification performance of imbalanced Alzheimer’s fMRI images by more than 12% compared to state-of-the-art on the same data.
  • Analyzed, visualized, and discussed the results with a team of neurological researchers to have a better understanding of the results and Alzheimer’s disease.
  • Analyzed, visualized, and reported the results and submitted a research paper to the ISPr 2023 scientific conference.
  • Developed a real-time sign language interpretation application using React.js, TensorFlow, and tensorflow.js and deployed it on IBM cloud servers.
  • Build a stable diffusion web application using Hugging Face and React and deploy it on fast API.
  • Building a recommendation engine using Alternating Least Squares in PySpark and using the popular MovieLens dataset and the Million Songs dataset.
  • Building a real-time car plate detection mobile application using TensorFlow and EasyOCR.

Skill Based Projects

Generative AI

Fine Tuning LLMs

Reterival Augmented Generation (RAG) Application

Prompt Engineering

Machine Learning:

Regression

  • Automobile price prediction: Utilize Python to implement end to end data science pipeline to predict the price of old Automobile based on the given features.

Classification

  • Sensor Activity Recogniation: Classifying the output of eight sensors into five activities and studied the effect of changing window sizes and axel combination.
  • Alzhimers CV-BOLD Classification: Utilized Python to develop supervised machine learning techniques to classify imbalanced Alzheimer’s CVBOLD data, which enhanced the classification performance by 10%.

Clustering


Deep Learning

Classification


Computer Vision


Natural Language Processing

  • Sentiment Analysis web app: Web application for classification of reviews, using deep learning model implemented in PyTorch and deployed on Amazon SageMaker.
  • Plagirasm Detector web app: Creating plagiarism detector trained on LSC and containments features and deployed on AWS SageMaker.
  • Data Science Resume Selector: Selecting the resume that are eligbile to data scientist postions, the dataset used contains 125 resumes, in the resumetext column. Resumes were queried from Indeed.

Time-series Analysis


Data Visulization:


Spark


Data Modeling

  • Songs App User Activity Data Modeling: Modeling user activity data for a music streaming app called Sparkify to optimize queries for understanding what songs users are listening to by creating a Postgres relational database and ETL pipeline to build up Fact and Dimension tables and insert data into new tables.
  • Songs App data modeling using Apache Casandra: Create an Apache Cassandra database which can create queries on song play data to answer analysis questions.

Certificates


Course Work

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published