Skip to content

sienlonglim/sienlonglim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

55 Commits
ย 
ย 

Repository files navigation

  • ๐Ÿ‘‹ Hi, Iโ€™m Static Badge
  • ๐Ÿ‘€ Iโ€™m interested in Data Science, Data Engineering and Data Analytics. Anything related to AI/ML!
  • ๐ŸŒฑ Iโ€™m currently pursuing a Masters of Science in Analytics at Georgia Tech.
  • ๐Ÿ’ž๏ธ Iโ€™m looking to collaborate on anything that can help me learn.
  • ๐Ÿ“ซ How to reach me Static Badge Static Badge Static Badge

Personal projects:

  1. Dagster-dbt-duckdb Pipeline
    Static Badge GitHub commit activity (branch)
  • Dagster for orchestration
  • S3 as data lake
  • Dbt for data modeling and transformation
  • DuckDB for data warehousing

  1. Document Query Bot (RAG Framework)
    Static Badge GitHub commit activity (branch) Static Badge
  • Document splitting
  • Embeddings (OpenAI)
  • Vector database (Chroma / FAISS)
  • Semantic search
  • Retrieval chain

  1. Job Finder
    Static Badge GitHub commit activity (branch) scheduled run main.py
  • Data mining
  • Automation with GitHub runners
  • S3 and SMTP integration

  1. HDB Resale Prices Predictor and Dashboard
    Static Badge GitHub commit activity (branch) Static Badge Static Badge Static Badge Static Badge
  • Large dataset involving geodata ๐ŸŒ
  • Rest API calls to Data.gov.sg and OneMap API ๐Ÿ—บ๏ธ
  • Feature creation and selection (KBest, L1 Regularisation)
  • Hyperparameter tuning (Random CV)
  • Ensemble models (Gradient boosting, Random forest)
  • Web Application (Flask) with Bootstrap 5

  1. Stock portfolio analysis (K-means), forecasting (ARIMA) and stock recommendation
    Static Badge GitHub commit activity (branch)
  • Web Scrapping (BS4)
  • ETL
  • RDBMS (MySQL)
  • K means clustering
  • ARIMA

  1. Web application for SkillsFuture website attendance taking summary
    Static Badge GitHub commit activity (branch)
  • Web Application (Flask) with Bootstrap 5
  • Telegram Bot API
  • RDBMS (MariaDB)

  1. EDA of Real Anonymized Financial Dataset with SQL (Czech Republic PKDD 99' Discovery Challenge)
    Static BadgeGitHub commit activity (branch) Static Badge
  • Database design (MariaDB - CLI, visualizer)
  • MariaDB with CLI and DBVisualizer
  • SQL queries, connectors
  • Report writing

Releases

No releases published

Packages

No packages published