I am a German student, who is passionate about Data Engineering and Infrastructure. I have work experience in Data- and Software Engineering. In my free time I like learning new skills and concepts and building completely overengineered projects, which are showcased on my Github.
Infrastructure / Data Platform / Data Engineering 🏗
- Distributed System on Aws streaming earthquakes using Kafka
- ELT batch processing on Aws and data modeling with DBT
- Sample Data Lakehouse architecture, deployed in containers
- Simple beginners guide to containerization with Docker, with a focus on storage and build time reduction
Open source contributions 💡
Project | Added | Link |
---|---|---|
Apache Airflow | Functionality and respective unit tests to export and import roles including permissions using the Airflow CLI | Merged Pull-Request |
Apache Airflow | Changed the Airflow docker-compose to easily ingest custom config files and added relevant documentation | Merged Pull-Request |
PM4PY | Functionality to filter for a maximum coverage percentage of graph variants | Merged Pull-Request |