Big Data Management Systems @ CEID

This assignment implements a typical pipeline for real-time data ingestion, processing, and storage in a NoSQL database (MongoDB).

Data Generation: A Python script takes the results of the UXsim simulator and sends them to a Kafka broker at regular intervals.
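The data generation step can be sketched roughly as below. This is a minimal illustration, not the repository's script: the function name `publish_records`, the JSON encoding, and the injected `send` callable are all assumptions made so the example runs without a broker.

```python
import json
import time
from typing import Any, Callable, Iterable, Mapping


def publish_records(
    send: Callable[[str, bytes], Any],
    topic: str,
    records: Iterable[Mapping[str, Any]],
    interval_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Send each record as a UTF-8 JSON message, pausing after every send.

    `send` is any callable taking (topic, payload); it is injected so the
    pacing logic can be exercised without a running Kafka broker.
    """
    sent = 0
    for record in records:
        # Hypothetical wire format: one JSON object per Kafka message.
        payload = json.dumps(record, sort_keys=True).encode("utf-8")
        send(topic, payload)
        sent += 1
        sleep(interval_s)  # the "regular interval" between messages
    return sent
```

With the kafka-python package, the bound method `KafkaProducer(bootstrap_servers="localhost:9092").send` could be passed as `send`, since it accepts the topic and the value as its first two arguments. The broker address and topic name are placeholders to adapt.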

Real-time Processing: An Apache Spark application consumes the data from the Kafka broker and processes it in real time.
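One way the Spark side could look, using Structured Streaming, is sketched below. The message fields (`link`, `speed`), the per-link aggregation, and both function names are hypothetical; the source does not say what processing is performed. Running it also assumes the Spark Kafka connector package (`spark-sql-kafka-0-10`) is available to the session.

```python
from typing import Dict


def kafka_source_options(bootstrap_servers: str, topic: str) -> Dict[str, str]:
    """Options for Spark's Kafka source (values are placeholders to adapt)."""
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        "startingOffsets": "latest",
    }


def read_link_statistics(spark, bootstrap_servers: str, topic: str):
    """Return a streaming DataFrame with per-link vehicle count and mean speed.

    `spark` is an existing SparkSession. pyspark is imported lazily so the
    helper above can be used without Spark installed.
    """
    from pyspark.sql.functions import avg, col, count, from_json
    from pyspark.sql.types import DoubleType, StringType, StructField, StructType

    # Hypothetical message schema: one vehicle observation per message.
    schema = StructType([
        StructField("link", StringType()),
        StructField("speed", DoubleType()),
    ])

    raw = (
        spark.readStream.format("kafka")
        .options(**kafka_source_options(bootstrap_servers, topic))
        .load()
    )
    # Kafka delivers the value as bytes; cast to string, then parse the JSON.
    parsed = raw.select(
        from_json(col("value").cast("string"), schema).alias("data")
    ).select("data.*")

    return parsed.groupBy("link").agg(
        count("*").alias("vehicle_count"),
        avg("speed").alias("avg_speed"),
    )
```

The returned DataFrame is only a query definition; nothing runs until it is started with `writeStream` and an output sink.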

Storage in a NoSQL Database: Both the raw data and its processed form (produced by Spark) are stored in a MongoDB database.
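The storage step might be sketched as follows. The helper name `store_documents` is hypothetical; the only driver call assumed is `insert_many`, which pymongo collections provide.

```python
from typing import Any, Iterable, Mapping


def store_documents(collection: Any, documents: Iterable[Mapping[str, Any]]) -> int:
    """Insert documents into a MongoDB-style collection; return how many.

    `collection` is any object exposing `insert_many(list_of_dicts)`,
    such as a pymongo Collection.
    """
    # Copy each document: pymongo adds an `_id` key to the dicts it inserts,
    # and the caller's objects should stay unchanged.
    docs = [dict(d) for d in documents]
    if not docs:
        # pymongo rejects an empty list, so skip the call entirely.
        return 0
    collection.insert_many(docs)
    return len(docs)
```

With pymongo, raw and processed data could go to two collections of the same database, for example `store_documents(db["raw"], raw_batch)` and `store_documents(db["processed"], processed_batch)`, where `db` comes from a `MongoClient` and the collection names are placeholders.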

