medeng/big-data-exploration

 
 


MongoDB Big-Data-Exploration Project

This project poses, investigates, and answers questions about big datasets, using MongoDB for both storage and computation. Carried out as a summer internship project, it also shows how to answer questions about large datasets stored in MongoDB using MongoDB's frameworks and connector. Both MongoDB's native aggregation framework and Hadoop were used to explore the data.

The data for this project comes from two major sources: the Flights dataset and the Twitter-Memes dataset, both described in the Roadmap.

Roadmap

This project is divided into three sections, each with in-depth wiki pages describing our steps and observations:

  • Basic-Flights - Basic analysis on the Flights dataset using MongoDB Aggregation Framework
  • PageRank-Flights - Computing PageRank over the Flights dataset using the MongoDB MapReduce Framework
  • Twitter-Memes - Computing PageRank over the Twitter-Memes dataset using Hadoop and associated frameworks/languages (like Apache Pig, Amazon EMR)
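
To give a feel for the first two sections, here is a minimal Python sketch, not the project's actual code. It assumes a hypothetical `origin` field on each flight document (the real schema is described in the wiki pages, not here) and represents the flight network as a plain dictionary mapping each airport to the airports it flies to. `flights_per_origin_pipeline`, `pagerank_step`, and `pagerank` are illustrative names. The PageRank part mimics the map and reduce phases in memory rather than calling MongoDB's MapReduce, and it redistributes the rank of airports with no outgoing flights evenly, which is one common convention among several.

```python
from collections import defaultdict


def flights_per_origin_pipeline(limit=10):
    """Build an aggregation pipeline counting flights per origin airport.

    The "origin" field name is an assumption made for illustration.
    """
    return [
        {"$group": {"_id": "$origin", "flights": {"$sum": 1}}},
        {"$sort": {"flights": -1}},  # busiest airports first
        {"$limit": limit},
    ]


def pagerank_step(graph, ranks, damping=0.85):
    """Run one PageRank iteration as a map phase followed by a reduce phase.

    Assumes every destination also appears as a key in `graph`.
    """
    n = len(graph)
    emitted = defaultdict(list)
    dangling = 0.0
    # Map: each airport emits an equal share of its rank to every destination.
    for node, neighbours in graph.items():
        if neighbours:
            share = ranks[node] / len(neighbours)
            for dest in neighbours:
                emitted[dest].append(share)
        else:
            # No outgoing flights: spread this rank over all airports later.
            dangling += ranks[node]
    # Reduce: sum the shares received by each airport and apply damping.
    return {
        node: (1 - damping) / n + damping * (sum(emitted[node]) + dangling / n)
        for node in graph
    }


def pagerank(graph, iterations=50, damping=0.85):
    """Iterate pagerank_step from a uniform starting distribution."""
    n = len(graph)
    ranks = {node: 1.0 / n for node in graph}
    for _ in range(iterations):
        ranks = pagerank_step(graph, ranks, damping)
    return ranks


if __name__ == "__main__":
    toy_network = {"JFK": ["LAX", "ORD"], "LAX": ["JFK"], "ORD": ["JFK"]}
    for airport, rank in sorted(pagerank(toy_network).items()):
        print(airport, round(rank, 3))
```

With PyMongo, the pipeline would typically be passed to `collection.aggregate(...)` on the collection holding the flight documents. In a real MapReduce job, each iteration's output would be written back to a collection and fed into the next iteration instead of being kept in a Python dictionary.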

Contributors
