jyotibhanot/datagrokr-coding-challenge

datagrokr-coding-challenge

S3 file processing

This project contains a script that downloads a file from S3 and finds the five most frequent words in it.

Deliverable: 1

  1. data.py - Python script that downloads the file from S3 and counts the most frequent words. Python modules used: boto3.
  2. aws_credentials - File that holds the AWS credentials, which need permission to download the file from S3.
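
The two steps described above (download, then count) could be sketched roughly as follows. This is a minimal illustration, not the contents of data.py: the function names `top_words` and `download_text` are hypothetical, the word-splitting rule is an assumption, and credentials are assumed to be picked up through boto3's default lookup (for example by pointing the `AWS_SHARED_CREDENTIALS_FILE` environment variable at the aws_credentials file).

```python
import re
from collections import Counter


def top_words(text, n=5):
    """Return the n most frequent words in text as (word, count) pairs."""
    # Assumption: a "word" is a run of letters, digits, or apostrophes,
    # compared case-insensitively.
    words = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(words).most_common(n)


def download_text(bucket, key):
    """Fetch an S3 object and decode it as UTF-8 text."""
    # Imported here so top_words can be used without boto3 installed.
    import boto3

    s3 = boto3.client("s3")
    body = s3.get_object(Bucket=bucket, Key=key)["Body"].read()
    return body.decode("utf-8")
```

With a placeholder bucket and key, the two functions would be combined as `top_words(download_text("my-bucket", "input.txt"))`. Keeping the counting logic separate from the download makes it possible to check the word count on a plain string without touching S3.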

Deliverable: 2

  1. Dockerfile - Packages the environment and dependencies needed to run the Python script. Dependencies: python-pip, boto3.
