datagrokr-coding-challenge
This project contains script for downloading a file from S3 and finding the top 5 most frequent words in that file.
- data.py - Python script to download the file from s3 and count the most frequent words. Python modules used : boto3.
- aws_credentials - File to hold the aws credentials with permissions to download the file from s3.
- Dockerfile - Dockerfile to package the environment and dependencies in order to run the python script. Dependencies : python-pip, boto3.