Skip to content

r4ghu/IntroToPandas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

IntroToPandas

This repo contains material covered in 'Intro to Pandas' talk given as a part of Victoria Data Science Meetup. For more details about the Data Science community in Victoria, please visit the following link - Victoria Data Science Meetup

Requirements

Environment Setup

  • Create Anaconda environment. Open your terminal and type the following commands.
$ conda create -n Intro2Pandas python=3.6
$ source activate Intro2Pandas
(Intro2Pandas) $ conda install pandas
(Intro2Pandas) $ conda install jupyter notebook

Dataset

For this talk we will be using the Kaggle's Competition dataset - 'House Prices: Advanced Regression Techniques'. Kaggle is a great place to participate and learn Data Science related concepts and it has a rich community ranging from Beginners to Advanced. If you haven't created an account yet, please sign up, join the competition and download the data from the following link - https://www.kaggle.com/c/house-prices-advanced-regression-techniques

Tutorial

Once the environment is setup. You can run jupyter notebook and start following the Intro to Pandas.ipynb notebook. The objectives of the talk are as follows:

  • Get familiarized with Kaggle
  • Understand how to use Pandas library
  • Cover foundations for data science.
  • Submit final results to the Kaggle competition.

By the end of this talk, you will see yourself on the leaderboard of an ongoing data science competition on Kaggle :-)

Author

Sri Raghu Malireddi / @r4ghu