Aim

Within a user specified timeframe, this programme extracts the Twitter data (Tweets) that matches the user specified query parameters i.e. a list of hashtags and phrases

Data Constraints

The programme filters out any retweets, replies, non-image tweets and also sanitises tweets by removing the non-ASCII characters. This was done on the request of the user.

Result

'Twitter_API_Result.csv': Stores the following information related to the data constrained tweets; Twitter Handle, Time Stamp, Tweet(Description), Names of Images associated to each Tweet,URL of images, URL of Tweets
Images are downloaded and stored locally using their URL extracted from Tweets
'Error_Log.txt': Logs any errors generated during the API call

Structure of Folder

'TwitterAPI_Socialmedia_Extract.py': Main Python file to run
'credentials.txt': Contains the credentials of a Twitter Developer Account
'hashtags.txt': Contains the list of hashtags (or any other query parameter) separated by ','. To understand the format of this file, check the Procedure Section

Installation

Installing 'Tweepy' library in Anaconda on Mac OS

conda install -c conda-forge tweepy

Procedure

Clone the 'credentials.txt' and replace the keys and tokens with the your credentials obtained via Twitter Developer Account
Clone the 'hashtags.txt' and enter the query parameters separated by ','.All the queries in the first row are OR'd. This expression is then AND'd with the OR'd query expressions of second row. Example:

2a. Row1: Science,Data

2b. Row2: Research,Academic

2c. Final Query= (Science OR Data) AND (Research OR Academic)

Run 'TwitterAPI_Socialmedia_Extract.py'

User Inputs

The following inputs need to be provided to run the algorithm

Start date of search (Cannot be more than 7 days from the current date due to Twitter Policy)
End data of search
Number of pages to be returned. Each page contains multiple tweets
Number of tweets to be returned per page

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.DS_Store		.DS_Store
README.md		README.md
Twitter Handout.pdf		Twitter Handout.pdf
TwitterAPI_Socialmedia_Extract.py		TwitterAPI_Socialmedia_Extract.py
credentials.txt		credentials.txt
hashtags.txt		hashtags.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aim

Data Constraints

Result

Structure of Folder

Installation

Procedure

User Inputs

Documentation

About

Releases

Packages

Languages

RonakSharma1/Twitter_Data_Collection

Folders and files

Latest commit

History

Repository files navigation

Aim

Data Constraints

Result

Structure of Folder

Installation

Procedure

User Inputs

Documentation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages