
The main script in this repository uses Python's Beautiful Soup package to web-scrape Etsy shop data.


hdlugas/etsy_shop_scraper


Etsy Shop Scraper

The script 'shop_scraper.py':

  1. takes two inputs: i) a keyword to search for in the Etsy shop search engine and ii) the total number of result pages to scrape (each page lists 10 shops as of 11/2022).
  2. writes a CSV file with one row per scraped Etsy shop and the columns ['shop_name', 'shop_id', 'currency_id', 'country_id', 'listing_enabled', 'browsing_enabled', 'buyer_location_restricted', 'rate_updated_enabled', 'test_account', 'accepts_custom_requests', 'number_admirers', 'total_sales', 'starting_year', 'number_reviews', 'avg_rating', 'number_items_listed', 'avg_price', 'std_price', 'avg_feat_price', 'std_feat_price', 'number_feature_items']. The file is written to a path that you must change before running the script.
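The output step in item 2 can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code in 'shop_scraper.py': the helper name `write_shops_csv` and its arguments are hypothetical, and only the column names come from the list above.

```python
import csv

# Column names as listed in this README.
COLUMNS = [
    'shop_name', 'shop_id', 'currency_id', 'country_id', 'listing_enabled',
    'browsing_enabled', 'buyer_location_restricted', 'rate_updated_enabled',
    'test_account', 'accepts_custom_requests', 'number_admirers',
    'total_sales', 'starting_year', 'number_reviews', 'avg_rating',
    'number_items_listed', 'avg_price', 'std_price', 'avg_feat_price',
    'std_feat_price', 'number_feature_items',
]


def write_shops_csv(shops, output_path):
    """Hypothetical helper: write one CSV row per shop dictionary.

    `shops` is an iterable of dicts keyed by the names in COLUMNS.
    Missing keys are written as empty cells; unknown keys raise ValueError.
    """
    with open(output_path, 'w', newline='', encoding='utf-8') as f:
        writer = csv.DictWriter(f, fieldnames=COLUMNS, restval='')
        writer.writeheader()
        for shop in shops:
            writer.writerow(shop)
```

Calling `write_shops_csv([{'shop_name': 'ExampleShop', 'total_sales': 12}], 'shops.csv')` would produce a header row followed by one row with the remaining columns left empty. The shop name and file name here are placeholders.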

The correlation matrix of the processed data frame:
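A correlation matrix like this one could be computed along the following lines. This is a sketch that assumes the CSV has been loaded into a pandas data frame; the values below are invented for illustration and are not scraped data, and the repository's own processing steps may differ.

```python
import pandas as pd

# Made-up values for illustration only; column names follow the CSV above.
df = pd.DataFrame({
    'shop_name': ['a', 'b', 'c', 'd'],
    'total_sales': [10, 20, 30, 40],
    'number_reviews': [1, 2, 3, 4],
    'avg_rating': [4.0, 4.5, 3.5, 5.0],
})

# Keep only numeric columns, then compute pairwise Pearson correlations.
corr = df.select_dtypes(include='number').corr()
print(corr)
```

Non-numeric columns such as 'shop_name' are dropped first, since a Pearson correlation is only defined for numeric data.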

The distribution of a given variable (taken here to be shop rating):

The distribution of a given variable (e.g. accepts custom requests flag, number of items listed, average item price, number of feature items) grouped by a given categorical variable (e.g. average shop rating):
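One way to produce a grouped summary of this kind is sketched below, assuming pandas. The rating bin edges and labels are assumptions chosen for illustration, since this README does not state how ratings are grouped, and the data values are invented.

```python
import pandas as pd

# Made-up values for illustration only.
df = pd.DataFrame({
    'avg_rating': [3.2, 4.1, 4.8, 4.9, 3.9, 4.6],
    'number_items_listed': [5, 12, 40, 33, 8, 20],
})

# Assumed binning: turn the average rating into a categorical variable.
df['rating_bin'] = pd.cut(
    df['avg_rating'],
    bins=[0, 4.0, 4.5, 5.0],
    labels=['0-4.0', '4.0-4.5', '4.5-5.0'],
)

# Summarize the distribution of one variable within each rating bin.
summary = (
    df.groupby('rating_bin', observed=True)['number_items_listed']
      .agg(['count', 'mean', 'median'])
)
print(summary)
```

The same pattern applies to the other variables mentioned above by swapping the column name passed to the group-by selection.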
