Skip to content

hubanton/Cub-200-Scripts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

53 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cub-200-Scripts

Collection of all scripts / data used for gathering information about all 200 birds listed in the CUB-200 Dataset

Since different naming patterns are used depending on the site, different text files exist depending on the used API:

  1. xeno-canto-names.txt: These are the names used for downloading from xeno-canto. Since the API does a partial matching on names, some birds are referenced by their scientific name
  2. botw-names.txt: Names used for webscrapper, only slight differences to xeno-canto
  3. latin-names.txt: Scientific names of birds, is used for botw-scraper and for general naming [These do not work with the scrapper, xeno due to naming differences, partial name matching]
  4. CUB-200-names.txt: The class labels as found in the CUB-200 dataset

Additional Info

Additional infos regarding differences in species, ambiguous bird names etc can be found in additional info/

Running the BOTW-Webscraper

  1. Install all packages
  2. Inside the working folder, create a .env with your login data (Required!)
  3. Select whether to install pdfs, textfiles or both
  4. Run the script

Running the Xeno-Canto-Downloader

  1. Install all packages
  2. Run the script (This stores all audio files and meta-files inside the current working directory (separate folders) and also keeps track of bird_names which are ambiguous or have a limited number of recordings)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published