SER_dataloader

This project is part of a master's thesis for the: course DA231X - Degree Project in Computer Science and Engineering, Second Cycle 30hp

These scripts are meant to load and transform datasets into Dataframes for ease of use (that is, no actual data, just preparation scripts). The data is loaded by reading file_names from the corresponding dataset-directories (see below how for directory structure).

Get Started

Install

Install with pip:

pip install git+https://github.com/felixlut/SER_dataloader.git@main

Usage

The package is meant to be accessed via python like this:

from dataloader import Dataloader
dataset_name = 'cremad' # /cremad/emodb/...
data = Dataloader(data_args.data_path, force_reload=False) # After the first instance of this command, the dataset is cached. force_reload=True for circumventing the caching
dataset_as_df = data.load_dataset(dataset_name)

Data

The data is supposed to be stored in .../datasets/{dataset_name}, and should have the below described structure overall.

.
├── datasets
    ├── dataset_name
        ├── wav/wav                         # 2 layers of wav, kaggle datahandling is weird...
        |   ├── file_abc123.wav
        |   └── ...
        ├── wav_telephone/wav_telephone     # same as for wav, 2 layers cuz kaggle...
        |   ├── file_abc123.wav
        |   └── ...
        └── meta_data.csv                   # Possibly extra csv:s containing some metadata used for constructing the dataframes, ex. actor_2_gender.csv

Supported Datasets

The following datasets are supported, and have to have the same directory name mentioned below (don't mind the inconsistency of the data postfix, kaggle is once again coming through with only the highest quality of data-handling). These can also be derived from the correspponding scripts.

Dataset - directory_name
Cremad-D - cremad
~~ESD - esd-data~~ (Changed license, no longer commercially viable)
Subesco - subesco
~~emoDB - emodb-data~~ (License not specified)
MESD - mesddata
~~EMOVO - emovodata~~ (License not specified)
Oréau - oreaudata
emoUERJ - emouerj

Name		Name	Last commit message	Last commit date
Latest commit History 184 Commits
dataloader		dataloader
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SER_dataloader

Get Started

Install

Usage

Data

Supported Datasets

About

Releases

Packages

Languages

felixlut/SER_dataloader

Folders and files

Latest commit

History

Repository files navigation

SER_dataloader

Get Started

Install

Usage

Data

Supported Datasets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages