Skip to content

1.0.0 SC2_Datasets Release

Compare
Choose a tag to compare
@Kaszanas Kaszanas released this 28 Jul 22:57
· 48 commits to main since this release

v1.0.0 (2022/07/28)

  • Changed repository name to "SC2_Datasets"
  • Changed package name to sc2_datasets
  • Inroduced poetry to the project.
  • Added Contributor License Agreement (CLA)
  • Added GitHub Actions for continuous integration (CI):
    • Linter execution
    • Testing
    • Documentation building
  • Added GitHub Actions for continous deployment (CD):
    • Uploading to test.pypi
    • Uploading to pypi

StarCraft II Datasets

Library can be used to interface with datasets that were pre-processed with our pipeline
as described in:

Currently we have exposed PyTorch and PyTorch Lightning API. Our goal is to provide
infrastructure used for StarCraft II analytics.

Supported Datasets

SC2EGSet: StarCraft II Esport Game State Dataset

This project contains official API implementation for the SC2EGSet: StarCraft II Esport Game State Dataset, which is built based on SC2ReSet: StarCraft II Esport Replaypack Set.
Contents of this library provide PyTorch and PyTorch Lightning API for pre-processed StarCraft II dataset.

Installation

  1. Manually install PyTorch with minimal version of ^1.11.0+cu116.
  2. Perform the following command:
$ pip install sc2_datasets

Usage

Basic example usage can be seen below. For advanced interactions with the datasets
please refer to the documentation.

Use SC2EGSet with PyTorch:

from sc2_datasets.torch.sc2_egset_dataset import SC2EGSetDataset
from sc2_datasets.available_replaypacks import EXAMPLE_SYNTHETIC_REPLAYPACKS

if _name_ == "_main_":
    # Initialize the dataset:
    sc2_egset_dataset = SC2EGSetDataset(
        unpack_dir="./unpack_dir_path",           # Specify existing directory path, where the data will be unpacked.
        download_dir="./download_dir_path",       # Specify existing directory path, where the data will be downloaded.
        download=True,
        names_urls=EXAMPLE_SYNTHETIC_REPLAYPACKS, # Use a synthetic replaypack containing 1 replay.
    )

    # Iterate over instances:
    for i in range(len(sc2_egset_dataset)):
        sc2_egset_dataset[i]

Use SC2EGSet with PyTorch Lightning:

from sc2_datasets.lightning.sc2egset_datamodule import SC2EGSetDataModule
from sc2_datasets.available_replaypacks import EXAMPLE_SYNTHETIC_REPLAYPACKS

if _name_ == "_main_":
    sc2_egset_datamodule = SC2EGSetDataModule(
                unpack_dir="./unpack_dir_path",            # Specify existing directory path, where the data will be unpacked.
                download_dir="./download_dir_path",        # Specify existing directory path, where the data will be downloaded.
                download=False,
                replaypacks=EXAMPLE_SYNTHETIC_REPLAYPACKS, # Use a synthetic replaypack containing 1 replay.
            )

Contributing

Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Contributor License Agreement (CLA) and a Code of Conduct. By contributing to this project, you agree to abide by its terms.

License

sc2egset_dataset was created by Andrzej Białecki. It is licensed under the terms of the GNU General Public License v3.0 license.

Citations

This Repository

If you wish to cite the official API for the SC2EGSet: StarCraft II Esport Game State Dataset.

@software{bialecki_andrzej_2022_6629006,
  author       = {Białecki, Andrzej and
                  Białecki, Piotr and
                  Szczap, Andrzej and
                  Krupiński, Leszek},
  title        = {{Kaszanas/SC2EGSet\_Dataset: 0.9.0 SC2EGSet\_Dataset 
                   Release}},
  month        = jun,
  year         = 2022,
  publisher    = {Zenodo},
  version      = {0.9.0},
  doi          = {10.5281/zenodo.6629005},
  url          = {https://doi.org/10.5281/zenodo.6629005}
}

Dataset Description Pre-print

To cite the article that introduces SC2ReSet and SC2EGSet use this:

@misc{https://doi.org/10.48550/arxiv.2207.03428,
  doi       = {10.48550/ARXIV.2207.03428},
  url       = {https://arxiv.org/abs/2207.03428},
  author    = {Białecki, Andrzej and Jakubowska, Natalia and Dobrowolski, Paweł and Białecki, Piotr and Krupiński, Leszek and Szczap, Andrzej and Białecki, Robert and Gajewski, Jan},
  keywords  = {Machine Learning (cs.LG), Artificial Intelligence (cs.AI), Machine Learning (stat.ML), FOS: Computer and information sciences, FOS: Computer and information sciences},
  title     = {SC2EGSet: StarCraft II Esport Replay and Game-state Dataset},
  publisher = {arXiv},
  year      = {2022},
  copyright = {Creative Commons Attribution 4.0 International}
}

SC2ReSet: StarCraft II Esport Replaypack Set

To cite the replay collection that was used to generate the dataset use this:

@dataset{bialecki_andrzej_2022_5575797,
  author       = {Białecki, Andrzej},
  title        = {SC2ReSet: StarCraft II Esport Replaypack Set},
  month        = jun,
  year         = 2022,
  publisher    = {Zenodo},
  version      = {1.0.0},
  doi          = {10.5281/zenodo.5575797},
  url          = {https://doi.org/10.5281/zenodo.5575797}
}

SC2EGSet: StarCraft II Esport Game State Dataset

To cite the data itself use this:

@dataset{bialecki_andrzej_2022_6629349,
  author       = {Białecki, Andrzej and
                  Jakubowska, Natalia and
                  Dobrowolski, Paweł and
                  Szczap, Andrzej and
                  Białecki, Robert and
                  Gajewski, Jan},
  title        = {SC2EGSet: StarCraft II Esport Game State Dataset},
  month        = jun,
  year         = 2022,
  publisher    = {Zenodo},
  version      = {1.0.0},
  doi          = {10.5281/zenodo.6629349},
  url          = {https://doi.org/10.5281/zenodo.6629349}
}