This is the repository for the newly created Czech Subjectivity Dataset (Subj-CS) and our paper:
Accepted to LREC 2022 conference.
The Czech Subjectivity Dataset is available for download from this https://drive.google.com/file/d/1R0bPPWJ7sdIaCxyPrO_rmTVFNNsd9RaI/view?usp=sharing
The dataset is also available in the HuggingFace Datasets
We will add usage and setup soon.
python3 baseline.py...
Create conda enviroment
-
git clone git@github.com:pauli31/czech-subjectivity-dataset.git
The dataset and code can be freely used for academic and research purposes. It is strictly prohibited to use the dataset for any commercial purpose.
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
If you use our dataset or software for academic research, please cite our paper
@inproceedings{priban-steinberger-2022-czech,
title = "{C}zech Dataset for Cross-lingual Subjectivity Classification",
author = "P{\v{r}}ib{\'a}{\v{n}}, Pavel and
Steinberger, Josef",
booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
month = jun,
year = "2022",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2022.lrec-1.148",
pages = "1381--1391",
}