This repository contains the code for a multi-label text classification pipeline. The documentation can be found here.
- Create a new conda environment:
conda create -n multiscope python=3.12.5
- Activate the environment:
conda activate multiscope
- Clone the repository:
git clone https://github.com/clips/multiscope.git
- Change the working directory:
cd multiscope
- Install the requirements:
pip3 install -r requirements.txt
Use Multiscope in Google Colab: https://colab.research.google.com/drive/1NexZF1CtWdsxkLn0hqFT7Y12T9um4oee?usp=sharing
- Activate the
multiscope
conda environment:conda activate multiscope
- First, run
python setup_nltk.py
to install all necessary NLTK-related files. - To run the pipeline in a Gradio User Interface, run (CUDA_VISIBLE_DEVICES=X)
python app.py
and browse to http://127.0.0.1:7860.
Jens Van Nooten and Walter Daelemans. 2024. Multiscope: A User-Friendly Multi-Label Text Classification Dashboard. CLiPS Technical Report Series 11 (CTRS 11). ISSN 2033-3544. Computational Linguistics, Psycholinguistics, and Sociolinguistics Research Center.