This repository contains the code for a study that characterizes and identifies bots in a social network using only the network structure. The study considers a directed network of users built from Reddit comments. The network characterization highlights significant structural differences between bots and other users, allowing bots to be classified using only network features.
This work was supervised by Professor Daniel R. Figueiredo (PESC / COPPE / UFRJ).
- Full Paper on Brazilian Computing Society (pt-BR), presented at the XXXIX Congresso da Sociedade Brasileira de Computação
- Undergraduate Project (Poli / COPPE / UFRJ) (pt-BR), as partial fulfillment of the requirements for the degree of Computer and Information Engineer
The code developed for this work is compatible with Python 3.7.8.
To install all dependencies and use Jupyter Notebook with a virtual environment:
python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt
python3 -m ipykernel install --user --name=venv
jupyter notebook
The notebooks folder contains the Jupyter Notebook files, which can also be used to evaluate other datasets.
Reddit comment datasets are available from Pushshift. Place the uncompressed files in data/raw/.
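To illustrate how a directed user network can be derived from comment records, here is a minimal sketch using only the Python standard library. It is not the code from the notebooks. It assumes each input line is a JSON object with the `id`, `author`, and `parent_id` fields found in Pushshift comment dumps, and it assumes an edge points from the replying user to the author of the parent comment; the edge direction used in the study may differ. The function names `build_reply_edges` and `degree_features` are hypothetical.

```python
import json
from collections import Counter, defaultdict


def build_reply_edges(lines):
    """Count replies per (replier, parent_author) pair (assumed edge direction)."""
    comments = [json.loads(line) for line in lines if line.strip()]
    # Comment full names are the id prefixed with "t1_"; map them to authors.
    author_of = {"t1_" + c["id"]: c["author"] for c in comments}
    edges = Counter()
    for c in comments:
        parent_author = author_of.get(c["parent_id"])
        # Skip replies to submissions ("t3_..."), parents missing from the
        # sample, and deleted accounts.
        if parent_author is None or "[deleted]" in (c["author"], parent_author):
            continue
        edges[(c["author"], parent_author)] += 1
    return edges


def degree_features(edges):
    """Return {user: (in_degree, out_degree)} over distinct neighbors."""
    in_deg = defaultdict(int)
    out_deg = defaultdict(int)
    for src, dst in edges:
        out_deg[src] += 1
        in_deg[dst] += 1
    users = set(in_deg) | set(out_deg)
    return {u: (in_deg[u], out_deg[u]) for u in users}


# Tiny hand-made sample; real files would be read line by line from data/raw/.
sample = [
    '{"id": "a1", "author": "alice", "parent_id": "t3_post"}',
    '{"id": "b1", "author": "bob", "parent_id": "t1_a1"}',
    '{"id": "c1", "author": "carol", "parent_id": "t1_a1"}',
    '{"id": "a2", "author": "alice", "parent_id": "t1_b1"}',
]
edges = build_reply_edges(sample)
features = degree_features(edges)
for user in sorted(features):
    print(user, features[user])
```

In-degree and out-degree are only the simplest structural features; the same edge list could feed a graph library to compute richer ones.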