This is a simple application built on top of Flask that ranks compounds by the number of BioAssays and BioSystems found for them in PubChem. It provides a web-based interface for viewing the ranked compounds, along with commands for setting up and running the ranker.
- Clone this repository.
- Make sure you have Python 3 installed.
- cd into the project directory and run
  pip install -r requirements.txt
- Go to
  CompoundRanker/__init__.py
  and fill in the "ADMIN_EMAIL" setting. This is required for PubChem.
- Run
  python manage.py initdb
  to initialize the database.
- Run
  python manage.py fillmetabs <path> <name>
  where path is the absolute path to the CSV file containing the dataset and name is the name you wish to give the dataset.
- Run
  python manage.py fillcids <name>
  to gather the CIDs and fill the CIDs table, where name is the dataset name.
- Run
  python manage.py fillcounts <name>
  to fill the counts table, where name is the name of the dataset you wish to count. This step takes a long time.
- Run
  python manage.py runserver
  to run the server.
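The database setup commands above have to run in a fixed order, so it can be convenient to script them. The following is only a sketch: the helper names `build_commands` and `run_pipeline` are hypothetical and not part of this project, and it assumes the script is run from the project directory with the same interpreter that has the requirements installed. By default it only prints the commands it would run.

```python
import subprocess
import sys


def build_commands(csv_path, dataset_name):
    """Return the manage.py setup commands in the order listed in the README.

    Hypothetical helper: csv_path is the absolute path to the dataset CSV,
    dataset_name is the name you want to give the dataset.
    """
    # Assumption: manage.py is in the current working directory.
    manage = [sys.executable, "manage.py"]
    return [
        manage + ["initdb"],
        manage + ["fillmetabs", csv_path, dataset_name],
        manage + ["fillcids", dataset_name],
        manage + ["fillcounts", dataset_name],  # the slow step
    ]


def run_pipeline(csv_path, dataset_name, dry_run=True):
    """Print the commands (dry_run=True) or execute them one after another."""
    for command in build_commands(csv_path, dataset_name):
        if dry_run:
            print(" ".join(command))
        else:
            # check=True stops the pipeline as soon as one step fails.
            subprocess.run(command, check=True)


if __name__ == "__main__":
    # Placeholder path and name; replace with your own dataset.
    run_pipeline("/absolute/path/to/dataset.csv", "my_dataset")
```

Pass `dry_run=False` once the printed commands look right. The server is left out on purpose, since `python manage.py runserver` keeps running until it is stopped.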
Datasets must be in CSV format with one compound per row. Each row starts with the CAS number, followed by the IUPAC name in parentheses. This data must be in the first column; anything in other columns is ignored. The expected format is:
<CAS> (<IUPAC>)
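
To check a dataset before loading it, the format above can be validated with a few lines of Python. This is a minimal sketch and not the parser the application itself uses: `ROW_PATTERN` and `parse_rows` are hypothetical names, and the regular expression assumes the standard CAS Registry Number layout (two to seven digits, two digits, one check digit, separated by hyphens). Rows that do not match are skipped.

```python
import csv
import io
import re

# Assumed row shape: "<CAS> (<IUPAC>)" in the first column.
# The greedy ".*" lets IUPAC names contain their own parentheses.
ROW_PATTERN = re.compile(
    r"^\s*(?P<cas>\d{2,7}-\d{2}-\d)\s*\((?P<iupac>.*)\)\s*$"
)


def parse_rows(lines):
    """Return (cas, iupac) tuples from an iterable of CSV lines."""
    compounds = []
    for row in csv.reader(lines):
        if not row:
            continue  # blank line
        match = ROW_PATTERN.match(row[0])  # only the first column is read
        if match is None:
            continue  # row does not follow the expected format
        compounds.append((match.group("cas"), match.group("iupac").strip()))
    return compounds


# Small in-memory sample; a real dataset would be opened from disk instead.
sample = io.StringIO(
    "50-00-0 (formaldehyde),ignored column\n"
    '"56-41-7 ((2S)-2-aminopropanoic acid)",also ignored\n'
    "not a compound\n"
)
parsed = parse_rows(sample)
```

Names that contain commas need to be quoted in the CSV file, otherwise the CSV reader splits them across columns and the row no longer matches.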