pnlp_study_project

Practical NLP for Survey Analysis with deepsight GmbH.

Project Structure

|-- data
    |-- raw
    |-- processed
|-- notebooks
    |-- sentiment_analysis
    |-- topic_clustering
|-- src
    |-- sentiment_classifier

Usage

Clone the repository.
Navigate to the repository directory.
Install dependencies. (pip install -r requirements.txt) Note: OS X users need a compiler with good C++11 support, per the fastText documentation. Information on how to install one of the available compilers can be found here.
Obtain the dataset, which is available on Slack, and place it in the 'data' directory.
Obtain the fastText English model here. Place 'common-crawl-300d-2M-subword' in 'src'.
Navigate to src and test run the main script. (python main.py)
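
Before the first run it can help to confirm that the files from the steps above are in place. The following is a minimal sketch, not part of the repository: the function name `missing_paths` and the list `EXPECTED_PATHS` are hypothetical, and the check only tests that each path exists, since this README does not say whether 'common-crawl-300d-2M-subword' is a file or a directory.

```python
from pathlib import Path

# Paths the usage steps mention, relative to the repository root.
# Assumption: only existence is checked; the contents of 'data' and the
# exact form of the fastText model entry are not specified in the README.
EXPECTED_PATHS = [
    "requirements.txt",
    "data",
    "src/main.py",
    "src/common-crawl-300d-2M-subword",
]


def missing_paths(repo_root):
    """Return the expected paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [p for p in EXPECTED_PATHS if not (root / p).exists()]


if __name__ == "__main__":
    missing = missing_paths(".")
    if missing:
        print("Missing before running main.py:")
        for path in missing:
            print("  " + path)
    else:
        print("All expected paths found.")
```

Run it from the repository root. An empty result means the setup steps have been completed as far as this check can tell.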

Experiment Logging

To use the full capabilities of the script, connect the Sacred experiment to the MongoDB Atlas cluster prepared for this project. Sacred is an experiment logging toolkit that integrates easily into scripts and records configurations, results, files, and so on. You can find more information on Sacred here.

First, provide your email to the MongoDB Atlas cluster owner. You will receive an invite to register and gain access to the project.

Next, adjust the credentials.py file in the src directory by adding the username and password you used to register for MongoDB Atlas. You should now be able to test that your experiment logs to the database by uncommenting the experiment observer in main.py. If the script runs without problems you have successfully logged the experiment.
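
As an illustration of what the observer setup typically looks like, here is a minimal sketch. It is not the code in main.py: the function names, the host `cluster0.example.mongodb.net`, and the database name `pnlp` are placeholders, and the layout of credentials.py is not shown in this README. The sketch assumes a Sacred version in which `MongoObserver` accepts `url` and `db_name` directly. The username and password are percent-escaped because characters such as `@` or `/` would otherwise break the connection string.

```python
from urllib.parse import quote_plus


def build_atlas_uri(username, password, host, db_name):
    """Build a MongoDB Atlas connection string with escaped credentials."""
    user = quote_plus(username)
    pwd = quote_plus(password)
    return f"mongodb+srv://{user}:{pwd}@{host}/{db_name}?retryWrites=true&w=majority"


def attach_observer(experiment, uri, db_name):
    """Attach a MongoDB observer to a Sacred experiment.

    Sacred is imported here so the URI helper above stays usable
    without Sacred installed.
    """
    from sacred.observers import MongoObserver

    experiment.observers.append(MongoObserver(url=uri, db_name=db_name))


if __name__ == "__main__":
    # Placeholder values; the real host and database come from the project owner.
    uri = build_atlas_uri("alice", "p@ss/word", "cluster0.example.mongodb.net", "pnlp")
    print(uri)
```

The printed string is the kind of URL that Omniboard expects later on.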

Viewing Experiment Results

To view experiment results logged to the database, you must install Omniboard. Omniboard is a browser-based application optimized for viewing experiments logged via Sacred and MongoDB.

Omniboard requires Node.js. First, install Node.js here.

Next, use the terminal to install Omniboard. (npm install -g omniboard) Then run Omniboard. (npx omniboard --mu url) You must replace 'url' with the database server URL. The easiest way to obtain this is to run main.py and copy and paste the output URL into the terminal. If you receive no errors, Omniboard should be running.
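
A database URL usually contains characters such as `?` and `&` that a shell interprets itself, so pasting it unquoted can fail. The sketch below is an optional helper, not part of the repository: it quotes the URL and prints the full command. The function name `omniboard_command` and the example URL are hypothetical.

```python
import shlex


def omniboard_command(mongo_uri):
    """Return a shell-safe command line that starts Omniboard."""
    return shlex.join(["npx", "omniboard", "--mu", mongo_uri])


if __name__ == "__main__":
    # Placeholder URL; use the one printed by main.py instead.
    example_uri = (
        "mongodb+srv://user:secret@cluster0.example.mongodb.net/pnlp"
        "?retryWrites=true&w=majority"
    )
    print(omniboard_command(example_uri))
```

Copy the printed line into the terminal. `shlex.join` requires Python 3.8 or later.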

To view the board and interact with the results, navigate to localhost:9000 in your browser. From Omniboard you can adjust the shown columns and other settings via the options menu at the top right of the dashboard.

Further information on setting up Omniboard can be found here.
