Semi-supervised Recursive Autoencoders for Opinion Detection on Twitter

Problem description

Twitter data from the first 2008 Presidential debate Total number of tweets is 3,238

Data is in ./data folder:

The system is buit on Python 2.7

Other packages you need to install before running the system:

Parameters of the model can be changed in ./data/model/json

d : dimension of the word vector
cat : number of categories of the classification problem
alpha : the proportion of supervised (classification) error and unsupervised (reconstruction) error
lambdaW : regularisation term on word vector reconstruction matrices
lambdaCat : regularisation term on category
lambdaL : regularisation term on word embedding
iter : number of maximum iteration of the minFunc solver

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
References		References
data		data
sa		sa
README.md		README.md
Report.pdf		Report.pdf