A Novel Approach to Topic Evolution

The project objectives were to demonstrate how topic modelling could be used to generate a network graph representation of topics to be used as a statistical layer upon which to train a predictive model.

A multi-package project was developed in Python that allowed for component parts to be easily interchanged and explored. Results were generated in a Jupyter Notebook for repeatability and demonstration purposes. The final code base included classes for several topic model variants, and a network model, including functionality to integrate the two class types. Two topic models – Latent Dirichlet Allocation and Hierarchical Dirichlet Process – were trained on two corpora of book summaries (length 142 and 1,996 documents), retrieved using an API provided by getAbstract. Outputs from both models were used to generate a static network of connected topics, using the Hellinger Distance metric to measure topic similarity, and were visualised on an interactive plot. A HDP model was trained on the larger corpus to demonstrate a dynamic topic network, and the implications of these results were used to propose how a prediction model would be trained to predict document influence and forecast topic trends.

The direct integration of topic modelling and network analytics – with continuity of the same data – was not expected to be achieved during the course of the project, and results encourage further development.

Although a prediction model was not implemented, a comprehensive review of component parts has been used to detail a final design for the integration of the three technologies.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
apiIntegrations		apiIntegrations
forecasting		forecasting
networking		networking
notebooks		notebooks
scripts		scripts
tests		tests
topicmodelling		topicmodelling
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
apiConfig.json		apiConfig.json
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Novel Approach to Topic Evolution

About

Releases

Packages

Languages

Tobias-Fechner/dissertation

Folders and files

Latest commit

History

Repository files navigation

A Novel Approach to Topic Evolution

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages