SenTree

SenTree is a module that converts a natural language sentence into a binary tree implied sequence. It is built on Structural-Probe: https://github.com/john-hewitt/structural-probes

The module can be integrated into existing Transformer-based LLMs by using the binary tree implied sequence as the input and output of the decoder. It also supports the reverse conversion, from binary tree implied sequences to natural language sentences, so that decoder output can be converted back into readable sentences.
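The round trip described above can be illustrated with a small sketch. It is not SenTree's actual encoding: it assumes each word has a depth score (the kind of value a structural probe predicts), builds a binary tree by recursively taking the shallowest word as the root, and serializes the tree in pre-order. The `<nil>` marker, the `Node` class, all function names, and the depth values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence

EMPTY = "<nil>"  # hypothetical marker for a missing child


@dataclass
class Node:
    word: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build_tree(words: Sequence[str], depths: Sequence[float]) -> Optional[Node]:
    """Shallowest word becomes the root; words on each side form the subtrees."""
    if not words:
        return None
    root = min(range(len(words)), key=lambda i: depths[i])  # leftmost minimum
    return Node(
        words[root],
        build_tree(words[:root], depths[:root]),
        build_tree(words[root + 1:], depths[root + 1:]),
    )


def to_sequence(node: Optional[Node]) -> List[str]:
    """Pre-order traversal with explicit markers for empty children."""
    if node is None:
        return [EMPTY]
    return [node.word] + to_sequence(node.left) + to_sequence(node.right)


def from_sequence(sequence: Sequence[str]) -> Optional[Node]:
    """Rebuild the tree from the pre-order sequence."""
    tokens = iter(sequence)

    def read() -> Optional[Node]:
        token = next(tokens)
        if token == EMPTY:
            return None
        node = Node(token)
        node.left = read()
        node.right = read()
        return node

    return read()


def to_sentence(node: Optional[Node]) -> List[str]:
    """In-order traversal restores the original word order."""
    if node is None:
        return []
    return to_sentence(node.left) + [node.word] + to_sentence(node.right)


words = ["the", "cat", "sat", "down"]
depths = [2.0, 1.0, 0.0, 1.0]  # made-up depths, not real probe output
sequence = to_sequence(build_tree(words, depths))
restored = to_sentence(from_sequence(sequence))
print(sequence)
print(restored)
```

In this sketch the sequence starts with the shallowest word ("sat") rather than the first word of the sentence, and the in-order traversal recovers the original word order.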

Requirements

Creating a virtual environment for this project is recommended. Only Python 3 is supported.
Download and install the Structural-Probe project; see the "Installing & Getting Started" section of the Structural-Probe README.
Configure your system environment variables (for example, PYTHONPATH) so that Python can import the required modules.
Clone this repository.
Edit this project's configuration file, probe.yaml. In particular, make sure depth_params_path is set to an absolute path.
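A quick sanity check of the last step can be scripted. The sketch below is not part of SenTree: the helper names are hypothetical, and it assumes only that probe.yaml contains a line of the form `depth_params_path: <path>` somewhere in the file. It scans the text line by line instead of parsing YAML, so it has no third-party dependencies, but it also treats `#` as the start of a comment.

```python
import os
import re
from typing import Optional


def find_depth_params_path(yaml_text: str) -> Optional[str]:
    """Return the value of the first `depth_params_path:` entry, or None."""
    for line in yaml_text.splitlines():
        stripped = line.split("#", 1)[0].strip()  # drop trailing comments
        match = re.match(r"depth_params_path\s*:\s*(.+)$", stripped)
        if match:
            return match.group(1).strip().strip("'\"")
    return None


def check_config(config_file: str = "probe.yaml") -> str:
    """Raise if depth_params_path is missing, relative, or not a file."""
    with open(config_file, encoding="utf-8") as handle:
        path = find_depth_params_path(handle.read())
    if path is None:
        raise ValueError(f"no depth_params_path entry found in {config_file}")
    if not os.path.isabs(path):
        raise ValueError(f"depth_params_path must be absolute, got: {path}")
    if not os.path.isfile(path):
        raise FileNotFoundError(f"depth_params_path does not exist: {path}")
    return path
```

Calling `check_config()` from the repository root returns the configured path when everything is in order and raises a descriptive error otherwise.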

Getting Started

Run demo.py and sentree_util.py to get started.

sentree_util.py converts between sentences and binary tree implied sequences according to the option and input you provide. It is a handy tool when integrating SenTree into existing systems.

Integrating

The module is intended to convert raw sentences into binary tree implied sequences for the decoder. The training process of an autoregressive model stays the same as before. The generation process changes: it is no longer strictly word after word, because the most recently generated word may be inserted at any position in the sentence that is still being generated.
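The insertion-style generation described above can be sketched as follows. This is an illustration under assumptions, not SenTree's implementation: it assumes the decoder emits a pre-order serialization of a binary tree whose in-order traversal is the sentence, with a hypothetical `<nil>` token marking an empty child. Under that assumption, every emitted token fills the leftmost open position in the partial sentence, so a new word can land before words that were generated earlier.

```python
from typing import Iterator, List, Sequence

EMPTY = "<nil>"   # hypothetical marker for a missing child
_SLOT = object()  # placeholder for a position that may still receive a word


def generate_by_insertion(sequence: Sequence[str]) -> Iterator[List[str]]:
    """Yield the partial sentence after each word token is consumed."""
    partial: List[object] = [_SLOT]
    for token in sequence:
        slot = next((i for i, item in enumerate(partial) if item is _SLOT), None)
        if slot is None:
            raise ValueError("sequence has tokens left after the tree is complete")
        if token == EMPTY:
            del partial[slot]  # this position stays empty
        else:
            # The word takes the slot and opens one slot on each side of itself.
            partial[slot:slot + 1] = [_SLOT, token, _SLOT]
            yield [str(item) for item in partial if item is not _SLOT]


decoder_output = ["sat", "cat", "the", EMPTY, EMPTY, EMPTY, "down", EMPTY, EMPTY]
steps = list(generate_by_insertion(decoder_output))
for step in steps:
    print(" ".join(step))
```

Here "cat" and "the" are each inserted in front of already generated words, while "down" is appended at the end.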

The SenTree module should be integrated into autoregressive models as illustrated below:

Additional Information

The binary tree implied sequence that SenTree outputs for a given input sentence is not fixed. It varies with the model weights and with the auto-encoding model behind the Structural-Probe. This variability is a key feature: it can be used to search for suitable ways of expressing a sentence, from the perspective of both the decoder and the SenTree system.

