Transformer-Text-Summarizer

This project implements a transformer decoder to summarize text. Summarization is an important task in natural language processing and can be useful for a consumer enterprise. For example, it can be used to scrape articles, summarize them, and then run sentiment analysis on the summaries.

Getting Started

Dependencies

The following packages should be available in Python 3:

Trax
numpy
random (part of the Python standard library, so no separate installation is needed)

Trax is an end-to-end library for deep learning that focuses on clear code and speed. It is actively used and maintained by the Google Brain team. It is designed to be fast and to keep model code easier to read than equivalent TensorFlow or PyTorch code. It also supports both TPUs and GPUs.
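
Before opening the notebook, it can help to confirm that the third-party packages are importable. The snippet below is a minimal sketch that uses only the standard library; the helper name `missing_packages` is hypothetical and is not part of this project.

```python
import importlib.util

# Third-party packages named in the dependency list. `random` ships with
# Python itself, so it is not checked here.
REQUIRED = ["trax", "numpy"]


def missing_packages(names):
    """Return the top-level package names the import system cannot find."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
        print("Try: pip install " + " ".join(missing))
    else:
        print("All required packages are installed.")
```

On Google Colab, numpy is normally preinstalled, so Trax is likely the only package that needs installing.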

Summarization with transformer model:
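
A transformer decoder differs from an encoder mainly in its causal self-attention: each position may attend only to itself and to earlier positions. The sketch below illustrates that idea in plain Python with toy scores. It is not the code used in the notebook, and the function names are hypothetical.

```python
import math


def causal_mask(n):
    """mask[i][j] is True when position i may attend to position j (j <= i)."""
    return [[j <= i for j in range(n)] for i in range(n)]


def masked_softmax(scores, mask):
    """Row-wise softmax that gives zero weight to masked-out positions."""
    weights = []
    for row, allowed in zip(scores, mask):
        kept = [s for s, ok in zip(row, allowed) if ok]
        top = max(kept)  # subtract the row maximum for numerical stability
        exps = [math.exp(s - top) if ok else 0.0 for s, ok in zip(row, allowed)]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


# Toy attention scores for a 3-token sequence, all equal.
scores = [[0.0] * 3 for _ in range(3)]
weights = masked_softmax(scores, causal_mask(3))
```

With equal scores, the first token puts all of its weight on itself, the second splits its weight evenly over the first two tokens, and the third spreads it over all three.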

Dataset

This project uses the non-anonymized CNN/DailyMail summarization dataset, which is available in TensorFlow Datasets (TFDS). Each example has two features:

article: text of news article, used as the document to be summarized
highlights: joined text of highlights with <s> and </s> around each highlight, which is the target summary.
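
A decoder-only model is commonly trained for summarization by concatenating the tokenized article and its highlights into one sequence and computing the loss only on the summary part. The sketch below shows that preparation step under stated assumptions: the token ids, the `EOS_ID` and `PAD_ID` values, and the helper `build_example` are illustrative and are not taken from the notebook.

```python
EOS_ID = 1  # assumed end-of-sequence token id
PAD_ID = 0  # assumed padding token id, used here as a separator


def build_example(article_ids, summary_ids):
    """Concatenate article and summary ids and build a matching loss mask.

    The mask is 0 over the article and separator tokens and 1 over the
    summary tokens, so only the summary contributes to the training loss.
    """
    prefix = list(article_ids) + [EOS_ID, PAD_ID]
    target = list(summary_ids) + [EOS_ID]
    tokens = prefix + target
    loss_mask = [0] * len(prefix) + [1] * len(target)
    return tokens, loss_mask


# Toy ids standing in for a tokenized article and its highlights.
tokens, loss_mask = build_example([11, 12, 13], [21, 22])
```

At inference time, the same layout would let the model receive the article followed by the separator and generate summary tokens until it emits the end-of-sequence id.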

Instructions

You can train the model from scratch using the Google Colab notebook. Please use Transformer-Text-Summarizer.ipynb for the Trax version.
