We will try out the pretrained T5 Transformer from Google (https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html). Google describes it as a "Shared Text-To-Text Framework" for NLP, built to explore the limits of transfer learning. T5 was pretrained on the C4 dataset, an unlabeled, cleaned version of Common Crawl that is about two orders of magnitude larger than Wikipedia (https://www.tensorflow.org/datasets/catalog/c4).
T5 comes in several sizes. The biggest version has 11B parameters, which is a lot: for comparison, BERT-Large (https://github.com/google-research/bert) has 340M parameters. The 11B model is too big to fine-tune on the free Colab TPU, so we will use the 3B pretrained model instead.
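As a rough sketch, a pretrained checkpoint can be loaded like this with the Hugging Face transformers library; the library and the checkpoint identifiers ("t5-3b", "t5-small") are assumptions on my side, not part of Google's original T5 code:

```python
# Sketch: loading a pretrained T5 checkpoint via Hugging Face transformers
# (assumed dependency; "t5-3b" / "t5-small" are the public checkpoint names).
from transformers import T5Tokenizer, T5ForConditionalGeneration

model_name = "t5-3b"  # use "t5-small" or "t5-base" if memory is tight
tokenizer = T5Tokenizer.from_pretrained(model_name)
model = T5ForConditionalGeneration.from_pretrained(model_name)

print(f"{model.num_parameters():,} parameters")
```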
We will use the CNN/DailyMail dataset, one of the most widely used datasets for text summarization.
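A minimal sketch of pulling the dataset through tensorflow_datasets; the dataset name "cnn_dailymail" and the "article"/"highlights" feature keys follow the TFDS catalog and should be treated as assumptions here:

```python
# Sketch: inspecting one CNN/DailyMail example via tensorflow_datasets.
import tensorflow_datasets as tfds

ds = tfds.load("cnn_dailymail", split="train")
for example in ds.take(1):
    article = example["article"].numpy().decode("utf-8")
    summary = example["highlights"].numpy().decode("utf-8")
    print(article[:300], "...")
    print("Summary:", summary)
```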
There are three implementations to choose from: the original one from Google, plus a TensorFlow and a PyTorch implementation, both based on the Hugging Face library. The Hugging Face versions have the advantage of being easier to extend and customize.
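To make the text-to-text idea concrete, here is a rough summarization example with the Hugging Face PyTorch classes. The "summarize: " prefix is the convention T5 was trained with; the generation settings below are plausible defaults I chose for illustration, not tuned values:

```python
# Sketch: one-off summarization with the Hugging Face PyTorch T5 classes.
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")

article = "..."  # a CNN/DailyMail article goes here
inputs = tokenizer("summarize: " + article,
                   return_tensors="pt", max_length=512, truncation=True)
summary_ids = model.generate(inputs["input_ids"],
                             num_beams=4, max_length=150, early_stopping=True)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```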
Google also provides a convenient environment for trying out all the TFDS datasets (https://www.tensorflow.org/datasets). Luckily, the CNN/DailyMail dataset is one of them, so we only need to adapt the code slightly to the new task and fine-tune the model.
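A heavily abridged sketch of what the fine-tuning call can look like with Google's t5 library on a Colab TPU. The MtfModel arguments, the registered task name "cnn_dailymail_v002", and the GCS paths follow Google's example notebooks and may differ between library versions, so treat all of them as assumptions; the output bucket is hypothetical:

```python
# Sketch: fine-tuning T5-3B on CNN/DailyMail with Google's t5 library
# (Mesh TensorFlow backend). Names and paths are assumptions, see above.
import t5
import tensorflow.compat.v1 as tf

# Detect the Colab TPU runtime.
tpu_address = tf.distribute.cluster_resolver.TPUClusterResolver().get_master()

model = t5.models.MtfModel(
    model_dir="gs://my-bucket/t5-cnn-dailymail/3B",  # hypothetical output bucket
    tpu=tpu_address,
    tpu_topology="v2-8",
    model_parallelism=16,
    batch_size=8,
    sequence_length={"inputs": 512, "targets": 150},
)

model.finetune(
    mixture_or_task_name="cnn_dailymail_v002",
    pretrained_model_dir="gs://t5-data/pretrained_models/3B",
    finetune_steps=25000,
)
```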