diff --git a/README.md b/README.md
index 7bae1d28434..ff32569cf67 100644
--- a/README.md
+++ b/README.md
@@ -78,6 +78,10 @@ ESPnet uses [pytorch](http://pytorch.org/) as a deep learning engine and also fo
 - Set `frontend` to be `s3prl`
 - Select any upstream model by setting the `frontend_conf` to the corresponding name.
 - Streaming Transformer/Conformer ASR with blockwise synchronous beam search.
+- Restricted Self-Attention based on [Longformer](https://arxiv.org/abs/2004.05150) as an encoder for long sequences
+
+### SUM: Speech Summarization
+- End to End Speech Summarization Recipe for Instructional Videos using Restricted Self-Attention [[Sharma et al., 2022]](https://arxiv.org/abs/2110.06263)
 
 Demonstration
 - Real-time ASR demo with ESPnet2 [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/espnet/notebook/blob/master/espnet2_asr_realtime_demo.ipynb)