Update PUBLICATIONS.md #5963

Merged: 4 commits, Feb 10, 2023
Changes from 3 commits
4 changes: 4 additions & 0 deletions .gitignore
@@ -94,6 +94,10 @@ target/
# Jupyter Notebook
.ipynb_checkpoints

# Override Jupyter in Github Language states for more accurate estimate of repo code.
# Reference: https://github.com/github/linguist/blob/master/docs/overrides.md#generated-code
*.ipynb linguist-generated

# IPython
profile_default/
ipython_config.py
28 changes: 26 additions & 2 deletions PUBLICATIONS.md
@@ -6,6 +6,22 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I

# Automatic Speech Recognition (ASR)

<details>
<summary>2023</summary>

* [Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition](https://ieeexplore.ieee.org/abstract/document/10022960)
* [Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition](https://ieeexplore.ieee.org/abstract/document/10023219)

</details>

<details>
<summary>2022</summary>

* [Multi-blank Transducers for Speech Recognition](https://arxiv.org/abs/2211.03541)
* [Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models](https://arxiv.org/abs/2211.05103)

</details>

<details>
<summary>2021</summary>

@@ -44,9 +60,9 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I
## Speaker Recognition (SpkR)

<details>
-<summary>2021</summary>
+<summary>2022</summary>

-* [TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context]( https://arxiv.org/pdf/2110.04410.pdf)
+* [TitaNet: Neural Model for Speaker Representation with 1D Depth-Wise Separable Convolutions and Global Context](https://ieeexplore.ieee.org/abstract/document/9746806)

</details>

@@ -122,6 +138,13 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I

# Text To Speech (TTS)

<details>
<summary>2022</summary>

* [Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers](https://arxiv.org/abs/2211.00585)

</details>

<details>
<summary>2021</summary>

@@ -140,6 +163,7 @@ Here, we list a collection of research articles that utilize the NeMo Toolkit. I
<summary>2022</summary>

* [Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization](https://arxiv.org/abs/2203.15917)
* [Thutmose Tagger: Single-pass neural model for Inverse Text Normalization](https://arxiv.org/abs/2208.00064)

</details>
