Transformers have emerged as the leading architecture in deep learning across a wide range of applications, particularly in natural language processing (NLP) and computer vision (CV). Despite their success, designing effective Transformer models remains a complex and resource-intensive task, owing to their intricate architecture and the substantial computational demands of training and optimization. Neural Architecture Search (NAS) offers a promising way to address these challenges by automating the search for optimal Transformer architectures. In this report, I examine the key concepts behind NAS and Transformers, present notable results from the NAS-for-Transformers literature, and discuss existing limitations as well as potential future directions.
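As a concrete illustration of what "searching for optimal Transformer architectures" means, below is a minimal sketch of the simplest NAS baseline: random search over a toy Transformer search space. The search space, the `proxy_score` heuristic, and all names are illustrative assumptions for this sketch, not methods or results from the report; a real NAS run would train or estimate each candidate instead of using a toy proxy.

```python
import random

# Hypothetical search space for a small Transformer; the ranges are
# illustrative, not taken from the report.
SEARCH_SPACE = {
    "num_layers": [2, 4, 6, 8],
    "num_heads": [2, 4, 8],
    "hidden_dim": [128, 256, 512],
    "ffn_dim": [256, 512, 1024],
}

def sample_architecture(space):
    """Draw one candidate architecture uniformly at random."""
    return {name: random.choice(choices) for name, choices in space.items()}

def proxy_score(arch):
    """Toy stand-in for validation accuracy: rewards model capacity but
    penalizes parameter count. In practice this would be replaced by
    training (or a trained-accuracy predictor) for each candidate."""
    capacity = arch["num_layers"] * arch["hidden_dim"]
    params = arch["num_layers"] * arch["hidden_dim"] * arch["ffn_dim"]
    return capacity / (1.0 + params / 1e6)

def random_search(space, n_trials=20):
    """Simplest NAS baseline: sample candidates, score them, keep the best."""
    best_arch, best_score = None, float("-inf")
    for _ in range(n_trials):
        arch = sample_architecture(space)
        score = proxy_score(arch)
        if score > best_score:
            best_arch, best_score = arch, score
    return best_arch, best_score

if __name__ == "__main__":
    arch, score = random_search(SEARCH_SPACE)
    print(f"best architecture: {arch} (proxy score {score:.3f})")
```

More sophisticated NAS methods covered in the report (e.g., evolutionary search or weight-sharing supernets) replace the sampling loop and the scoring function, but the overall structure of "propose, evaluate, select" stays the same.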
For a more detailed analysis, please refer to report_nas_for_transformers.pdf.