Add TRT-LLM params like max_num_tokens and opt_num_tokens #2057

Triggered via pull request May 20, 2024 19:30
Status Success
Total duration 53m 50s
cicd-main.yml

on: pull_request
cicd-cluster-clean: 3s
cicd-test-container-setup: 2m 44s
L0_Unit_Tests_GPU: 20m 14s
L0_Unit_Tests_CPU: 25m 19s
L2_Community_LLM_Checkpoints_tests_Llama: 1m 34s
L2_Community_LLM_Checkpoints_tests_StarCoder: 1m 15s
L2_Community_LLM_Checkpoints_tests_Falcon: 1m 11s
ASR_dev_run_Speech_to_Text: 1m 1s
ASR_dev_run_Speech_to_Text_WPE_-_CitriNet: 47s
ASR_dev_run_Speech_Pre-training_-_CitriNet: 39s
ASR_dev_run_Speech_To_Text_Finetuning: 57s
ASR_dev_run_Speech_to_Text_WPE_-_Conformer: 37s
ASR_dev_run-part_two_Speech_to_Text_WPE_-_Squeezeformer: 36s
L2_Speech_to_Text_EMA: 1m 26s
L2_Speaker_dev_run_Speaker_Recognition: 34s
L2_Speaker_dev_run_Speaker_Diarization: 37s
L2_Speaker_dev_run_Speech_to_Label: 36s
L2_Speaker_dev_run_Speaker_Diarization_with_ASR_Inference: 1m 2s
L2_Speaker_dev_run_Clustering_Diarizer_Inference: 1m 9s
L2_Speaker_dev_run_Neural_Diarizer_Inference: 1m 10s
L2_Speaker_dev_run_Multispeaker_ASR_Data_Simulation: 1m 4s
L2_ASR_Multi-dataloader_dev_run_Speech_to_Text_multi-dataloader: 50s
L2_ASR_Multi-dataloader_dev_run_Speech_to_Label_multi-dataloader: 38s
L2_ASR_Adapters_Linear_Adapters: 39s
L2_ASR_Adapters_RelPos_MHA_Adapters: 38s
L2_Speech_Transcription_Speech_to_Text_Transcribe: 33s
L2_Transducer_alignment_Running_pytest: 1m 24s
L2_Segmentation_Tool_Parallel_ctc_segmentation_test_L2_Eng_CitriNet_with_wav: 3m 12s
L2_Segmentation_Tool_Parallel_ctc_segmentation_test_L2_Ru_QN_with_mp3: 2m 2s
L2_G2P_Models_G2P_Conformer_training_evaluation_and_inference: 1m 2s
L2_G2P_Models_HeteronymClassificationModel_training_evaluation_and_inference: 1m 28s
L2_Dialogue_Classification_Intent_and_slot_classification_using_SGDQA: 50s
L2_Dialogue_Classification_Intent_and_slot_classification_using_IntentSlotClassificationModel: 1m 34s
L2_Dialogue_Classification_Intent_classification_using_ZeroShotIntentModel: 1m 49s
L2_Dialogue_Classification_Design_Intent_classification_using_ZeroShotIntentModel: 1m 24s
L2_Dialogue_Classification_Design_Intent_classification_using_ZeroShotIntentModel_BART_Classifier: 1m 6s
L2_Dialogue_Classification_Design_Intent_classification_using_DialogueNearestNeighbourModel: 40s
L2_Dialogue_Generation_Dialogue_Answer_Extender_using_DialogueS2SGenerationModel: 56s
L2_Dialogue_Generation_Dialogue_SGD_Based_Answer_Extender_using_DialogueS2SGenerationModel: 1m 11s
L2_COPY_Dialogue_Answer_Extender_using_DialogueGPTGenerationModel: 54s
L2_Duplex_Text_Normalization_with_Tarred_dataset: 50s
L2_BERT_Text_Classification_with_BERT_Test: 39s
L2_Parallel_BERT_Question-Answering_SQUAD_v1_1: 37s
L2_Parallel_BERT_Question-Answering_SQUAD_v2_0: 36s
L2_Parallel_BART_Question-Answering_SQUAD_v1_1: 39s
L2_Parallel_BART_Question-Answering_SQUAD_v2_0: 38s
L2_Parallel_GPT2_Question-Answering_SQUAD_v1_1: 40s
L2_Parallel_GPT2_Question-Answering_SQUAD_v2_0: 38s
L2_Intent_and_Slot_Classification_Tasks_Intent_and_Slot_Classification: 38s
L2_Intent_and_Slot_Classification_Tasks_Multi-Label_Intent_and_Slot_Classification: 41s
L2_Parallel_NLP_Examples2_NER_finetuning_from_pretrained_Test: 43s
L2_Parallel_NLP_Examples2_Punctuation_and_capitalization_finetuning_from_pretrained_test: 43s
L2_Parallel_NLP_Examples2_NER_with_TurkuNLP__bert-base-finnish-cased-v1: 38s
L2_Parallel_NLP_Examples2_Evaluation_script_for_Token_Classification: 1m 13s
L2_Parallel_NLP_Examples2_Evaluation_script_for_Punctuation: 46s
L2_Parallel_NLP_Examples2_Punctuation_Capitalization_2GPUs_with_DistilBERT_Finetuning_on_other_data: 2m 25s
Punctuation_Capitalization_tarred_dataset_create_and_use_tarred_dataset: 2m 2s
Punctuation_Capitalization_Using_model-common_datasets_parameters-label_vocab_dir: 2m 18s
Punctuation_Capitalization_inference_Restore_punctuation_and_capitalization_in_long_text: 44s
L2_Pretraining_BERT_pretraining_from_Text: 36s
L2_Pretraining_BERT_from_Preprocessed: 43s
L2_Entity_Linking_Self_Alignment_Pretraining_BERT: 1m 36s
L2_NMT_Attention_is_All_You_Need_Training_NMT_Training_Post-LN: 1m 5s
L2_NMT_Attention_is_All_You_Need_Training_NMT_Training_Pre-LN: 51s
L2_NMT_Attention_is_All_You_Need_Training_NMT_Multi-Validation: 1m 1s
L2_NMT_Attention_is_All_You_Need_Inference: 1m 5s
L2_NMT_Attention_is_All_You_Need_Finetuning: 1m 9s
L2_NMT_Tarred_Dataset_Creation_Auto_Tarred_Dataset_Creation: 50s
L2_NMT_Tarred_Dataset_Creation_Script_Tarred_Dataset_Creation: 1m 32s
L2_Megatron_NMT_Training_TP2: 4m 15s
L2_Megatron_BART_Perceiver_MIM_Training_TP2: 1m 56s
L2_Megatron_Bert_Pretraining_and_Resume_Training_with_Pipeline_Parallelism: 2m 24s
L2_Megatron_Bert_Pretraining_and_Resume_Training: 1m 48s
L2_Megatron_Core_Bert_Pretraining_and_Resume_Training: 2m 50s
L2_Megatron_RETRO_Pretraining_and_Resume_Training: 6m 24s
L2_Legacy_Megatron_RETRO_Pretraining_and_Resume_Training: 2m 39s
L2_BioMegatron_Bert_NER_Task: 1m 16s
L2_Megatron_GPT_Pretraining_and_Resume_Training_TP2: 3m 50s
L2_Megatron_GPT_with_Rope_Pretraining_and_Resume_Training_TP2: 1m 52s
L2_Megatron_GPT_with_ALiBi_Pretraining_and_Resume_Training_TP2: 1m 53s
L2_Megatron_GPT_with_KERPLE_Pretraining_and_Resume_Training_TP2: 2m 9s
L2_Megatron_GPT_Pretraining_and_Resume_Training_PP2: 4m 42s
L2_Megatron_GPT_Finetuning_PP2: 3m 8s
L2_Megatron_GPT_Finetuning_StarCoder_PP1: 55s
L2_Megatron_GPT_Embedding: 1m 9s
L2_Megatron_GPT_PEFT_Lora_PP2: 2m 16s
L2_Megatron_GPT_PEFT_Lora_TP2: 1m 55s
L2_Megatron_GPT_Eval: 47s
L2_Megatron_GPT_Eval_PP2: 4m 45s
L2_Megatron_GPT_SFT_Eval_inference_seq_len_greaterThan_training_seq_len: 43s
L2_Megatron_Change_Partitions_Reduce_TP_Num_Partitions_-2_to_1-_and_PP_Num_Partitions_-1_to_2: 1m 33s
L2_Megatron_Change_Partitions_Increase_TP_Num_Partitions_-2_to_4-_and_PP_Num_Partitions_-1_to_2: 1m 19s
L2_Megatron_T5_Pretraining_and_Resume_Training_TP2: 2m 33s
L2_Megatron_T5_with_ALiBi_Pretraining_and_Resume_Training_TP2: 1m 44s
L2_Megatron_T5_with_KERPLE_Pretraining_and_Resume_Training_TP2: 2m 34s
L2_Megatron_T5_Pretraining_and_Resume_Training_PP2: 2m 1s
L2_Megatron_T5_w_Mixture_of_Expert_Pretraining: 1m 7s
L2_Megatron_UL2_Pretraining_and_Resume_Training_TP2: 1m 47s
L2_Megatron_T5_Eval: 31s
L2_Megatron_BART_Pretraining_and_Resume_Training_TP2: 2m 32s
L2_Megatron_BART_Pretraining_and_Resume_Training_PP2: 2m 35s
L2_Megatron_T5_GLUE_RTE: 42s
L2_Megatron_T5_GLUE_XNLI: 43s
L2_Megatron_T5_PEFT_Lora_TP2: 1m 48s
L2_Megatron_Mock_Data_Generation_MockGPTDataset: 2m 9s
L2_Megatron_Mock_Data_Generation_MockT5Dataset: 41s
L2_TTS_Fast_dev_runs_1_Tacotron_2: 1m 34s
L2_TTS_Fast_dev_runs_1_WaveGlow: 1m 11s
L2_TTS_Fast_dev_runs_1_FastPitch: 1m 48s
L2_TTS_Fast_dev_runs_1_Mixer-TTS: 1m 50s
L2_TTS_Fast_dev_runs_1_Hifigan: 38s
Speech_Checkpoints_tests: 2m 39s
L0_Setup_Test_Data_And_Models: 37s
L2_Community_LLM_Checkpoints_tests_Llama3: 1m 16s
L2_PTQ_Llama2_Export_Only: 1m 7s
L2_PTQ_Llama2_FP8: 1m 8s
L2_PTQ_Llama2_INT8_SQ: 50s
OPTIONAL_ASR_dev_run_Speech_To_Text_HF_Finetuning: 1m 25s
Nemo_CICD_Test: 0s

Annotations

2 warnings
L2_Community_LLM_Checkpoints_tests_Llama3
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v2. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
L2_Community_LLM_Checkpoints_tests_Llama3
The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
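
Both warnings name the same pinned action, actions/checkout@v2, in the L2_Community_LLM_Checkpoints_tests_Llama3 job. A minimal sketch of what the corresponding change in cicd-main.yml might look like, assuming the job checks out the repository with a plain `uses:` step. The step name and surrounding layout are hypothetical and not taken from the actual workflow file; actions/checkout@v4 is the release line that runs on Node.js 20.

```yaml
# Hypothetical fragment of cicd-main.yml; runs-on and all other keys are omitted.
jobs:
  L2_Community_LLM_Checkpoints_tests_Llama3:
    steps:
      - name: Checkout repository   # assumed step name
        # previously: actions/checkout@v2 (node12 runtime, flagged in both warnings)
        uses: actions/checkout@v4
```

Only the version tag on the `uses:` line changes; any `with:` inputs the real step passes would need to be checked against the newer release before merging.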