Skip to content

Video Neva Pretraining + Inference Implementation #1722

Video Neva Pretraining + Inference Implementation

Video Neva Pretraining + Inference Implementation #1722

Re-run triggered May 3, 2024 16:24
Status Cancelled
Total duration 5h 30m 1s
Artifacts

cicd-main.yml

on: pull_request
cicd-cluster-clean
3s
cicd-cluster-clean
cicd-test-container-setup
10m 3s
cicd-test-container-setup
L0_Unit_Tests_GPU
18m 36s
L0_Unit_Tests_GPU
L0_Unit_Tests_CPU
18m 24s
L0_Unit_Tests_CPU
L2_Community_LLM_Checkpoints_tests_Llama
59s
L2_Community_LLM_Checkpoints_tests_Llama
L2_Community_LLM_Checkpoints_tests_StarCoder
1m 0s
L2_Community_LLM_Checkpoints_tests_StarCoder
L2_Community_LLM_Checkpoints_tests_Falcon
1m 16s
L2_Community_LLM_Checkpoints_tests_Falcon
ASR_dev_run_Speech_to_Text
46s
ASR_dev_run_Speech_to_Text
ASR_dev_run_Speech_to_Text_WPE_-_CitriNet
45s
ASR_dev_run_Speech_to_Text_WPE_-_CitriNet
ASR_dev_run_Speech_Pre-training_-_CitriNet
37s
ASR_dev_run_Speech_Pre-training_-_CitriNet
ASR_dev_run_Speech_To_Text_Finetuning
44s
ASR_dev_run_Speech_To_Text_Finetuning
ASR_dev_run_Speech_To_Text_HF_Finetuning
1m 23s
ASR_dev_run_Speech_To_Text_HF_Finetuning
ASR_dev_run_Speech_to_Text_WPE_-_Conformer
35s
ASR_dev_run_Speech_to_Text_WPE_-_Conformer
ASR_dev_run-part_two_Speech_to_Text_WPE_-_Squeezeformer
34s
ASR_dev_run-part_two_Speech_to_Text_WPE_-_Squeezeformer
L2_Speech_to_Text_EMA
1m 25s
L2_Speech_to_Text_EMA
L2_Speaker_dev_run_Speaker_Recognition
34s
L2_Speaker_dev_run_Speaker_Recognition
L2_Speaker_dev_run_Speaker_Diarization
35s
L2_Speaker_dev_run_Speaker_Diarization
L2_Speaker_dev_run_Speech_to_Label
37s
L2_Speaker_dev_run_Speech_to_Label
L2_Speaker_dev_run_Speaker_Diarization_with_ASR_Inference
41s
L2_Speaker_dev_run_Speaker_Diarization_with_ASR_Inference
L2_Speaker_dev_run_Clustering_Diarizer_Inference
1m 7s
L2_Speaker_dev_run_Clustering_Diarizer_Inference
L2_Speaker_dev_run_Neural_Diarizer_Inference
1m 11s
L2_Speaker_dev_run_Neural_Diarizer_Inference
L2_Speaker_dev_run_Multispeaker_ASR_Data_Simulation
58s
L2_Speaker_dev_run_Multispeaker_ASR_Data_Simulation
L2_ASR_Multi-dataloader_dev_run_Speech_to_Text_multi-dataloader
46s
L2_ASR_Multi-dataloader_dev_run_Speech_to_Text_multi-dataloader
L2_ASR_Multi-dataloader_dev_run_Speech_to_Label_multi-dataloader
35s
L2_ASR_Multi-dataloader_dev_run_Speech_to_Label_multi-dataloader
L2_ASR_Adapters_Linear_Adapters
37s
L2_ASR_Adapters_Linear_Adapters
L2_ASR_Adapters_RelPos_MHA_Adapters
37s
L2_ASR_Adapters_RelPos_MHA_Adapters
L2_Speech_Transcription_Speech_to_Text_Transcribe
1m 0s
L2_Speech_Transcription_Speech_to_Text_Transcribe
L2_Transducer_alignment_Running_pytest
1m 19s
L2_Transducer_alignment_Running_pytest
L2_Segmentation_Tool_Parallel_ctc_segmentation_test_L2_Eng_CitriNet_with_wav
3m 11s
L2_Segmentation_Tool_Parallel_ctc_segmentation_test_L2_Eng_CitriNet_with_wav
L2_Segmentation_Tool_Parallel_ctc_segmentation_test_L2_Ru_QN_with_mp3
1m 59s
L2_Segmentation_Tool_Parallel_ctc_segmentation_test_L2_Ru_QN_with_mp3
L2_G2P_Models_G2P_Conformer_training_evaluation_and_inference
58s
L2_G2P_Models_G2P_Conformer_training_evaluation_and_inference
L2_G2P_Models_HeteronymClassificationModel_training_evaluation_and_inference
1m 50s
L2_G2P_Models_HeteronymClassificationModel_training_evaluation_and_inference
L2_Dialogue_Classification_Intent_and_slot_classification_using_SGDQA
49s
L2_Dialogue_Classification_Intent_and_slot_classification_using_SGDQA
L2_Dialogue_Classification_Intent_and_slot_classification_using_IntentSlotClassificationModel
1m 32s
L2_Dialogue_Classification_Intent_and_slot_classification_using_IntentSlotClassificationModel
L2_Dialogue_Classification_Intent_classification_using_ZeroShotIntentModel
1m 47s
L2_Dialogue_Classification_Intent_classification_using_ZeroShotIntentModel
L2_Dialogue_Classification_Design_Intent_classification_using_ZeroShotIntentModel
1m 22s
L2_Dialogue_Classification_Design_Intent_classification_using_ZeroShotIntentModel
L2_Dialogue_Classification_Design_Intent_classification_using_ZeroShotIntentModel_BART_Classifier
1m 4s
L2_Dialogue_Classification_Design_Intent_classification_using_ZeroShotIntentModel_BART_Classifier
L2_Dialogue_Classification_Design_Intent_classification_using_DialogueNearestNeighbourModel
37s
L2_Dialogue_Classification_Design_Intent_classification_using_DialogueNearestNeighbourModel
L2_Dialogue_Generation_Dialogue_Answer_Extender_using_DialogueS2SGenerationModel
53s
L2_Dialogue_Generation_Dialogue_Answer_Extender_using_DialogueS2SGenerationModel
L2_Dialogue_Generation_Dialogue_SGD_Based_Answer_Extender_using_DialogueS2SGenerationModel
38s
L2_Dialogue_Generation_Dialogue_SGD_Based_Answer_Extender_using_DialogueS2SGenerationModel
L2_COPY_Dialogue_Answer_Extender_using_DialogueGPTGenerationModel
53s
L2_COPY_Dialogue_Answer_Extender_using_DialogueGPTGenerationModel
L2_Duplex_Text_Normalization_with_Tarred_dataset
57s
L2_Duplex_Text_Normalization_with_Tarred_dataset
L2_BERT_Text_Classification_with_BERT_Test
38s
L2_BERT_Text_Classification_with_BERT_Test
L2_Parallel_BERT_Question-Answering_SQUAD_v1_1
37s
L2_Parallel_BERT_Question-Answering_SQUAD_v1_1
L2_Parallel_BERT_Question-Answering_SQUAD_v2_0
36s
L2_Parallel_BERT_Question-Answering_SQUAD_v2_0
L2_Parallel_BART_Question-Answering_SQUAD_v1_1
39s
L2_Parallel_BART_Question-Answering_SQUAD_v1_1
L2_Parallel_BART_Question-Answering_SQUAD_v2_0
37s
L2_Parallel_BART_Question-Answering_SQUAD_v2_0
L2_Parallel_GPT2_Question-Answering_SQUAD_v1_1
40s
L2_Parallel_GPT2_Question-Answering_SQUAD_v1_1
L2_Parallel_GPT2_Question-Answering_SQUAD_v2_0
36s
L2_Parallel_GPT2_Question-Answering_SQUAD_v2_0
L2_Intent_and_Slot_Classification_Tasks_Intent_and_Slot_Classification
37s
L2_Intent_and_Slot_Classification_Tasks_Intent_and_Slot_Classification
L2_Intent_and_Slot_Classification_Tasks_Multi-Label_Intent_and_Slot_Classification
40s
L2_Intent_and_Slot_Classification_Tasks_Multi-Label_Intent_and_Slot_Classification
L2_Parallel_NLP_Examples2_NER_finetuning_from_pretrained_Test
42s
L2_Parallel_NLP_Examples2_NER_finetuning_from_pretrained_Test
L2_Parallel_NLP_Examples2_Punctuation_and_capitalization_finetuning_from_pretrained_test
42s
L2_Parallel_NLP_Examples2_Punctuation_and_capitalization_finetuning_from_pretrained_test
L2_Parallel_NLP_Examples2_NER_with_TurkuNLP__bert-base-finnish-cased-v1
37s
L2_Parallel_NLP_Examples2_NER_with_TurkuNLP__bert-base-finnish-cased-v1
L2_Parallel_NLP_Examples2_Evaluation_script_for_Token_Classification
1m 12s
L2_Parallel_NLP_Examples2_Evaluation_script_for_Token_Classification
L2_Parallel_NLP_Examples2_Evaluation_script_for_Punctuation
1m 7s
L2_Parallel_NLP_Examples2_Evaluation_script_for_Punctuation
L2_Parallel_NLP_Examples2_Punctuation_Capitalization_2GPUs_with_DistilBERT_Finetuning_on_other_data
2m 20s
L2_Parallel_NLP_Examples2_Punctuation_Capitalization_2GPUs_with_DistilBERT_Finetuning_on_other_data
Punctuation_Capitalization_tarred_dataset_create_and_use_tarred_dataset
2m 46s
Punctuation_Capitalization_tarred_dataset_create_and_use_tarred_dataset
Punctuation_Capitalization_Using_model-common_datasets_parameters-label_vocab_dir
2m 22s
Punctuation_Capitalization_Using_model-common_datasets_parameters-label_vocab_dir
Punctuation_Capitalization_inference_Restore_punctuation_and_capitalization_in_long_text
1m 12s
Punctuation_Capitalization_inference_Restore_punctuation_and_capitalization_in_long_text
L2_Pretraining_BERT_pretraining_from_Text
36s
L2_Pretraining_BERT_pretraining_from_Text
L2_Pretraining_BERT_from_Preprocessed
41s
L2_Pretraining_BERT_from_Preprocessed
L2_Entity_Linking_Self_Alignment_Pretraining_BERT
2m 7s
L2_Entity_Linking_Self_Alignment_Pretraining_BERT
L2_NMT_Attention_is_All_You_Need_Training_NMT_Training_Post-LN
1m 4s
L2_NMT_Attention_is_All_You_Need_Training_NMT_Training_Post-LN
L2_NMT_Attention_is_All_You_Need_Training_NMT_Training_Pre-LN
48s
L2_NMT_Attention_is_All_You_Need_Training_NMT_Training_Pre-LN
L2_NMT_Attention_is_All_You_Need_Training_NMT_Multi-Validation
1m 0s
L2_NMT_Attention_is_All_You_Need_Training_NMT_Multi-Validation
L2_NMT_Attention_is_All_You_Need_Inference
1m 27s
L2_NMT_Attention_is_All_You_Need_Inference
L2_NMT_Attention_is_All_You_Need_Finetuning
1m 9s
L2_NMT_Attention_is_All_You_Need_Finetuning
L2_NMT_Tarred_Dataset_Creation_Auto_Tarred_Dataset_Creation
49s
L2_NMT_Tarred_Dataset_Creation_Auto_Tarred_Dataset_Creation
L2_NMT_Tarred_Dataset_Creation_Script_Tarred_Dataset_Creation
1m 15s
L2_NMT_Tarred_Dataset_Creation_Script_Tarred_Dataset_Creation
L2_Megatron_NMT_Training_TP2
4m 9s
L2_Megatron_NMT_Training_TP2
L2_Megatron_BART_Perceiver_MIM_Training_TP2
1m 52s
L2_Megatron_BART_Perceiver_MIM_Training_TP2
L2_Megatron_Bert_Pretraining_and_Resume_Training_with_Pipeline_Parallelism
2m 41s
L2_Megatron_Bert_Pretraining_and_Resume_Training_with_Pipeline_Parallelism
L2_Megatron_Bert_Pretraining_and_Resume_Training
2m 35s
L2_Megatron_Bert_Pretraining_and_Resume_Training
L2_Megatron_Core_Bert_Pretraining_and_Resume_Training
2m 48s
L2_Megatron_Core_Bert_Pretraining_and_Resume_Training
L2_Megatron_RETRO_Pretraining_and_Resume_Training
6m 24s
L2_Megatron_RETRO_Pretraining_and_Resume_Training
L2_Legacy_Megatron_RETRO_Pretraining_and_Resume_Training
1m 43s
L2_Legacy_Megatron_RETRO_Pretraining_and_Resume_Training
L2_BioMegatron_Bert_NER_Task
2m 2s
L2_BioMegatron_Bert_NER_Task
L2_Megatron_GPT_Pretraining_and_Resume_Training_TP2
4m 24s
L2_Megatron_GPT_Pretraining_and_Resume_Training_TP2
L2_Megatron_GPT_with_Rope_Pretraining_and_Resume_Training_TP2
2m 23s
L2_Megatron_GPT_with_Rope_Pretraining_and_Resume_Training_TP2
L2_Megatron_GPT_with_ALiBi_Pretraining_and_Resume_Training_TP2
2m 23s
L2_Megatron_GPT_with_ALiBi_Pretraining_and_Resume_Training_TP2
L2_Megatron_GPT_with_KERPLE_Pretraining_and_Resume_Training_TP2
1m 56s
L2_Megatron_GPT_with_KERPLE_Pretraining_and_Resume_Training_TP2
L2_Megatron_GPT_Pretraining_and_Resume_Training_PP2
4m 3s
L2_Megatron_GPT_Pretraining_and_Resume_Training_PP2
L2_Megatron_GPT_Finetuning_PP2
3m 50s
L2_Megatron_GPT_Finetuning_PP2
L2_Megatron_GPT_Finetuning_StarCoder_PP1
1m 20s
L2_Megatron_GPT_Finetuning_StarCoder_PP1
L2_Megatron_GPT_Embedding
1m 55s
L2_Megatron_GPT_Embedding
L2_Megatron_GPT_PEFT_Lora_PP2
2m 2s
L2_Megatron_GPT_PEFT_Lora_PP2
L2_Megatron_GPT_PEFT_Lora_TP2
2m 21s
L2_Megatron_GPT_PEFT_Lora_TP2
L2_Megatron_GPT_Eval
59s
L2_Megatron_GPT_Eval
L2_Megatron_GPT_Eval_PP2
2m 17s
L2_Megatron_GPT_Eval_PP2
L2_Megatron_GPT_SFT_Eval_inference_seq_len_greaterThan_training_seq_len
1m 14s
L2_Megatron_GPT_SFT_Eval_inference_seq_len_greaterThan_training_seq_len
L2_Megatron_Change_Partitions_Reduce_TP_Num_Partitions_-2_to_1-_and_PP_Num_Partitions_-1_to_2
1m 9s
L2_Megatron_Change_Partitions_Reduce_TP_Num_Partitions_-2_to_1-_and_PP_Num_Partitions_-1_to_2
L2_Megatron_Change_Partitions_Increase_TP_Num_Partitions_-2_to_4-_and_PP_Num_Partitions_-1_to_2
1m 10s
L2_Megatron_Change_Partitions_Increase_TP_Num_Partitions_-2_to_4-_and_PP_Num_Partitions_-1_to_2
L2_Megatron_T5_Pretraining_and_Resume_Training_TP2
2m 34s
L2_Megatron_T5_Pretraining_and_Resume_Training_TP2
L2_Megatron_T5_with_ALiBi_Pretraining_and_Resume_Training_TP2
1m 47s
L2_Megatron_T5_with_ALiBi_Pretraining_and_Resume_Training_TP2
L2_Megatron_T5_with_KERPLE_Pretraining_and_Resume_Training_TP2
2m 31s
L2_Megatron_T5_with_KERPLE_Pretraining_and_Resume_Training_TP2
L2_Megatron_T5_Pretraining_and_Resume_Training_PP2
2m 32s
L2_Megatron_T5_Pretraining_and_Resume_Training_PP2
L2_Megatron_T5_w_Mixture_of_Expert_Pretraining
1m 29s
L2_Megatron_T5_w_Mixture_of_Expert_Pretraining
L2_Megatron_UL2_Pretraining_and_Resume_Training_TP2
1m 50s
L2_Megatron_UL2_Pretraining_and_Resume_Training_TP2
L2_Megatron_T5_Eval
30s
L2_Megatron_T5_Eval
L2_Megatron_BART_Pretraining_and_Resume_Training_TP2
2m 29s
L2_Megatron_BART_Pretraining_and_Resume_Training_TP2
L2_Megatron_BART_Pretraining_and_Resume_Training_PP2
1m 57s
L2_Megatron_BART_Pretraining_and_Resume_Training_PP2
L2_Megatron_T5_GLUE_RTE
40s
L2_Megatron_T5_GLUE_RTE
L2_Megatron_T5_GLUE_XNLI
43s
L2_Megatron_T5_GLUE_XNLI
L2_Megatron_T5_PEFT_Lora_TP2
2m 48s
L2_Megatron_T5_PEFT_Lora_TP2
L2_Megatron_Mock_Data_Generation_MockGPTDataset
2m 11s
L2_Megatron_Mock_Data_Generation_MockGPTDataset
L2_Megatron_Mock_Data_Generation_MockT5Dataset
1m 12s
L2_Megatron_Mock_Data_Generation_MockT5Dataset
L2_TTS_Fast_dev_runs_1_Tacotron_2
1m 6s
L2_TTS_Fast_dev_runs_1_Tacotron_2
L2_TTS_Fast_dev_runs_1_WaveGlow
1m 9s
L2_TTS_Fast_dev_runs_1_WaveGlow
L2_TTS_Fast_dev_runs_1_FastPitch
1m 32s
L2_TTS_Fast_dev_runs_1_FastPitch
L2_TTS_Fast_dev_runs_1_RADTTS
3h 18m
L2_TTS_Fast_dev_runs_1_RADTTS
L2_TTS_Fast_dev_runs_1_Mixer-TTS
1m 37s
L2_TTS_Fast_dev_runs_1_Mixer-TTS
L2_TTS_Fast_dev_runs_1_Hifigan
1m 8s
L2_TTS_Fast_dev_runs_1_Hifigan
Speech_Checkpoints_tests
3m 11s
Speech_Checkpoints_tests
L0_Setup_Test_Data_And_Models
28s
L0_Setup_Test_Data_And_Models
L2_PTQ_Llama2_Export_Only
1m 2s
L2_PTQ_Llama2_Export_Only
L2_PTQ_Llama2_FP8
1m 25s
L2_PTQ_Llama2_FP8
L2_PTQ_Llama2_INT8_SQ
1m 7s
L2_PTQ_Llama2_INT8_SQ
L2_PTQ_Llama2_INT4_AWQ
1m 43s
L2_PTQ_Llama2_INT4_AWQ
Nemo_CICD_Test
0s
Nemo_CICD_Test
Fit to window
Zoom out
Zoom in

Annotations

2 errors and 1 warning
L2_TTS_Fast_dev_runs_1_RADTTS
The run was canceled by @pablo-garay.
L2_TTS_Fast_dev_runs_1_RADTTS
The operation was canceled.
L2_TTS_Fast_dev_runs_1_RADTTS
Delete stale containers failed, docker rm fail with exit code 1 for container e93151f529d7ef1693d282089d4c79eba128a2e78045e2be94b51bce35b9a843