-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Issues: NVIDIA/NeMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Mamba inference server init_process_group error for 1 gpu
bug
Something isn't working
#10549
opened Sep 20, 2024 by
SkanderBS2024
RuntimeError: Internal: could not parse ModelProto from /tmp/tmpnf2ficir/tokenizer.decoder.32000.BPE.model
bug
Something isn't working
#10514
opened Sep 17, 2024 by
Yiran-ASU
always_save_nemo
not working properly
bug
#10481
opened Sep 14, 2024 by
aimarz
Megatron -> .nemo checkpoint conversion script Something isn't working
megatron_lm_ckpt_to_nemo.py
fails
bug
#10480
opened Sep 13, 2024 by
aimarz
ImportError: cannot import name '_TORCH_GREATER_EQUAL_2_0' from 'lightning_fabric.utilities.imports' (/usr/local/lib/python3.10/dist-packages/lightning_fabric/utilities/imports.py)
bug
Something isn't working
#10478
opened Sep 13, 2024 by
Desperadoze
Add a checkpoint averaging script for the new .distcp checkpoint format
#10462
opened Sep 11, 2024 by
Kipok
Problem running LoRA PEFT on Llama 3 8B Instruct using NeMo docker container
bug
Something isn't working
#10428
opened Sep 9, 2024 by
anagigoncalves
megatron.core.dist_checkpointing.core.CheckpointingException: Object shard /ckpt/model_weights/model.decoder.layers.self_attention.core_attention._extra_state/shard_0_80.pt not found
bug
Something isn't working
#10386
opened Sep 7, 2024 by
OrenLeung
RuntimeError: stack expects each tensor to be equal size (when using lhotse shar data sets)
bug
Something isn't working
#10382
opened Sep 6, 2024 by
riqiang-dp
00_NeMo_Primer.ipynb in Google Collab fail
bug
Something isn't working
#10368
opened Sep 6, 2024 by
andrea-mucci
Unusually high initial loss during continual pre-training of the Gemma2-2B model.
#10332
opened Sep 2, 2024 by
longxudou
Nemo ASR: TypeError: ConfidenceConfig.__init__() got an unexpected keyword argument 'tdt_include_duration'
#10330
opened Sep 1, 2024 by
ican24
SFT training getting nan loss when using PP=4, TP=4 and model params > 7b
bug
Something isn't working
#10310
opened Aug 29, 2024 by
PurvangL
fastconformer hybrid recipe reports strange val_WER with Something isn't working
nemo:24.07
and nemo:dev
bug
#10299
opened Aug 29, 2024 by
itzsimpl
[rank1]: AttributeError: 'NoneType' object has no attribute 'get' (finetuning Mamba Hybrid)
bug
Something isn't working
#10285
opened Aug 28, 2024 by
SkanderBS2024
NeMo/tutorials/speaker_tasks/ASR_with_SpeakerDiarization needs confidence estimation
#10283
opened Aug 28, 2024 by
jugal-sheth
dim unmatch when doing sft with tensor parallel and sequence parallel and LoRA
bug
Something isn't working
#10280
opened Aug 28, 2024 by
zhuango
fix where the transcription is saved please
bug
Something isn't working
#10277
opened Aug 27, 2024 by
BBC-Esq
be prepared when a user selects a file that isn't strictly mono and other file extensions as well
bug
Something isn't working
#10276
opened Aug 27, 2024 by
BBC-Esq
fix exp_manager.py to work on Windows
bug
Something isn't working
#10275
opened Aug 27, 2024 by
BBC-Esq
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.