-
Notifications
You must be signed in to change notification settings - Fork 2.7k
Issues: NVIDIA/NeMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
extra_loggers is not used to log metrics or hyperparameters
bug
Something isn't working
#12046
opened Feb 4, 2025 by
chajath
llava-like dataset implementation "LazySupervisedDataset" likely fails to handle large dataset
#12034
opened Feb 3, 2025 by
bernardhan33
cfg
must have tokenizer
config to create a tokenizer !
bug
#12019
opened Feb 2, 2025 by
kirayomato
LLM pretraining encounter Something isn't working
ImportError: cannot import name 'AttnBackend' from 'megatron.core.transformer.enums'
bug
#12000
opened Jan 31, 2025 by
j40903272
num_sanity_val_steps too large issue
bug
Something isn't working
#11978
opened Jan 28, 2025 by
shanesyy
Add option for prefetch factor of data loader to config
#11977
opened Jan 28, 2025 by
shengshiqi-google
Megatron BERT Embedding conversion inconsistency
bug
Something isn't working
#11970
opened Jan 28, 2025 by
aditya-malte
[QST] Found no performance gain training Mixtral-8x7B with FP8 on H800
#11959
opened Jan 26, 2025 by
umiswing
Pickling error when trying to save checkpoints with custom checkpointIO
bug
Something isn't working
#11955
opened Jan 24, 2025 by
jdnurme
Gemma 2 NeMo 2.0 to HF conversion bug
bug
Something isn't working
#11951
opened Jan 24, 2025 by
domenVres
MegatronGPTModel trains much worse when reducing micro_batch_size
bug
Something isn't working
#11939
opened Jan 23, 2025 by
m-harmonic
Have a nemo training container without additional framework elements
#11933
opened Jan 23, 2025 by
gabwow
Unserializable Error with using Energon Dataloader for NeVA (LLaVA) pretraining / fine-tuning and NeMo 2.0
bug
Something isn't working
#11931
opened Jan 22, 2025 by
bernardhan33
Installation instruction for conda/pip does not work
bug
Something isn't working
#11929
opened Jan 22, 2025 by
erikchwang
Tenacity/s3fs not in requirements
bug
Something isn't working
#11926
opened Jan 22, 2025 by
clayrosenthal
max_steps and time calculation are not working as expected.
#11900
opened Jan 20, 2025 by
Shahad-Mohammed
XLarge Fastconformer Long FT does not converge with default parameters
#11894
opened Jan 20, 2025 by
TornikeAm
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.