-
Notifications
You must be signed in to change notification settings - Fork 86
Issues: Lightning-AI/lightning-thunder
Label tracking meta-issue (edit me to get automatically CC'ed...
#72
opened Mar 25, 2024 by
carmocca
Open
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
High memory consumption without fusion rematerialization
fusion logic
rematerialization
#1762
opened Feb 11, 2025 by
riccardofelluga
Apply New feature or request
set_execution_file
to specific traces such as forward/backward execution traces
enhancement
#1760
opened Feb 11, 2025 by
crcrpar
Support packing multiple sequences with Flash Attention without cross-contamination
enhancement
New feature or request
high priority
nemo
Issues needed to support NVIDIA NeMo models.
#1758
opened Feb 10, 2025 by
IvanYashchuk
more general inplace support (index_copy_ in litgpt fails to trace)
enhancement
New feature or request
in-place
#1743
opened Feb 4, 2025 by
ali-alshaar7
thunderfx produces results with incorrect
requires_grad
autograd
#1733
opened Feb 1, 2025 by
jjsjann123
Grad Transform generates inconsistent saved_for_backward between forward and backward trace.
autograd
#1732
opened Feb 1, 2025 by
jjsjann123
dividing a float16 tensor by a python float is inaccurate with nvfuser
numerical accuracy
nvfuser
#1724
opened Jan 30, 2025 by
beverlylytle
cudnn SDPA : cudnn sdpa is not used for bigcode/starcoder2-7b
cudnn
huggingface
For supporting HF models
#1722
opened Jan 30, 2025 by
kshitij12345
Run LitGPT benchmarking with a custom Attention implementation priority.
benchmarking
#1714
opened Jan 29, 2025 by
wprazuch
Input upcast is missing in Thunder's implementation of torch.nn.functional.rms_norm
operators
#1713
opened Jan 29, 2025 by
IvanYashchuk
symbolic cache policy can't handle string inputs properly.
symbolic values
#1710
opened Jan 28, 2025 by
jjsjann123
Implement max_norm argument for torch.nn.functional.embedding
enhancement
New feature or request
in-place
operators
#1699
opened Jan 27, 2025 by
IvanYashchuk
check memory location of things tagged STATIC_MEMORY_LOCATION by default
cudagraphs
enhancement
New feature or request
#1686
opened Jan 24, 2025 by
t-vi
Transforming traces should always precede a domination check
enhancement
New feature or request
transforms
#1684
opened Jan 22, 2025 by
ali-alshaar7
Investigate bf16 rms norm numerics
numerical accuracy
operators
thunderfx
for things that could be applicable to the dynamo+thunder frontend
#1678
opened Jan 22, 2025 by
t-vi
Connect New feature or request
nvfuser
thunderfx
for things that could be applicable to the dynamo+thunder frontend
prims.copy_with_setitem
to nvFuser's Executor
enhancement
#1676
opened Jan 22, 2025 by
kevinstephano
thunder.jit
has a relatively high CPU overhead when processing small graphs with small inputs.
performance
#1657
opened Jan 17, 2025 by
kiya00
backward creates inconsistent proxies between args and unpacking them
autograd
tracing architecture
#1633
opened Jan 10, 2025 by
t-vi
avoid joint trace in rematerialize forward backward
rematerialization
#1618
opened Jan 8, 2025 by
t-vi
make traces own proxies and bsyms
enhancement
New feature or request
tracing architecture
#1606
opened Jan 6, 2025 by
t-vi
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.