Issues: axolotl-ai-cloud/axolotl
Issues list
AxolotlGRPOTrainer still shuffles combined datasets even with curriculum_sampling flag enabled
bug
Something isn't working
#2376
opened Mar 2, 2025 by
sidmadala
6 of 8 tasks
Unable to preprocess GRPO dataset
bug
Something isn't working
#2368
opened Feb 28, 2025 by
junethai-mendel
6 of 8 tasks
Model is not getting saved after fine-tuning with weights and biases config: wandb_log_model
bug
Something isn't working
#2337
opened Feb 17, 2025 by
HeenaRajan
6 of 8 tasks
no pad_token or eos_token in wandb eval table "Eval - Predictions vs Ground Truth"
bug
Something isn't working
#2330
opened Feb 13, 2025 by
BaiMoHan
6 of 8 tasks
Mistral-Small-3 support
enhancement
New feature or request
#2308
opened Feb 3, 2025 by
win4r
5 tasks done
axolotl CLI autocomplete
enhancement
New feature or request
#2297
opened Jan 29, 2025 by
winglian
5 tasks done
Refactor training_args_cls logic in trainer_builder.py into a utility function.
enhancement
New feature or request
#2288
opened Jan 27, 2025 by
SalmanMohammadi
Adapter mismatch when merging
bug
Something isn't working
#2277
opened Jan 22, 2025 by
teachsheryl
6 of 8 tasks
>=4-nodes(4*4gpu) training hangs at zero_first
bug
Something isn't working
#2275
opened Jan 22, 2025 by
sankexin
6 of 8 tasks
Refactor Dataset Configuration for Modular Typing, Discriminated Unions, and Backward Compatibility
enhancement
New feature or request
#2271
opened Jan 21, 2025 by
NJordan72
5 tasks done
Unable to run Multi-GPU ORPO training on Gemma model
bug
Something isn't working
#2267
opened Jan 17, 2025 by
chimezie
6 of 8 tasks
FSDP+LORA on multiple gpu(A100 80gb*4) ValueError: Cannot flatten integer dtype tensors
bug
Something isn't working
#2250
opened Jan 10, 2025 by
Paxwell-Paxwell
6 of 8 tasks
Add an example for reward model chat template in docs
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
#2240
opened Jan 6, 2025 by
SalmanMohammadi
3 tasks done
[Bug] Resuming training on a pretraining loop does not continue data loading from where it left off
bug
Something isn't working
#2229
opened Jan 2, 2025 by
NanoCode012
6 of 8 tasks
Accelerate v1.2.1 Causes Consistent Errors
bug
Something isn't working
#2215
opened Dec 23, 2024 by
williambarberjr
6 of 8 tasks
max_grad_norm doesn't appear to be clipping gradients
bug
#2214
opened Dec 22, 2024 by
DevonPeroutky
6 of 8 tasks
"RuntimeError: Invalid device argument : did you call init?" when setting CUDA_VISIBLE_DEVICES
bug
Something isn't working
waiting for reporter
#2199
opened Dec 18, 2024 by
zhanghanxing2022
6 of 8 tasks
load_from_disk for rl type training
enhancement
New feature or request
#2192
opened Dec 15, 2024 by
leeparkuky
5 tasks done
APOLLO optimizer
enhancement
New feature or request
#2175
opened Dec 11, 2024 by
fblgit
5 tasks done
Starting with DPO datasets fails with a TypeError.
bug
Something isn't working
waiting for reporter
#2174
opened Dec 11, 2024 by
Yuto-24
6 of 8 tasks
Show sample batch content
enhancement
New feature or request
#2145
opened Dec 7, 2024 by
fzyzcjy
5 tasks done
Support ORPO/DPO Liger losses (and LigerORPOTrainer)
enhancement
New feature or request
wip
#2141
opened Dec 6, 2024 by
ccdv-ai
5 tasks done
Various bugs with ORPO
bug
Something isn't working
#2105
opened Nov 26, 2024 by
ccdv-ai
6 of 8 tasks
Mistral Nemo LoRA training has super high grad_norm
bug
Something isn't working
#2095
opened Nov 21, 2024 by
Nero10578
6 of 8 tasks
Deepspeed zero3 + LoRA: RuntimeError: Only Tensors of floating point and complex dtype can require gradients
bug
Something isn't working
waiting on upstream
wip
#2068
opened Nov 16, 2024 by
bursteratom
6 of 8 tasks