Observed behavior
  File "/home/tuning/.local/lib/python3.11/site-packages/triton/runtime/cache.py", line 64, in __init__
    os.makedirs(self.cache_dir, exist_ok=True)
  File "<frozen os>", line 215, in makedirs
  File "<frozen os>", line 215, in makedirs
  File "<frozen os>", line 225, in makedirs
PermissionError: [Errno 13] Permission denied: '/.triton'
Describe the bug
When running QLoRA on OpenShift, FMS-HF-Tuning crashes because Triton cannot create its cache directory under /.triton (PermissionError: Permission denied).
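One plausible reading of the path in the traceback, offered as an assumption rather than a confirmed diagnosis: if the cache location defaults to a `.triton` directory under the user's home, and the home directory resolves to `/` for the UID the pod runs as, the result is exactly `/.triton`. The helper below is hypothetical and only illustrates the path arithmetic; it is not Triton's code.

```python
from pathlib import PurePosixPath


def default_triton_dir(home: str) -> str:
    """Hypothetical helper: build a '<home>/.triton' default location."""
    return str(PurePosixPath(home) / ".triton")


# A regular home directory gives a location the user typically owns.
print(default_triton_dir("/home/tuning"))

# A home directory of "/" gives a location directly under the filesystem root,
# which a non-root container user normally cannot create.
print(default_triton_dir("/"))
```

If this is what happens, checking the value of `HOME` inside the running pod would confirm or rule it out.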
Platform
OpenShift AI, quay.io/modh/fms-hf-tuning:v2.0.1
Sample Code
Running this configuration:
Expected behavior
The container image should run fine-tuning without having to specify any extra environment variable
Observed behavior
pod.log
Additional context
The workaround is to define these environment variables and point them to a writable directory:
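A minimal sketch of this kind of workaround, assuming Triton's `TRITON_CACHE_DIR` variable is among the ones involved. The helper name `first_writable_dir` and the candidate path are hypothetical, and any other variables the image needs are not covered here.

```python
import os
import tempfile
from pathlib import Path


def first_writable_dir(candidates):
    """Return the first candidate directory that can be created and written to."""
    for candidate in candidates:
        path = Path(candidate)
        try:
            path.mkdir(parents=True, exist_ok=True)
        except OSError:
            # Cannot create this one (e.g. permission denied); try the next.
            continue
        if os.access(path, os.W_OK | os.X_OK):
            return path
    raise RuntimeError("no writable directory found")


# Hypothetical candidate; replace with a volume the pod is allowed to write to.
candidates = [Path(tempfile.gettempdir()) / "triton-cache"]
cache_dir = first_writable_dir(candidates)

# Keep any value already provided by the pod spec; otherwise use the directory
# found above. This has to happen before Triton sets up its cache.
os.environ.setdefault("TRITON_CACHE_DIR", str(cache_dir))
```

In a pod spec the same effect would come from setting the variable in the container's `env` section, which avoids touching the training code.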