Add Troubleshooting Section to README #437

Dhie-boop · 2025-01-28T20:55:50Z

This PR enhances the README by adding a Troubleshooting section to help users resolve common issues they may encounter while using DeepSeek-V3.

New Section Added: Troubleshooting

The following issues and solutions are included:

- Model weights not found: Instructions on downloading model weights from Hugging Face and placing them correctly.
- CUDA errors during inference: Steps to ensure CUDA is set up correctly and PyTorch is configured for GPU use.
- Slow inference performance: Recommendations for hardware optimization and using FP8/BF16 modes for faster inference.
- Out of memory errors: Guidance on reducing batch sizes or leveraging model parallelism for multi-GPU setups.

Why This Change?
Users may face these common issues when running DeepSeek-V3. Adding a dedicated troubleshooting section improves usability and reduces potential support queries.
The troubleshooting tips are specific to the DeepSeek-V3 workflow and link to external resources for further guidance.
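
To illustrate the kind of check the first two troubleshooting items point at (missing weights, CUDA not visible to PyTorch), here is a minimal preflight sketch. It is not part of the README change itself. The helper names `find_weight_files` and `cuda_status` are hypothetical, and the assumption that the weights are stored as `*.safetensors` files directly inside the weights directory should be verified against the actual download.

```python
from pathlib import Path


def find_weight_files(weights_dir):
    """Return the sorted *.safetensors files directly under weights_dir.

    Assumption: weights are stored as .safetensors shards in one flat
    directory. Returns an empty list if the directory does not exist.
    """
    root = Path(weights_dir)
    if not root.is_dir():
        return []
    return sorted(root.glob("*.safetensors"))


def cuda_status():
    """Report whether PyTorch is importable and whether it sees a CUDA device."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed in this environment.
        return {"torch_installed": False, "cuda_available": False, "device_count": 0}
    available = torch.cuda.is_available()
    return {
        "torch_installed": True,
        "cuda_available": available,
        "device_count": torch.cuda.device_count() if available else 0,
    }


if __name__ == "__main__":
    import sys

    # Hypothetical usage: pass the weights directory as the first argument.
    weights_dir = sys.argv[1] if len(sys.argv) > 1 else "."
    shards = find_weight_files(weights_dir)
    print(f"weight files found in {weights_dir}: {len(shards)}")
    print(f"CUDA status: {cuda_status()}")
```

Running a check like this before inference separates "weights are in the wrong place" from "PyTorch cannot see the GPU", which are the first two failure modes the new section covers.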

Please let me know if there are additional issues to include or if further edits are required.

Add Troubleshooting Section to README

7e137bb
