Skip to content

Commit

Permalink
add to docs (axolotl-ai-cloud#703)
Browse files Browse the repository at this point in the history
  • Loading branch information
winglian authored Oct 20, 2023
1 parent 87dbd15 commit 70db131
Show file tree
Hide file tree
Showing 2 changed files with 16 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -901,6 +901,8 @@ CUDA_VISIBLE_DEVICES="" python3 -m axolotl.cli.merge_lora ...
## Common Errors 🧰
See also the [FAQ's](./docs/faq.md).

> If you encounter a 'Cuda out of memory' error, it means your GPU ran out of memory during the training process. Here's how to resolve it:
Please reduce any below
Expand Down
14 changes: 14 additions & 0 deletions docs/faq.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
# Axolotl FAQ's


> The trainer stopped and hasn't progressed in several minutes.
Usually an issue with the GPU's communicating with each other. See the [NCCL doc](../docs/nccl.md)

> Exitcode -9
This usually happens when you run out of system RAM.

> Exitcode -7 while using deepspeed
Try upgrading deepspeed w: `pip install -U deepspeed`

0 comments on commit 70db131

Please sign in to comment.