Skip to content

Commit

Permalink
cuda: fix vmm oom issue on NVIDIA AGX Orin (ggerganov#4687)
Browse files Browse the repository at this point in the history
Signed-off-by: hydai <hydai@secondstate.io>
  • Loading branch information
hydai authored Dec 29, 2023
1 parent 04ac060 commit 91bb39c
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion ggml-cuda.cu
Original file line number Diff line number Diff line change
Expand Up @@ -6662,7 +6662,7 @@ static void ggml_cuda_pool_free_leg(int device, void * ptr, size_t size) {
// pool with virtual memory
static CUdeviceptr g_cuda_pool_addr[GGML_CUDA_MAX_DEVICES] = {0};
static size_t g_cuda_pool_used[GGML_CUDA_MAX_DEVICES] = {0};
static const size_t CUDA_POOL_VMM_MAX_SIZE = 1ull << 36; // 64 GB
static const size_t CUDA_POOL_VMM_MAX_SIZE = 1ull << 35; // 32 GB

static void * ggml_cuda_pool_malloc_vmm(int device, size_t size, size_t * actual_size) {
scoped_spin_lock lock(g_cuda_pool_lock);
Expand Down

0 comments on commit 91bb39c

Please sign in to comment.