To determine GPU requirements for serving LLMs (like Llama3 70B), we need to calculate the required GPU memory using this formula:
M = ((P × 4B) / (32/Q)) × 1.2
Where:
- M = GPU memory required (in GB)
- P = Number of model parameters (in billions)
- 4B = Bytes per parameter (4)
- 32 = Bits in 4 bytes
- Q = Quantization bits (16, 8, or 4)
- 1.2 = 20% overhead factor for additional GPU memory
For a 70B parameter model at 8-bit quantization:
M = ((70 × 4) / (32/8)) × 1.2 = 84 GB

That is, 280 GB of full-precision weights are compressed 4× by 8-bit quantization (dividing by 32/Q converts the 32-bit baseline to the quantized precision), then a 20% overhead factor is applied.
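As a quick sanity check, here is a minimal TypeScript sketch of the same calculation. The function name `estimateGpuMemoryGB` and its parameters are illustrative assumptions, not part of this project's API:

```ts
// Hypothetical helper implementing the formula above; names are
// illustrative, not part of this project.
function estimateGpuMemoryGB(
  paramsBillions: number, // P, in billions (70 for Llama3 70B)
  quantBits: 16 | 8 | 4,  // Q, quantization precision
  overhead = 1.2,         // 20% overhead factor
): number {
  const bytesPerParam = 4; // 4B: full-precision (32-bit) weights
  const fullPrecisionGB = paramsBillions * bytesPerParam; // P × 4B
  return (fullPrecisionGB / (32 / quantBits)) * overhead;
}

console.log(estimateGpuMemoryGB(70, 8));  // 84  (GB, Llama3 70B at 8-bit)
console.log(estimateGpuMemoryGB(70, 16)); // 168 (GB, at 16-bit)
```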
- `pnpm install` - Install dependencies
- `pnpm run dev` - Start development server
- `pnpm run lint` - Lint source files