Launch requirements #18

Open
maxbaluev opened this issue Nov 17, 2022 · 11 comments

Comments

@maxbaluev

Can anyone share the hardware specifications needed for each of the model sizes?

@ohmygoobness

bump, would like to know the VRAM requirements of each model

@lewiswatson55

For interest, I tested using my 3090 Ti with 24 GB of dedicated VRAM:

Running the standard model for inference at full precision immediately hit a CUDA out-of-memory error. Loading it in fp16 works, and usage seems to hover around 15 GB (standard, fp16). Not a scientific test, though haha.
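That out-of-memory behavior matches a back-of-envelope estimate. Going by the parameter counts in the Galactica paper (the standard model is 6.7B parameters), the weights alone need roughly 25 GiB in fp32 but only about 12.5 GiB in fp16; observed usage is higher because of activations and CUDA overhead. A minimal sketch of that arithmetic:

```python
# Back-of-envelope GPU memory needed just for the model weights.
# The parameter count for "standard" (6.7B) is from the Galactica paper;
# real usage adds activations, the KV cache, and CUDA overhead.
STANDARD_PARAMS = 6.7e9

def weight_gb(params: float, bytes_per_param: int) -> float:
    """GiB occupied by the weights alone at the given precision."""
    return params * bytes_per_param / 1024**3

fp32_gb = weight_gb(STANDARD_PARAMS, 4)  # ~25 GiB -> OOM on a 24 GB 3090 Ti
fp16_gb = weight_gb(STANDARD_PARAMS, 2)  # ~12.5 GiB -> fits; ~15 GB observed
print(f"standard fp32: {fp32_gb:.1f} GiB, fp16: {fp16_gb:.1f} GiB")
```

This is only the weight footprint, so treat it as a lower bound on required VRAM.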

@nealmcb

nealmcb commented Nov 24, 2022

Here is one more anecdotal data point:

Getting started with the Galactica language model - Prog.World says:

The “basic” version consumes about 11 GB of memory.
.... in the “standard” version, our laptop simply ran out of memory

FWIW, they also say:

Galactica currently works with Python versions 3.8 and 3.9. Model installation is not possible with version 3.10 and above. This limitation is currently due to a requirement of the promptsource library.

@lewiswatson55

lewiswatson55 commented Nov 24, 2022

I haven't tested without fp16, or had time to do more tests, but standard in fp16 is giving some fantastic results. I'd imagine base will be similarly good, though.

Edit: the article looks like a good setup guide; it mentions a couple of issues I also had to work out.

@nealmcb

nealmcb commented Nov 25, 2022

Note that the "base" model works in a free Colab notebook, after selecting Runtime / Change runtime type and picking "GPU".

@vladislavivanistsev

Note that the "base" model works in a free Colab notebook, after selecting Runtime / Change runtime type and picking "GPU".

Would you be so kind and share an example of a colab notebook?

@nealmcb

nealmcb commented Nov 27, 2022

@vladislavivanistsev Here is an example of a colab notebook that you should be able to run for free with a GPU runtime: galactica on Colab

I also note, from the paper:

For training the largest 120B model, we use 128 NVIDIA A100 80GB nodes. For inference Galactica 120B
requires a single A100 node.

@agisga

agisga commented Jan 11, 2023

Ran the model in fp16 and it runs! Seems to hover around 15 GB (standard, fp16). Not a scientific test, though haha.

My experience is similar: the (standard, fp16) model runs for me with no issues across two GPUs (one RTX 2080 Ti + one GTX 1080 Ti), using about 15 GB total (8 GB + 7 GB). Not a scientific test either :) but wanted to mention it in case it helps someone.

@hwasiti

hwasiti commented Jan 12, 2023

Where do you specify that the model should be loaded in fp16 rather than fp32?

@hwasiti

hwasiti commented Jan 12, 2023

Ah, it seems to be something like this:

model = gal.load_model("huge", num_gpus=4, dtype='float16')

@kno10

kno10 commented Mar 22, 2023

Please add a column listing the inference memory requirements for the models, so people can more easily judge how much GPU RAM they need for each version.
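Until such a column exists, a rough version can be derived from the parameter counts in the Galactica paper (mini 125M, base 1.3B, standard 6.7B, large 30B, huge 120B): fp16 needs about two bytes per parameter for the weights, before any activation or framework overhead. A hedged sketch that prints such a table:

```python
# Approximate fp16 weight memory per Galactica model.  Parameter counts
# are from the paper; real inference needs extra room for activations,
# the KV cache, and framework overhead, so treat these as lower bounds.
SIZES = {
    "mini": 125e6,
    "base": 1.3e9,
    "standard": 6.7e9,
    "large": 30e9,
    "huge": 120e9,
}

def fp16_weight_gb(params: float) -> float:
    """GiB for the weights alone at 2 bytes per parameter."""
    return params * 2 / 1024**3

for name, params in SIZES.items():
    print(f"{name:9s} ~{fp16_weight_gb(params):6.1f} GiB")
```

The "huge" row (~224 GiB) is consistent with the paper's note that 120B inference needs a full A100 node rather than a single GPU.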
