As currently implemented, the CLI is unusable with a medium-size model on a long audio file (e.g. an hour or two).
The main problem is that after transcription the Whisper model is not properly garbage-collected and the CUDA cache is not cleared, so there is not enough memory to hold both the Whisper model and the alignment model at the same time.
This is a significant issue, since many people have consumer RTX 3000-series cards with only 6 GB of VRAM.
A quick fix is to avoid loading all models at once and to properly free the Whisper model before moving on to the next steps:
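A minimal sketch of that fix (all names here are hypothetical, not whisperx's actual API). The key pattern is: drop every reference to the Whisper model, force a collection, and only then load the alignment model. With a real PyTorch model you would additionally call `torch.cuda.empty_cache()` after `gc.collect()` so the caching allocator returns its blocks to the driver; the stand-in class below just demonstrates that the object is actually finalized before the next load.

```python
import gc

class FakeWhisperModel:
    """Stand-in for a loaded Whisper model; __del__ runs when it is freed."""
    freed = False

    def __del__(self):
        FakeWhisperModel.freed = True

def transcribe_then_align(audio_path):
    model = FakeWhisperModel()
    # ... run transcription with `model` on audio_path ...
    segments = ["segment 1", "segment 2"]

    # Free the Whisper model BEFORE loading the alignment model:
    del model     # drop the only remaining reference
    gc.collect()  # collect any reference cycles still keeping it alive
    # With PyTorch, also release cached VRAM here:
    #   torch.cuda.empty_cache()

    # Only now is there room for the alignment model on a 6 GB card:
    # align_model = load_align_model(...)  # hypothetical next step
    return segments

transcribe_then_align("audio.wav")
```

After the `del` + `gc.collect()` pair, `FakeWhisperModel.freed` is `True`, i.e. the model was truly released before the alignment step rather than lingering until the process exits.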