Skip to content

Lhotse and the best way to use it #10087

Answered by pzelasko
FredSRichardson asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @FredSRichardson, it's been a while!

If I already have Lhotse format datasets (i.e. in the format of Lhotse JSON manifests), is it optimal to use those or is it actually better to use something else like NeMo tarred manifests with the NeMo lhotse data loading option enabled?

You can use your existing Lhotse data; we support all Lhotse formats and all NeMo formats. You may find this doc helpful to navigate the relevant options: https://docs.nvidia.com/nemo-framework/user-guide/latest/nemotoolkit/asr/datasets.html#enabling-lhotse-via-configuration

What type of data set format did you use for most of your recent large scale training runs which used lhotse?

We mainly use tarred formats…

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@FredSRichardson
Comment options

@pzelasko
Comment options

@FredSRichardson
Comment options

@pzelasko
Comment options

@scarecrow1123
Comment options

Answer selected by FredSRichardson
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants