
[RMP] Performant large embedding table support #733

Open
3 of 16 tasks
EvenOldridge opened this issue May 5, 2022 · 5 comments
@EvenOldridge
Member

EvenOldridge commented May 5, 2022

Problem:

Goal:

New Functionality

  • Models
    • ...
  • Transformers4Rec
    • ...
  • NVTabular
    • ...
  • Systems
    • ...

Constraints:

Architectural considerations:
NA

Starting Point:

Model Parallel Support

Feature engineering that reduces embedding size

  • Mixed Dimension Embeddings
  • Frequency Capping
  • Frequency Hashing
  • Bloom Embeddings
  • TT-Rec
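The hashing-based items in the list above (Frequency Hashing, Bloom Embeddings) can be sketched roughly as follows. This is a hypothetical NumPy illustration, not the Merlin/Models API; `NUM_BUCKETS`, `hash_bucket`, and the other names are made up for the sketch:

```python
import numpy as np

# Sketch of the hashing trick for large vocabularies: instead of one row
# per category ID, IDs are hashed into a much smaller table, trading a
# controlled collision rate for memory savings. Bloom embeddings extend
# this with multiple hash seeds whose looked-up vectors are combined,
# making collisions between two IDs far less damaging.

rng = np.random.default_rng(0)

NUM_BUCKETS = 1_000   # compressed table size (assumption for the sketch)
EMBED_DIM = 16
NUM_HASHES = 3        # number of hash functions for Bloom embeddings

table = rng.normal(scale=0.05, size=(NUM_BUCKETS, EMBED_DIM)).astype(np.float32)

def hash_bucket(category_id: int, seed: int) -> int:
    # Simple deterministic hash for illustration; a real system would
    # use a stronger hash family.
    return hash((seed, category_id)) % NUM_BUCKETS

def frequency_hash_lookup(category_id: int) -> np.ndarray:
    # Plain hashing trick: one bucket per ID.
    return table[hash_bucket(category_id, seed=0)]

def bloom_lookup(category_id: int) -> np.ndarray:
    # Bloom embeddings: sum the rows selected by several hash functions,
    # giving each ID a near-unique combined representation.
    rows = [table[hash_bucket(category_id, seed=s)] for s in range(NUM_HASHES)]
    return np.sum(rows, axis=0)

# A vocabulary of e.g. 10M IDs fits in a 1k-row table, at the cost of collisions.
vec = bloom_lookup(9_999_999)
print(vec.shape)  # (16,)
```

The other items (Mixed Dimension Embeddings, TT-Rec) change the table's shape or factorization rather than the lookup path, so they are not shown here.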

Reduced Precision Support

  • Sparse Row-wise Optimizers (Facebook Research DLRM)
  • Reduced Precision Optimizers
  • Reduced Embedding Precision
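Reduced embedding precision can be sketched as storing the table in float16 while upcasting lookups to float32, so only the memory footprint changes, not the compute precision downstream. A hypothetical NumPy sketch (names are illustrative, not a Merlin API):

```python
import numpy as np

# Sketch of reduced embedding precision: the table is stored in float16
# (halving memory relative to float32) while lookups are upcast so the
# rest of the model computes at full precision.

rng = np.random.default_rng(0)

VOCAB, DIM = 100_000, 32
table_fp32 = rng.normal(scale=0.05, size=(VOCAB, DIM)).astype(np.float32)
table_fp16 = table_fp32.astype(np.float16)  # 50% of the fp32 footprint

def lookup(ids: np.ndarray) -> np.ndarray:
    # Upcast at lookup time; downstream layers see float32 activations.
    return table_fp16[ids].astype(np.float32)

emb = lookup(np.array([1, 42, 99_999]))
print(table_fp16.nbytes / table_fp32.nbytes)  # 0.5
print(emb.dtype)  # float32
```

Reduced-precision optimizers apply the same idea to the optimizer's per-row state (e.g. momentum/variance accumulators), which for sparse row-wise optimizers can be a single scalar per embedding row.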

Not storing user embeddings

  • Represent user as item embedding aggregations (YouTube DNN)
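The idea above, as described in the YouTube DNN paper, is that no per-user row is trained or stored; the user vector is computed on the fly from the item table. A hypothetical NumPy sketch (mean-pooling is one common aggregation choice; the names are made up):

```python
import numpy as np

# Sketch of representing a user without a stored user embedding:
# the user vector is the mean of the embeddings of the items the user
# interacted with, so only the item table needs to exist.

rng = np.random.default_rng(0)

NUM_ITEMS, DIM = 50_000, 32
item_table = rng.normal(scale=0.05, size=(NUM_ITEMS, DIM)).astype(np.float32)

def user_embedding(history_item_ids: np.ndarray) -> np.ndarray:
    # Mean-pool the item embeddings from the user's interaction history;
    # no per-user parameters are trained or stored.
    return item_table[history_item_ids].mean(axis=0)

u = user_embedding(np.array([3, 17, 42]))
print(u.shape)  # (32,)
```

This removes the user table entirely, which is often the largest embedding table in a recommender.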

Inference Support

  • Hierarchical Parameter Server Support

Serving

Example

@karlhigley
Contributor

This looks good! Two questions:

  • How does this relate to Refactor InputBlock models#282 (currently slated to be completed at the end of [RMP] Make losses, metrics, masking, and negative sampling configurable from model.compile() #271)? Asking because @marcromeyn said the input block refactor "would also enable kickstarting the work of integrating model-parallelism for large-embedding tables (for instance through the HugeCTR SOK.)" Wondering to what extent the input block changes depend on the rest of the Models API changes, and if we can pull the input block work forward somehow to unblock whichever parts of model parallel support depend on it.
  • Are there further methods for not storing user embeddings planned here? If aggregating item embeddings is the main/only one, we might want to capture that in [RMP] Add YouTube DNN ranking model to Merlin Models #279 instead. This looks like a ton of useful stuff that we haven't really captured anywhere before, but that one piece we can probably tackle as part of the YouTube DNN work.

@bschifferer
Contributor

As success criteria, we need benchmarks for each of the points above:

  • How does throughput change? (E.g. TF Keras vs. SOK vs. TFDE vs. reduced-precision optimizer vs. reduced-precision embedding)
  • What is the AUC/performance of the model? (E.g. TF Keras vs. SOK vs. TFDE vs. reduced-precision optimizer vs. reduced-precision embedding)

Customers ask us these questions, and we need to be able to answer them if we provide the functionality. Only by running the experiments can we ensure that the implementation is correct.

@EvenOldridge EvenOldridge removed this from the Merlin 22.12 milestone Nov 2, 2022
@viswa-nvidia viswa-nvidia assigned marcromeyn and unassigned benfred Nov 8, 2022
@viswa-nvidia

@marcromeyn , please define this ticket and also create another ticket for SOK

@viswa-nvidia viswa-nvidia transferred this issue from NVIDIA-Merlin/models Nov 15, 2022
@viswa-nvidia viswa-nvidia added this to the Merlin 23.02 milestone Nov 15, 2022
@viswa-nvidia viswa-nvidia removed the epic label Dec 15, 2022
@viswa-nvidia

@EvenOldridge , please help to define this ticket

@viswa-nvidia

@edknv , please check with HCTR team and confirm milestone

@edknv edknv modified the milestones: Merlin 23.03, Merlin 23.04 Mar 21, 2023