-
Notifications
You must be signed in to change notification settings - Fork 118
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RMP] Performant large embedding table support #733
Comments
This looks good! Two questions:
|
As an success criteria, we need to have benchmarks for each of the point above:
Customer ask us the questions and if we need to answer them, if we provide the functionality. Only if we add run the experiments, we can ensure that the implementation is correct. |
@marcromeyn , please define this ticket and also create another ticket for SOK |
@EvenOldridge , please help to define this ticket |
@edknv , please check with HCTR team and confirm milestone |
Problem:
Goal:
New Functionality
Constraints:
##Architectural consideration
NA
Starting Point:
Model Parallel Support
Evaluation of HugeCTR, TorchRec, Distributed Embeddings, TFRA, PersiaML for inclusion in MerlinFeature engineering that reduces embedding sizeReduced Precision SupportNot storing user embeddings
Inference Support
Serving
Example
The text was updated successfully, but these errors were encountered: