You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[RMP] Enable users to pass embedding tables directly into the input block in order to more easily support new functionality (non-trainable embeddings, different dimensions, model parallel, etc)
#479
Customers want to be able to create embeddings that support a wide variety of options. The InputBlock handles the selection of these features through arguments, however we're up to 18 arguments and it's only going to get worse.
By separating input block definition from embedding creation we can allow for a much more flexible set of embeddings support with a simpler API.
The RMP #404 introduced the InputBlockV2 and also the Embeddings function that allow the user to provide a custom block for embeddings or infer the embeddings tables and their cardinality from the schema, exposing new options like setting trainable=False for embeddings.
But those new blocks are still experimental, for not being used yet by Merlin Models high-level API and have much less tests than the previous InputBlock
InputBlockV2 - Extend test coverage models#605 - Extend existing tests for InputBlockV2, by creating copies of existing tests for InputBlock, to ensure it is well tested - Moved this ticket to ENG-CI and closing the RMP ticket
gabrielspmoreira
changed the title
[RMP] Leverage InputBlockV2 and EmbeddingTable in higher-level API
[RMP] Leverage InputBlockV2 and EmbeddingTable in higher-level API and expand testing
Jul 25, 2022
EvenOldridge
changed the title
[RMP] Leverage InputBlockV2 and EmbeddingTable in higher-level API and expand testing
[Task] Leverage InputBlockV2 and EmbeddingTable in higher-level API and expand testing
Jul 29, 2022
gabrielspmoreira
changed the title
[Task] Leverage InputBlockV2 and EmbeddingTable in higher-level API and expand testing
[RMP] Leverage InputBlockV2 and EmbeddingTable in higher-level API and expand testing
Aug 1, 2022
EvenOldridge
changed the title
[RMP] Leverage InputBlockV2 and EmbeddingTable in higher-level API and expand testing
[RMP] Enable users to pass embedding tables directly into the input block in order to more easily support new functionality (non-trainable embeddings, different dimensions, model parallel, etc)
Aug 3, 2022
Problem:
Customers want to be able to create embeddings that support a wide variety of options. The InputBlock handles the selection of these features through arguments, however we're up to 18 arguments and it's only going to get worse.
By separating input block definition from embedding creation we can allow for a much more flexible set of embeddings support with a simpler API.
The RMP #404 introduced the
InputBlockV2
and also theEmbeddings
function that allow the user to provide a custom block for embeddings or infer the embeddings tables and their cardinality from the schema, exposing new options like setting trainable=False for embeddings.But those new blocks are still experimental, for not being used yet by Merlin Models high-level API and have much less tests than the previous
InputBlock
Goal:
Embeddings
function is very handy to infer the embeddings definition from the schema. Make it more flexible by adding atrainable: Optional[List[str]] = None
argument, to setEmbeddingTable.trainable=False
(default is True) for the features present in the list. models#627EmbeddingTable
andInputBlockV2
DCNModel
,DeepFMModel
- to useInputBlockV2
and to expose theembeddings
block to the users.InputBlockV2
, by creating copies of existing tests forInputBlock
, to ensure it is well tested - Moved this ticket to ENG-CI and closing the RMP ticketStarting Point:
InputBlockV2
and also theEmbeddings
function. The PR Supports task-specific sample weights, weighed_metrics and fixes InputBlockV2 models#600 introduced some fixes toInputBlockV2
Note:
non-trainable embeddings is the pre-trained embeddings
The text was updated successfully, but these errors were encountered: