Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

⭐️ Add entity embeddings workflow example #278

Open
wants to merge 1 commit into
base: dev
Choose a base branch
from

Conversation

EssamWisam
Copy link
Collaborator

@EssamWisam EssamWisam commented Oct 26, 2024

It dawned on me that entity embeddings is a noteworthy feature of this package and that it deserves a workflow example illustrating: (i), how it can be used and (ii), hints on how it works.

In particular, I was going to proceed with making an interface at MLJTransforms but felt I need to refresh my self on how this works first so I thought making this tutorial is a good way.

@EssamWisam
Copy link
Collaborator Author

image

This seems to be the source of the test fail with the error being MLJBase.fit(model, 0, X, y) UndefRefError: access to undefined reference.

That said, I believe I have only modified the docs folder so that should be completely orthogonal on this fail.

@ablaom ablaom mentioned this pull request Oct 28, 2024
@ablaom ablaom closed this Nov 13, 2024
@ablaom ablaom reopened this Nov 13, 2024
# It employs a set of embedding layers to map each categorical feature into a dense continuous vector in a similar fashion to how they are employed in NLP architectures.

# In MLJFlux, the `NeuralNetworkClassifier`, `NeuralNetworkRegressor`, and the `MultitargetNeuralNetworkRegressor`` can be trained and evaluated with heterogenous data (i.e., containing categorical features) because they have a built-in entity embedding layer.
# Moreover, they now offer a transform which encode the categorical features with the learnt embeddings to be used by an upstream machine learning model.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
# Moreover, they now offer a transform which encode the categorical features with the learnt embeddings to be used by an upstream machine learning model.
# Moreover, they now offer a `transform` method which encodes the categorical features with the learned embeddings to be used by an upstream machine learning model.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a little confused by what is meant by "to be used by an upstream machine learning model".

Copy link
Collaborator

@ablaom ablaom left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks great! Thanks for your patience with the review.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants