Fix broken PoC notebook due to mismatching output names between config file and saved ranking model #112
Do you have an example of the output column names that will trigger this? Also, can we add a unit test that covers this case?
Yes, the unit test is already available; it was just not in the correct folder. PR NVIDIA-Merlin/Merlin#364 pushes it to the
The output column name that will trigger this is
CI Results
GitHub pull request #112 of commit 3b85d25ea82d561a3edecb64c553bb7c3e72bbad, no merge conflicts.
Running as SYSTEM
Setting status of 3b85d25ea82d561a3edecb64c553bb7c3e72bbad to PENDING with url https://10.20.13.93:8080/job/merlin_systems/64/console and message: 'Pending'
Using context: Jenkins
Building on master in workspace /var/jenkins_home/workspace/merlin_systems
using credential fce1c729-5d7c-48e8-90cb-b0c314b1076e
 > git rev-parse --is-inside-work-tree # timeout=10
Fetching changes from the remote Git repository
 > git config remote.origin.url https://github.com/NVIDIA-Merlin/systems # timeout=10
Fetching upstream changes from https://github.com/NVIDIA-Merlin/systems
 > git --version # timeout=10
using GIT_ASKPASS to set credentials login for merlin-systems user + githubtoken
 > git fetch --tags --force --progress -- https://github.com/NVIDIA-Merlin/systems +refs/pull/112/*:refs/remotes/origin/pr/112/* # timeout=10
 > git rev-parse 3b85d25ea82d561a3edecb64c553bb7c3e72bbad^{commit} # timeout=10
Checking out Revision 3b85d25ea82d561a3edecb64c553bb7c3e72bbad (detached)
 > git config core.sparsecheckout # timeout=10
 > git checkout -f 3b85d25ea82d561a3edecb64c553bb7c3e72bbad # timeout=10
Commit message: "fix output names"
 > git rev-list --no-walk bb98249cfd0f00b4b5ce8e0b130aca93b3834121 # timeout=10
[merlin_systems] $ /bin/bash /tmp/jenkins4224079119260067482.sh
============================= test session starts ==============================
platform linux -- Python 3.8.10, pytest-7.1.2, pluggy-1.0.0
rootdir: /var/jenkins_home/workspace/merlin_systems/systems, configfile: pyproject.toml
plugins: anyio-3.5.0, xdist-2.5.0, forked-1.4.0, cov-3.0.0
collected 18 items / 1 skipped
Closing due to #117.
Currently, the 02-Deploying-multi-stage-RecSys-with-Merlin-Systems.ipynb notebook is broken because the ranking model's output name does not match between the config files and the saved model.
We get the following error:
ValueError: Missing columns ['output_1'] found in operatorSubsetColumns during compute_input_schema.
This can be fixed by setting the proper output name, click/click/binary_classification_task, in the following lines:
However, this is not enough. Triton also complains about the output names and therefore cannot load the 5_predicttensorflow model.
This PR proposes a solution to that issue.
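For illustration, the kind of mismatch described above can be sketched in plain Python. This is only a minimal sketch and not the Merlin Systems implementation: `find_missing_columns` is a hypothetical helper, and the two name lists are assumed values taken from the error message and the description above.

```python
from typing import Iterable, List


def find_missing_columns(expected: Iterable[str], available: Iterable[str]) -> List[str]:
    """Return the expected column names that are absent from `available`, in order."""
    available_set = set(available)
    return [name for name in expected if name not in available_set]


# Assumed values for illustration only: the config asks for "output_1",
# while the saved ranking model exposes a task-specific output name.
config_outputs = ["output_1"]
saved_model_outputs = ["click/click/binary_classification_task"]

missing = find_missing_columns(config_outputs, saved_model_outputs)
if missing:
    print(f"Missing columns {missing}: config and saved model disagree on output names")
```

A check like this, run before exporting the ensemble, would surface the disagreement earlier than the `compute_input_schema` failure does.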