You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
which has a broken format and will cause unexpected behavior when creating the databricks job.
also, submission_params["libraries"][0]["jar"] is not safe to use. If the first dependency library value is not ['jar']: FEATHR_FILL, but something else like pypi or any custom maven dependency users may set, then "libraries"][0]["jar"] this logic will break the format too.
Tracking information
No response
Code to reproduce bug
No response
What component(s) does this bug affect?
Python Client: This is the client users use to interact with most of our API. Mostly written in Python.
Computation Engine: The computation engine that execute the actual feature join and generation work. Mostly in Scala and Spark.
Feature Registry API: The frontend API layer supports SQL, Purview(Atlas) as storage. The API layer is in Python(FAST API)
Feature Registry Web UI: The Web UI for feature registry. Written in React
The text was updated successfully, but these errors were encountered:
Specifiy 'maven': {'coordinates': 'com.github.everit-org.json-schema:org.everit.json.schema:1.9.1', 'repo': 'https://repository.mulesoft.org/nexus/content/repositories/public/'} is no longer needed on notebook side if user is using maven jar with verson greater than v0.10.4-rc5. everit.json.schema:1.9.1 is not available on maven and current cloud sparks can not pull packages from non maven central repos. Thus PR #1043 removes everit.json.schema:1.9.1' deps.
By default, python client uses same version of maven jar, so to get the fix for this issue, either
Wait till main branch pyhton client version is bumped to v0.10.4-rc5, this should happen soon if no new regressions are found on v0.10.4-rc5
Add a cell to force python client use a newer version maven jar os.environ['MAVEN_ARTIFACT_VERSION'] = "0.10.4-rc5" in notebook.
Willingness to contribute
No. I cannot contribute a bug fix at this time.
Feathr version
0.10.4-rc1
System information
Describe the problem
in Databricks job launcher class, adding library dependencies to databricks job api config via index shouldn't be used:
since this will break the job api config.
E.g.
in our feature embedding example, we add pypi dependency of transformer package as follows:
The new codes that are adding maven dependencies to the config will modify the config to be:
which has a broken format and will cause unexpected behavior when creating the databricks job.
also,
submission_params["libraries"][0]["jar"]
is not safe to use. If the first dependency library value is not['jar']: FEATHR_FILL
, but something else likepypi
or any custommaven
dependency users may set, then"libraries"][0]["jar"]
this logic will break the format too.Tracking information
No response
Code to reproduce bug
No response
What component(s) does this bug affect?
Python Client
: This is the client users use to interact with most of our API. Mostly written in Python.Computation Engine
: The computation engine that execute the actual feature join and generation work. Mostly in Scala and Spark.Feature Registry API
: The frontend API layer supports SQL, Purview(Atlas) as storage. The API layer is in Python(FAST API)Feature Registry Web UI
: The Web UI for feature registry. Written in ReactThe text was updated successfully, but these errors were encountered: