First attempt at a parametrized JobCreate #740

javiermtorres · 2025-01-24T17:20:33Z

What's changing

The JobCreate schema is changed to include a separate specific job_config. The openapi produced includes a oneOf constraint:

      "JobCreate": {
        "properties": {
          "name": {
            "type": "string",
            "title": "Name"
          },
[...]
          "job_config": {
            "oneOf": [
              {
                "$ref": "#/components/schemas/JobEvalConfig"
              },
              {
                "$ref": "#/components/schemas/JobEvalLiteConfig"
              },
              {
                "$ref": "#/components/schemas/JobInferenceConfig"
              },
              {
                "$ref": "#/components/schemas/JobAnnotateConfig"
              }
            ],
            "title": "Job Config",
            "discriminator": {
              "propertyName": "job_type",
              "mapping": {
                "annotate": "#/components/schemas/JobAnnotateConfig",
                "eval_lite": "#/components/schemas/JobEvalLiteConfig",
                "evaluate": "#/components/schemas/JobEvalConfig",
                "inference": "#/components/schemas/JobInferenceConfig"
              }
            }
          }
        },

The jobs and experiments services are changed accordingly.

Closes #706

How to test it

Tests should run correctly.

Additional notes for reviewers

N/A

I already...

Tested the changes in a working environment to ensure they work as expected
Added some tests for any new functionality
Updated the documentation (both comments in code and product documentation under /docs)
Checked if a (backend) DB migration step was required and included it if required
- No DB migration needed

njbrake

Looking good so far! Made a code suggestion but looks like a very logical refactor. My only question would be whether you plan on addressing the custom logic of _get_job_params in this PR. If you don't plan on addressing it here, can you make a separate issue to track elevating that out of the service layer?

lumigator/python/mzai/backend/backend/api/routes/jobs.py

lumigator/python/mzai/backend/backend/services/jobs.py

javiermtorres · 2025-01-27T20:07:11Z

The SDK needs to be updated. I have checked the backend unit and integration tests locally and they seem to work.

javiermtorres · 2025-01-29T12:11:38Z

The SDK and notebook tests have been updated. @veekaybee @aittalam I've changed the code of the notebook slightly. One important difference is that I have removed the model param in the eval lite job. AFAICT, it's not needed there. The notebook takes it from the initial model spec in the notebook and not from the output of the summarization job. Since the output is a csv, it didn't make sense to put the model there, but I'll check the results metadata.

njbrake

My concern is that this PR is dropping support for the JobType.EVALUATION, which is needed to support the current frontend design. I may misunderstand the code. Other than that, only minor comments. Thanks for the work on this! (Let me know about JobType.EVALUATION and then I'll approve once that's worked out).

lumigator/backend/backend/api/routes/jobs.py

lumigator/backend/backend/services/jobs.py

lumigator/backend/backend/api/routes/jobs.py

ividal · 2025-02-05T19:20:28Z

Thanks for this! One note, @javiermtorres this PR should still be in sync with the UI and keep an eye on how it interacts with /experiments.

ividal

Just a note that uploading a file w/o ground-truth and then clicking on "generate ground-truth" generates a "job not found error".

See log:
2025-02-10 - PR740.log

lumigator/backend/backend/api/routes/jobs.py

…cords

ividal

Tested #847 in the context of reviewing #740 . I've only looked at the backend code (740), not the frontend code. But on the testing front, I did a demo:

uploading a dataset with and w/o gt
annotating one w/o gt
launching an experiment with both local and API-based models

Overall, works as expected (🥳 ). There are a number of smaller issues, but let's get 847 in here and then 740 into main - and iterate on smaller separate issues :)

…cords

* remove redundant folders * refactor: fix imports * refactor: change folder structure * style: linting * cleanup * First attempt at a parametrized JobCreate * Replace templates with pydantic models * Adapt SDK and SDK tests * Fix sdk unit tests * Fix notebook tests * Fix tests * Fix job definition in workflows * Fix job unit test * Start a default workflow for experiments * Rebase to main * Align with routes in main * Move to experiments new endpoint * Streamline new experiments api * remove redundant folders * refactor: fix imports * refactor: change folder structure * style: linting * cleanup * WIP: migrate to new workflow apis * refactor some more stuff * use the new datastructure, hide runtime * refactor: cleanup Job vs Experiment in ExperimentDetails mess * style: linting * fixing things * style: linting * current state * results working * style: linting * current state * after merge fixes * checkpoint * things working ish * formatting * style: linting --------- Co-authored-by: Javier Torres <javier@mozilla.ai>

ividal

With #847 in here and checks green, we have ourselves a usable new experiments workflow :)

THANK YOU for the effort @javiermtorres @khaledosman

github-actions bot added backend api Changes which impact API/presentation layer schemas Changes to schemas (which may be public facing) labels Jan 24, 2025

javiermtorres force-pushed the javiermtorres/issue-706-organize-creation-records branch from cf6d9aa to d8ae072 Compare January 24, 2025 17:28

javiermtorres requested review from aittalam and njbrake January 24, 2025 17:28

njbrake reviewed Jan 24, 2025

View reviewed changes

lumigator/python/mzai/backend/backend/api/routes/jobs.py Outdated Show resolved Hide resolved

lumigator/python/mzai/backend/backend/api/routes/jobs.py Outdated Show resolved Hide resolved

lumigator/python/mzai/backend/backend/services/jobs.py Outdated Show resolved Hide resolved

github-actions bot added the sdk label Jan 28, 2025

javiermtorres force-pushed the javiermtorres/issue-706-organize-creation-records branch 2 times, most recently from 72585b0 to a17f040 Compare January 28, 2025 15:30

javiermtorres force-pushed the javiermtorres/issue-706-organize-creation-records branch 2 times, most recently from 0c8ef33 to 4ca7e65 Compare January 29, 2025 15:22

javiermtorres marked this pull request as ready for review January 29, 2025 15:55

javiermtorres force-pushed the javiermtorres/issue-706-organize-creation-records branch 2 times, most recently from 9159569 to e6f13f3 Compare January 31, 2025 11:43

njbrake mentioned this pull request Jan 31, 2025

Mlflow implementation of Tracking Interface #768

Merged

4 tasks

javiermtorres force-pushed the javiermtorres/issue-706-organize-creation-records branch 2 times, most recently from c9357b2 to 293ae98 Compare February 4, 2025 16:20

javiermtorres requested review from njbrake and veekaybee February 4, 2025 19:22

njbrake reviewed Feb 4, 2025

View reviewed changes

ividal requested changes Feb 5, 2025

View reviewed changes

lumigator/backend/backend/api/routes/jobs.py Outdated Show resolved Hide resolved

ividal requested changes Feb 10, 2025

View reviewed changes

lumigator/backend/backend/api/routes/jobs.py Outdated Show resolved Hide resolved

njbrake changed the title ~~First attempt at a parametrized JobCreate~~ Jobs return a standardized and flexible output Feb 10, 2025

njbrake changed the title ~~Jobs return a standardized and flexible output~~ First attempt at a parametrized JobCreate Feb 10, 2025

javiermtorres force-pushed the javiermtorres/issue-706-organize-creation-records branch 2 times, most recently from bcad25e to ce76b2b Compare February 10, 2025 18:31

javiermtorres added 8 commits February 12, 2025 09:59

Fix tests

93c10ec

Fix job definition in workflows

33ba0dd

Fix job unit test

47da6a4

Start a default workflow for experiments

16ed6d7

Rebase to main

ae5ff85

Align with routes in main

d7c37cb

Move to experiments new endpoint

5d89417

Streamline new experiments api

02c0b79

javiermtorres force-pushed the javiermtorres/issue-706-organize-creation-records branch from 9113f26 to 02c0b79 Compare February 12, 2025 09:19

javiermtorres added 8 commits February 12, 2025 16:14

Add dataset/samples to experiment, test background tasks

af77fd2

Factor out experiment formatting

64a5e2d

Add lumigator specific tag

dff173a

Add job timings

148ca0a

Merge branch 'main' into javiermtorres/issue-706-organize-creation-re…

5235c4b

…cords

Merge branch 'main' into javiermtorres/issue-706-organize-creation-re…

f33c120

…cords

Fix missing default prompt for external APIs

3169093

Fix notebook

dea433d

ividal self-requested a review February 14, 2025 14:24

ividal reviewed Feb 14, 2025

View reviewed changes

javiermtorres and others added 2 commits February 14, 2025 15:42

Merge branch 'main' into javiermtorres/issue-706-organize-creation-re…

a93988e

…cords

github-actions bot added the frontend label Feb 14, 2025

ividal self-requested a review February 14, 2025 15:13

ividal approved these changes Feb 14, 2025

View reviewed changes

javiermtorres enabled auto-merge (squash) February 14, 2025 15:27

javiermtorres merged commit 4cf82e2 into main Feb 14, 2025
17 checks passed

javiermtorres deleted the javiermtorres/issue-706-organize-creation-records branch February 14, 2025 15:52

This was referenced Feb 14, 2025

[BUG]: After launching an experiment GET /jobs returns a 422, although all jobs are successful #875

Closed

Allow the user to chain multiple (model, parameter) pairs under one experiment #482

Closed

Use pydantic to define configs #96

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First attempt at a parametrized JobCreate #740

First attempt at a parametrized JobCreate #740

javiermtorres commented Jan 24, 2025 •

edited

Loading

njbrake left a comment

javiermtorres commented Jan 27, 2025

javiermtorres commented Jan 29, 2025

njbrake left a comment

ividal commented Feb 5, 2025

ividal left a comment

ividal left a comment •

edited

Loading

ividal left a comment

First attempt at a parametrized JobCreate #740

First attempt at a parametrized JobCreate #740

Conversation

javiermtorres commented Jan 24, 2025 • edited Loading

What's changing

How to test it

Additional notes for reviewers

I already...

njbrake left a comment

Choose a reason for hiding this comment

javiermtorres commented Jan 27, 2025

javiermtorres commented Jan 29, 2025

njbrake left a comment

Choose a reason for hiding this comment

ividal commented Feb 5, 2025

ividal left a comment

Choose a reason for hiding this comment

ividal left a comment • edited Loading

Choose a reason for hiding this comment

ividal left a comment

Choose a reason for hiding this comment

javiermtorres commented Jan 24, 2025 •

edited

Loading

ividal left a comment •

edited

Loading