chore: remove custom searcher and DSAT #9949

azhou-determined · 2024-09-17T20:23:11Z

Ticket

Description

removes custom searcher and DSAT

Test Plan

Checklist

Changes have been manually QA'd
New features have been approved by the corresponding PM
User-facing API changes have the "User-facing API Change" label
Release notes have been added as a separate file under docs/release-notes/
See Release Note for details.
Licenses have been included for new code which was copied and/or modified from any external code

determined-ci · 2024-09-17T20:23:24Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-17T20:23:28Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

codecov · 2024-09-17T20:25:05Z

Codecov Report

Attention: Patch coverage is 27.27273% with 8 lines in your changes missing coverage. Please review.

Project coverage is 53.88%. Comparing base (f758303) to head (db5d855).
Report is 1 commits behind head on searcher-context-removal.

Files with missing lines	Patch %	Lines
master/pkg/schemas/expconf/searcher_config.go	0.00%	5 Missing ⚠️
...ter/pkg/schemas/expconf/zgen_searcher_config_v0.go	0.00%	2 Missing ⚠️
master/pkg/searcher/searcher.go	50.00%	1 Missing ⚠️

Additional details and impacted files

@@                     Coverage Diff                      @@
##           searcher-context-removal    #9949      +/-   ##
============================================================
- Coverage                     54.50%   53.88%   -0.62%     
============================================================
  Files                          1255     1240      -15     
  Lines                        156733   153498    -3235     
  Branches                       3601     3599       -2     
============================================================
- Hits                          85424    82711    -2713     
+ Misses                        71176    70654     -522     
  Partials                        133      133

Flag	Coverage Δ
backend	`45.19% <11.11%> (+0.02%)`	⬆️
harness	`71.00% <100.00%> (-1.75%)`	⬇️
web	`54.27% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
harness/determined/common/api/errors.py	`76.36% <100.00%> (ø)`
...s/determined/pytorch/deepspeed/_deepspeed_trial.py	`79.88% <ø> (+1.28%)`	⬆️
master/internal/api_experiment.go	`58.35% <ø> (+1.55%)`	⬆️
master/internal/experiment.go	`32.96% <ø> (+2.83%)`	⬆️
master/internal/experiment/authz_basic_impl.go	`9.37% <ø> (+0.55%)`	⬆️
master/internal/experiment/authz_permissive.go	`2.32% <ø> (+0.15%)`	⬆️
master/internal/experiment/authz_rbac.go	`0.51% <ø> (+<0.01%)`	⬆️
master/pkg/schemas/zgen_schemas.go	`1.11% <ø> (ø)`
master/pkg/searcher/operations.go	`11.53% <ø> (-0.50%)`	⬇️
master/pkg/searcher/search_method.go	`76.19% <ø> (-2.08%)`	⬇️
... and 3 more

... and 7 files with indirect coverage changes

docs/get-started/example-solutions/_index.rst

master/pkg/searcher/operations.go

rb-determined-ai

very nice. -11k!

determined-ci · 2024-09-17T22:59:39Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

jgongd · 2024-09-18T16:13:47Z

For my understanding, is deep speed also part of the searcher context as well?
Answer my own question: yes, deep speed imports searcher

from determined import searcher

jgongd

Nice work!

amandavialva01 · 2024-09-18T17:51:02Z

master/pkg/searcher/custom_search.go

-	}
-)
-
-func newCustomSearch(config expconf.CustomConfig) SearchMethod {


I believe the expconf.CustomConfig type still exists, do we want to remove this as well?

This type only seems to be used here in our experiment integration tests

good point, removed it and the calling piece in that intg test

amandavialva01 · 2024-09-18T18:25:35Z

master/pkg/searcher/search_method.go

@@ -85,8 +76,6 @@ func NewSearchMethod(c expconf.SearcherConfig) SearchMethod {
 		return newAsyncHalvingSearch(*c.RawAsyncHalvingConfig, c.SmallerIsBetter())
 	case c.RawAdaptiveASHAConfig != nil:
 		return newAdaptiveASHASearch(*c.RawAdaptiveASHAConfig, c.SmallerIsBetter())
-	case c.RawCustomConfig != nil:
-		return newCustomSearch(*c.RawCustomConfig)


Should we also remove the RawCustomConfig type from SearcherConfigV0?

good idea, but apparently we can't 'cause of the case where master tries to restore a pre-upgrade custom search experiment. so i've kept the config but treat it like how we treat the other removed searcher configs.

amandavialva01

Backend looks great! Left a few comments on a couple of structs that can potentially be removed

determined-ci · 2024-09-18T21:02:26Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-18T21:04:03Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-18T21:17:52Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-18T21:21:52Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-20T00:17:28Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-20T00:34:30Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-20T00:50:15Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

azhou-determined · 2024-09-20T15:58:12Z

There's an import of ProjectSpec from the deleted file webui/react/src/services/stream/wire.ts in webui/react/src/stores/projects.tsx

sorry, i'm on a new laptop and having some proto/bindings build issues. those files were added back 😄

determined-ci · 2024-09-20T21:36:26Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-20T21:44:57Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-20T21:59:18Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-20T22:24:15Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-23T16:48:57Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-23T17:24:45Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-24T00:12:05Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-24T12:46:03Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-24T13:22:10Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-24T15:46:34Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

determined-ci · 2024-09-24T15:47:29Z

Docsite preview being generated for this PR.
You can (eventually) find the generated docsite here.

rb-determined-ai

siiiick

delete custom searcher (and DSAT)

Eliminate a couple dozen yaml files for controlling the no_op fixture, each of which was tweaked a half dozen ways by different tests. There were 124 usages of the no_op fixture, and it was very hard to know what any particular test was trying to accomplish. All of these (except 6 from the custom searcher tests, which are removed in an upcoming feature branch) have been re-written to use a new python module for creating noop experiments with obvious behaviors. By my measurements, a combined total of 34 minutes of effective sleeping were removed from the individual tests of our test suite. The biggest wins were from cases where the test author probably did not realize how long some of the no_op experiments were configured to run for. Most tests were faithfully preserved, with the following exceptions: - cluster/test_exp_continue:test_continue_config_file_and_args_cli - converted to a unit test - cluster/test_exp_continue:test_continue_config_file_and_args_cli - deleted; with new unit test, adds nothing to test_continue_batches - cluster/test_exp_continue:test_continue_fixing_broken_config - deleted; adds nothing to test_continue_batches - cluster/test_exp_continue:test_continue_workloads_searcher - deleted since it was really a wlsq test - cluster/test_exp_continue:test_continue_pytorch_completed_searcher - deleted since it was really a pytorch trainer test - cluster/test_resource_manager:test_allocation_resources_incremental_release - the test has not been working, I think at least since we defaulted to using det.launch.torch_distributed; the non-chief container was not exiting until the chief exited - experiment/test_core:test_trial_logs - deleted due to cluster/test_logging - experiment/test_core:test_log_null_bytes - deleted, but added null bytes to test_logging.py - experiment/test_noop:test_noop_nan_validations - combined with test_noop_pause - experiment/test_noop:test_cancel_ten_experiments - this test is dumb, also it was pathologically slow - experiment/test_noop:test_cancel_ten_paused_experiments - this test is dumb - experiment/test_noop:test_startup_hook - test_logging tests startup hooks already - run/test_api:test_run_pause_and_resume_filter_skip_empty - renamed to test_run_in_search_not_pausable_or_resumable to match its intended purpose, also simplify it, also make it stricter, also stop leaking adaptive searches onto the cluster after passing chore: remove custom searcher and DSAT (#9949) delete custom searcher (and DSAT) chore: refactor searcher operations out of master side searchers code gen

delete custom searcher (and DSAT)

Eliminate a couple dozen yaml files for controlling the no_op fixture, each of which was tweaked a half dozen ways by different tests. There were 124 usages of the no_op fixture, and it was very hard to know what any particular test was trying to accomplish. All of these (except 6 from the custom searcher tests, which are removed in an upcoming feature branch) have been re-written to use a new python module for creating noop experiments with obvious behaviors. By my measurements, a combined total of 34 minutes of effective sleeping were removed from the individual tests of our test suite. The biggest wins were from cases where the test author probably did not realize how long some of the no_op experiments were configured to run for. Most tests were faithfully preserved, with the following exceptions: - cluster/test_exp_continue:test_continue_config_file_and_args_cli - converted to a unit test - cluster/test_exp_continue:test_continue_config_file_and_args_cli - deleted; with new unit test, adds nothing to test_continue_batches - cluster/test_exp_continue:test_continue_fixing_broken_config - deleted; adds nothing to test_continue_batches - cluster/test_exp_continue:test_continue_workloads_searcher - deleted since it was really a wlsq test - cluster/test_exp_continue:test_continue_pytorch_completed_searcher - deleted since it was really a pytorch trainer test - cluster/test_resource_manager:test_allocation_resources_incremental_release - the test has not been working, I think at least since we defaulted to using det.launch.torch_distributed; the non-chief container was not exiting until the chief exited - experiment/test_core:test_trial_logs - deleted due to cluster/test_logging - experiment/test_core:test_log_null_bytes - deleted, but added null bytes to test_logging.py - experiment/test_noop:test_noop_nan_validations - combined with test_noop_pause - experiment/test_noop:test_cancel_ten_experiments - this test is dumb, also it was pathologically slow - experiment/test_noop:test_cancel_ten_paused_experiments - this test is dumb - experiment/test_noop:test_startup_hook - test_logging tests startup hooks already - run/test_api:test_run_pause_and_resume_filter_skip_empty - renamed to test_run_in_search_not_pausable_or_resumable to match its intended purpose, also simplify it, also make it stricter, also stop leaking adaptive searches onto the cluster after passing chore: remove custom searcher and DSAT (#9949) delete custom searcher (and DSAT) chore: refactor searcher operations out of master side searchers code gen

delete custom searcher (and DSAT)

azhou-determined assigned rb-determined-ai Sep 17, 2024

azhou-determined requested review from a team as code owners September 17, 2024 20:23

azhou-determined requested review from jgongd and amandavialva01 and removed request for a team September 17, 2024 20:23

cla-bot bot added the cla-signed label Sep 17, 2024

determined-ci added the documentation Improvements or additions to documentation label Sep 17, 2024

determined-ci requested a review from a team September 17, 2024 20:23

rb-determined-ai reviewed Sep 17, 2024

View reviewed changes

docs/get-started/example-solutions/_index.rst Outdated Show resolved Hide resolved

rb-determined-ai reviewed Sep 17, 2024

View reviewed changes

master/pkg/searcher/operations.go Show resolved Hide resolved

rb-determined-ai reviewed Sep 17, 2024

View reviewed changes

determined-ci requested a review from a team September 17, 2024 22:59

jgongd approved these changes Sep 18, 2024

View reviewed changes

amandavialva01 reviewed Sep 18, 2024

View reviewed changes

azhou-determined force-pushed the remove-custom-searcher branch from 5d06c6c to 4865e03 Compare September 20, 2024 21:44

azhou-determined force-pushed the remove-custom-searcher branch from edc1848 to c76a5ca Compare September 23, 2024 17:24

chore: remove custom searcher and DSAT

db5d855

azhou-determined force-pushed the remove-custom-searcher branch from a32c75c to db5d855 Compare September 24, 2024 15:47

rb-determined-ai approved these changes Sep 24, 2024

View reviewed changes

azhou-determined merged commit 16779e7 into searcher-context-removal Sep 24, 2024
81 of 94 checks passed

azhou-determined deleted the remove-custom-searcher branch September 24, 2024 19:04

rb-determined-ai pushed a commit that referenced this pull request Oct 1, 2024

chore: remove custom searcher and DSAT (#9949)

73c99ed

delete custom searcher (and DSAT)

rb-determined-ai pushed a commit that referenced this pull request Oct 22, 2024

chore: remove custom searcher and DSAT (#9949)

44cf95a

delete custom searcher (and DSAT)

azhou-determined added a commit that referenced this pull request Oct 22, 2024

chore: remove custom searcher and DSAT (#9949)

23ad428

delete custom searcher (and DSAT)

rb-determined-ai pushed a commit that referenced this pull request Oct 22, 2024

chore: remove custom searcher and DSAT (#9949)

2b63e67

delete custom searcher (and DSAT)

rb-determined-ai pushed a commit that referenced this pull request Oct 24, 2024

chore: remove custom searcher and DSAT (#9949)

34bccb0

delete custom searcher (and DSAT)

rb-determined-ai pushed a commit that referenced this pull request Oct 24, 2024

chore: remove custom searcher and DSAT (#9949)

093af65

delete custom searcher (and DSAT)

azhou-determined mentioned this pull request Oct 25, 2024

feat: remove searcher context from harness and master [MD-498] #10131

Merged

5 tasks

azhou-determined added a commit that referenced this pull request Oct 25, 2024

chore: remove custom searcher and DSAT (#9949)

a761ab4

delete custom searcher (and DSAT)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: remove custom searcher and DSAT #9949

chore: remove custom searcher and DSAT #9949

azhou-determined commented Sep 17, 2024 •

edited by jira bot

Loading

determined-ci commented Sep 17, 2024

determined-ci commented Sep 17, 2024

codecov bot commented Sep 17, 2024 •

edited

Loading

rb-determined-ai left a comment

determined-ci commented Sep 17, 2024

jgongd commented Sep 18, 2024 •

edited

Loading

jgongd left a comment

amandavialva01 Sep 18, 2024

amandavialva01 Sep 18, 2024

azhou-determined Sep 18, 2024

amandavialva01 Sep 18, 2024

azhou-determined Sep 18, 2024

amandavialva01 left a comment

determined-ci commented Sep 18, 2024

determined-ci commented Sep 18, 2024

determined-ci commented Sep 18, 2024

determined-ci commented Sep 18, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

azhou-determined commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 23, 2024

determined-ci commented Sep 23, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

rb-determined-ai left a comment

chore: remove custom searcher and DSAT #9949

chore: remove custom searcher and DSAT #9949

Conversation

azhou-determined commented Sep 17, 2024 • edited by jira bot Loading

Ticket

Description

Test Plan

Checklist

determined-ci commented Sep 17, 2024

determined-ci commented Sep 17, 2024

codecov bot commented Sep 17, 2024 • edited Loading

Codecov Report

rb-determined-ai left a comment

Choose a reason for hiding this comment

determined-ci commented Sep 17, 2024

jgongd commented Sep 18, 2024 • edited Loading

jgongd left a comment

Choose a reason for hiding this comment

amandavialva01 Sep 18, 2024

Choose a reason for hiding this comment

amandavialva01 Sep 18, 2024

Choose a reason for hiding this comment

azhou-determined Sep 18, 2024

Choose a reason for hiding this comment

amandavialva01 Sep 18, 2024

Choose a reason for hiding this comment

azhou-determined Sep 18, 2024

Choose a reason for hiding this comment

amandavialva01 left a comment

Choose a reason for hiding this comment

determined-ci commented Sep 18, 2024

determined-ci commented Sep 18, 2024

determined-ci commented Sep 18, 2024

determined-ci commented Sep 18, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

azhou-determined commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 20, 2024

determined-ci commented Sep 23, 2024

determined-ci commented Sep 23, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

determined-ci commented Sep 24, 2024

rb-determined-ai left a comment

Choose a reason for hiding this comment

azhou-determined commented Sep 17, 2024 •

edited by jira bot

Loading

codecov bot commented Sep 17, 2024 •

edited

Loading

jgongd commented Sep 18, 2024 •

edited

Loading