
Remove dev comments + reset logging levels #13

Closed · wants to merge 118 commits

Conversation


@glennmoy glennmoy commented Oct 3, 2023

No description provided.

ArturNiederfahrenhorst and others added 30 commits May 16, 2023 18:14
Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>
* Update version in dask on ray guide for 2.5.0 release (ray-project#35458)

As part of the 2.5.0 release, the dask on ray version in the guide needs to be updated.

---------

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

* [Dask on Ray] Attempt to fix line in dask doc (ray-project#35479)

ray-project#35458 introduced an issue with the table not being displayed.

---------

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

---------

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>
…test (ray-project#35465) (ray-project#35489)

The release tests failed due to an incompatible urllib3 version. Pin urllib3 < 1.27 to fix the ml_user_ray_lightning_user_test_(master|latest).aws release test.

Signed-off-by: woshiyyya <yunxuanx@anyscale.com>
* [serve] Shutdown http proxy state (ray-project#35395)

Shutdown http proxy state so that it won't run anything in its update loop once a shutdown signal is received.
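The pattern described above can be sketched as follows. This is a minimal toy model with hypothetical names, not the actual Serve implementation: an update loop that checks a shutdown event each iteration and stops doing work once the signal is set.

```python
import asyncio

class ProxyState:
    """Toy model (hypothetical name) of an update loop that honors a shutdown signal."""

    def __init__(self):
        self._shutdown = asyncio.Event()
        self.updates = 0

    def shutdown(self):
        # Once this is set, the update loop does no further work.
        self._shutdown.set()

    async def update_loop(self):
        while not self._shutdown.is_set():
            self.updates += 1          # reconcile proxy state here
            await asyncio.sleep(0)     # yield control back to the event loop

async def main():
    state = ProxyState()
    task = asyncio.create_task(state.update_loop())
    await asyncio.sleep(0)  # let the loop run at least one iteration
    state.shutdown()
    await task              # loop exits promptly once the signal is set
    return state.updates

updates = asyncio.run(main())
```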

Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>

* [serve] Remove print statement + fix lint (ray-project#35439)

Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>

---------

Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
…ect#35515)

Missing import.

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
… (ray-project#35488)

- Organize HuggingFace integrations together.
   - Additional HuggingFace integrations should have a logical place to be added.
- Make it simple for users to import and use integrations.
   - Imports should not be excessively long
   - Naming should be intuitive

Signed-off-by: Matthew Deng <matt@anyscale.com>
…roject#35399) (ray-project#35464)

- Support handle.options(multiplexed_model_id="")
- Http proxy to extract model id on the fly
- Choose correct replica based on information.
- nit: Move handle metrics pusher to router.

## Why are these changes needed?
After ray-project#35123, requests in Ray are serialized, so we should be able to retry failed Redis requests.

This PR refactors the RedisContext a bit by removing the CallbackItem and introducing the RequestContext. Inside the RequestContext, a failed request is retried automatically.

If it still fails in the end, the process will just crash.

## Related issue number
ray-project#34014
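The retry behavior described above can be sketched like this. All names here (`RequestContext`, `flaky_send`) are hypothetical stand-ins, not the real C++ implementation: because the request is serialized up front, it can simply be re-sent until retries are exhausted, at which point the error surfaces.

```python
class RequestContext:
    """Toy sketch (hypothetical names): retry a serialized request automatically."""

    def __init__(self, send, max_retries=3):
        self._send = send              # callable that issues the request
        self._max_retries = max_retries

    def run(self, request):
        for attempt in range(self._max_retries + 1):
            try:
                return self._send(request)
            except ConnectionError:
                if attempt == self._max_retries:
                    raise  # still failing in the end: surface the failure
                # The request is serialized, so re-sending it is safe.

calls = {"n": 0}

def flaky_send(req):
    # Fails twice, then succeeds -- standing in for a briefly unavailable Redis.
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("redis unavailable")
    return f"OK:{req}"

result = RequestContext(flaky_send).run("GET key")
```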
…ay-project#34726) (ray-project#35542)

* [core] Graceful handling of returning bundles when node is removed (ray-project#34726)

When a node dies, we do a series of cleanups for it. There were failures where the node had already been removed from the internal data structures (i.e. ClusterResourceManager.nodes_) by the time we tried to clean up bundles from it.
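The graceful-handling idea can be sketched in a few lines. This is a toy model, not the real ClusterResourceManager: the cleanup path checks whether the node still exists and silently ignores the request if it was already removed, instead of failing.

```python
# Toy model: returning a placement-group bundle to a node that may already
# have been removed from the cluster's internal data structures.
nodes = {"node-1": {"bundle-a"}}   # node id -> bundles currently on it

def return_bundle(node_id, bundle_id):
    node = nodes.get(node_id)
    if node is None:
        # The node was already cleaned up; ignore instead of crashing.
        return False
    node.discard(bundle_id)
    return True

returned = return_bundle("node-1", "bundle-a")
ignored = return_bundle("node-gone", "bundle-b")
```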

* lint

Signed-off-by: rickyyx <rickyx@anyscale.com>

---------

Signed-off-by: rickyyx <rickyx@anyscale.com>
…ay-project#35525)

From ray-project#35143, we found a map() case that is not covered in our numpy support test cases.
…5414) (ray-project#35483)

* [Data/Train] Fix ipython representation (ray-project#35414)

Fixes a bug where `repr` would fail in the IPython shell.

---------

Signed-off-by: amogkam <amogkamsetty@yahoo.com>

* format

Signed-off-by: amogkam <amogkamsetty@yahoo.com>

---------

Signed-off-by: amogkam <amogkamsetty@yahoo.com>
…ject#35552)

Looking at test_torch_predictor: it is a suite of 35 test cases and takes roughly a minute to run.
Most of the time these tests finish fine, which is why the suite is merely flaky rather than consistently failing.
Make it a medium test to de-flake it.

Signed-off-by: Jun Gong <jungong@anyscale.com>
Co-authored-by: Jun Gong <jungong@anyscale.com>
…35534) (ray-project#35554)

test_multiprocessing_client_mode is very flaky on Windows. This PR skips it on Windows.

Related issue number
Closes ray-project#35526

Signed-off-by: Archit Kulkarni <architkulkarni@users.noreply.github.com>
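Skipping a test on one platform is typically done with a skip decorator. The sketch below uses the stdlib `unittest.skipIf` rather than the pytest marker the Ray test suite actually uses, and the class/test names are hypothetical; the platform check is the same idea either way.

```python
import io
import sys
import unittest

class TestMultiprocessingClientMode(unittest.TestCase):
    # Skip on Windows only; everywhere else the test still runs.
    @unittest.skipIf(sys.platform == "win32", "flaky on Windows (ray-project#35526)")
    def test_client_mode(self):
        self.assertEqual(1 + 1, 2)

suite = unittest.TestLoader().loadTestsFromTestCase(TestMultiprocessingClientMode)
result = unittest.TextTestRunner(stream=io.StringIO()).run(suite)
```

A skipped test is still reported as a success, so CI stays green on Windows without losing coverage elsewhere.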
…zation. (ray-project#35494) (ray-project#35532)

When an async actor is used, we always increase the max recursion limit before posting the function to the event loop, because when there are lots of pending async tasks, Python thinks there is runaway recursion due to the large parallel callstacks (a consequence of the fibers used to implement async actors).

Running an async task has 3 steps:

1. Run a deserialization function in the event loop (ray/python/ray/_raylet.pyx, line 866 at bfec451: `args = core_worker.run_async_func_in_event_loop(`)
2. Increase the recursion limit (ray/python/ray/_raylet.pyx, line 831 at bfec451: `increase_recursion_limit()`)
3. Run the main function in the event loop.

The problem is that the limit is always increased "after" the object has been deserialized in the event loop. This means that while deserialization happens, the recursion limit is still low, which can raise the exception.

When there are lots of async tasks with high input-deserialization overhead, this can occur (we hit the max recursion error while deserializing the object, before the limit is increased). That is exactly what the 1:1 async-actor calls-with-args test does, and where the microbenchmark failed with a recursion error.

This PR fixes the issue by moving increase_recursion_limit inside run_async_func_in_event_loop, so that whenever we post a new async task, the recursion limit is checked and increased first.

I found the same issue when I developed a generator, and I verified this fixes the issue in this PR ray-project#35425 (comment).

I am not sure how to test this. @scv119 do you have the consistent repro that can run in unit tests? Or can you verify it using this branch as well?
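The ordering fix can be illustrated with a simplified sketch. This is not the real Cython code: `run_async_func_in_event_loop` here is a plain function call standing in for posting work to the event loop, and the doubling policy in `increase_recursion_limit` is an assumption for illustration. The point is only the ordering: the limit is raised before anything, including deserialization, runs.

```python
import sys

def increase_recursion_limit(factor=2):
    # Grow the interpreter's recursion limit so deep parallel callstacks
    # from many pending async tasks don't raise RecursionError.
    sys.setrecursionlimit(sys.getrecursionlimit() * factor)

def run_async_func_in_event_loop(func, *args):
    # The fix: raise the limit *here*, so it is in effect for every function
    # posted to the loop -- including argument deserialization -- instead of
    # only before the main task body runs.
    increase_recursion_limit()
    return func(*args)  # stand-in for posting `func` to the event loop

before = sys.getrecursionlimit()
run_async_func_in_event_loop(lambda: None)
after = sys.getrecursionlimit()
```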
…s to map functions"" (ray-project#35505) (ray-project#35527)

Reverts ray-project#35504

---------

Signed-off-by: amogkam <amogkamsetty@yahoo.com>
…y-project#35520) (ray-project#35570)

This change adds a dropdown explaining why Ray Serve is a good fit for LLM developers.

Link: https://anyscale-ray--35520.com.readthedocs.build/en/35520/serve/index.html#how-can-serve-help-me-as-a

Signed-off-by: Shreyas Krishnaswamy <shrekris@anyscale.com>
We should build the release jar on manylinux2014 to resolve incompatibilities. Refer to https://discuss.ray.io/t/java-glibc-issue-during-ray-init-call/10407.
Co-authored-by: Guyang Song <guyang.sgy@gmail.com>

Co-authored-by: Candy Lv <90018431+XiaodongLv@users.noreply.github.com>
Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>
…to be fetched in release tests and CI learning tests (ray-project#35588)

* [RLlib] Fit ES and ARS results dict to rest of RLlib, enable results to be fetched in release tests and CI learning tests (ray-project#35533)

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

* [RLlib] Fix ARS release test (ray-project#35608)

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

---------

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>
…ject#35637)

* [Data] Add batch inference object detection example (ray-project#35143)

Signed-off-by: Hao Chen <chenh1024@gmail.com>

* Fix object detection example test

Signed-off-by: Hao Chen <chenh1024@gmail.com>

---------

Signed-off-by: Hao Chen <chenh1024@gmail.com>
…ject#35291) (ray-project#35656)

This PR introduces TaskManager interfaces to enable streaming generator.
…r interface. (ray-project#35324) (ray-project#35682)

This is the second PR to support streaming generator.

The detailed design and API proposal can be found from https://docs.google.com/document/d/1hAASLe2sCoay23raqxqwJdSDiJWNMcNhlTwWJXsJOU4/edit#heading=h.w91y1fgnpu0m.
The Execution plan can be found from https://docs.google.com/document/d/1hAASLe2sCoay23raqxqwJdSDiJWNMcNhlTwWJXsJOU4/edit#heading=h.kxktymq5ihf7.
There will be 4 PRs to enable streaming generator for Ray Serve (phase 1):

1. Introduce cpp interfaces to handle intermediate task return ([1/N] Streaming Generator. Cpp interfaces and implementation, ray-project#35291)
2. Support core worker APIs + cython generator interface ([2/N] Streaming Generator. Support core worker APIs + cython generator interface, ray-project#35324) <-- this PR
3. E2e integration ([3/N] Streaming Generator. E2e integration, ray-project#35325)
4. Support async actors

This PR implements the Cython generator interface that users can use to obtain the next available object reference.
---------

Signed-off-by: SangBin Cho <rkooo567@gmail.com>
…ray-project#35673)

## Why are these changes needed?

In the old resource broadcasting, a sequence number (seq) is used; when the seq is stale, the handler returns immediately, and this is where the leak could happen: never replying to the gRPC request eventually leaks the resource.

In the ray syncer we no longer have this, but in a bad setup a misbehaving GCS might still talk to this raylet, since there are no guards right now, and send node info to this node.

That is how the leak gets triggered.

This fix does two things to protect the code:

- If it's syncer based, it'll just reject the request.
- Also fixed the bug in the old code path.

## Related issue number

ray-project#35632
ray-project#35310
ray-project#35683) (ray-project#35718)

In order to guarantee that we put ActorTaskSpecTable before ActorTable, we should put ActorTable inside the ActorTaskSpecTable put callback. Otherwise, Redis may receive ActorTable put before ActorTaskSpecTable put. If we crash in the middle, we may end up with actor data inside ActorTable but not ActorTaskSpecTable.
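The callback-nesting trick above can be sketched with a toy asynchronous store. The `put` function and table names below are simplified stand-ins for the real GCS/Redis code: issuing the second put only from the first put's completion callback guarantees the writes reach the store in order.

```python
# Toy key-value store standing in for Redis; puts complete via callbacks.
store = {}
order = []

def put(table, key, value, callback):
    store[(table, key)] = value
    order.append(table)
    callback()

def register_actor(actor_id, task_spec, actor_data):
    # Put ActorTaskSpecTable first, and only issue the ActorTable put from
    # its completion callback, so the spec always lands before the actor row.
    def on_spec_done():
        put("ActorTable", actor_id, actor_data, lambda: None)
    put("ActorTaskSpecTable", actor_id, task_spec, on_spec_done)

register_actor("a1", "task-spec", "actor-data")
```

With this ordering, a crash between the two puts can leave a spec without an actor row, but never an actor row without its spec.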

Signed-off-by: Jiajun Yao <jeromeyjj@gmail.com>
…oject#35638) (ray-project#35703)

Closes ray-project#35586
See ray-project#35586 (comment)

NumPy treats variable-length byte data as zero-terminated bytes, so any zero bytes encoded into the bytestring itself will be discarded.

Instead, per recommendation in apache/arrow#26470, it seems that variable length bytes should be treated as python objects.
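The difference is easy to demonstrate directly (a small illustration, not Ray's actual conversion code): with the fixed-width `S` dtype, trailing zero bytes are treated as terminators and dropped on retrieval, while `dtype=object` keeps the bytes intact as Python objects.

```python
import numpy as np

# Fixed-width bytes ("S") dtype treats trailing zero bytes as terminators,
# so they are silently dropped when the element is read back.
fixed = np.array([b"a\x00"])                    # dtype becomes "|S2"

# Storing the bytes as Python objects preserves the zero bytes.
preserved = np.array([b"a\x00"], dtype=object)
```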

---------

Signed-off-by: amogkam <amogkamsetty@yahoo.com>
kleinschmidt and others added 29 commits August 2, 2023 16:52
…ptor-fix

Support JuliaFunctionDescriptor in `==` fallback
Enable debug-level logging on the Ray backend
Signed-off-by: Dave Kleinschmidt <dave.f.kleinschmidt@gmail.com>
Spawn a Julia worker via the Raylet
* Support specifying runtime env executable

* Support specifying runtime env args

* Support specifying executable/args via runtime env

* Avoid quoting default

* Switch to using command in RuntimeEnvContext

* Use separate command for Julia

* Add TODO about switching to plugin

Co-authored-by: Dave Kleinschmidt <dave.f.kleinschmidt@gmail.com>
Signed-off-by: Curtis Vogt <curtis.vogt@gmail.com>

---------

Signed-off-by: Curtis Vogt <curtis.vogt@gmail.com>
Co-authored-by: Dave Kleinschmidt <dave.f.kleinschmidt@gmail.com>
@glennmoy glennmoy closed this Oct 3, 2023