
[RLlib] Evaluation logic needs clean-up #44595

Closed
simonsays1980 opened this issue Apr 9, 2024 · 2 comments · Fixed by #45652
Labels
bug Something that is supposed to be working; but isn't P1 Issue that should be fixed within a few weeks rllib RLlib related issues

Comments

@simonsays1980
Collaborator

What happened + What you expected to happen

What happened

I tested evaluation of algorithms and noted two things:

  1. Algorithm.evaluate() does not count agent steps and environment steps if a custom evaluation function is used.
  2. The evaluation worker set is not None, even if evaluation_num_workers=0.

What you expected to happen

That metrics are collected identically when a custom evaluation function is used, and that evaluation_num_workers=0 lets us run the evaluation on the local worker in Algorithm.workers.

Versions / Dependencies

Ray nightly
Python 3.9.12
Fedora Linux 39

Reproduction script

Run our custom evaluation example.
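
The exact example script is not linked here; as a stand-in, a minimal sketch of the setup (the custom_eval_fn body is an assumption, not the actual example script):

from ray.rllib.algorithms.ppo import PPOConfig


def custom_eval_fn(algorithm, eval_workers):
    # Hypothetical minimal custom evaluation: sample once on the local
    # eval worker and return an (empty) metrics dict.
    eval_workers.local_worker().sample()
    return {}


config = (
    PPOConfig()
    .environment("CartPole-v1")
    .evaluation(
        evaluation_interval=1,
        evaluation_num_env_runners=0,
        custom_evaluation_function=custom_eval_fn,
    )
)

algo = config.build()
results = algo.evaluate()
# Observed issue: agent/env step counters do not get incremented when the
# custom evaluation function is used.
print(results)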

Issue Severity

Medium: It is a significant difficulty but I can work around it.

@simonsays1980 simonsays1980 added bug Something that is supposed to be working; but isn't triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Apr 9, 2024
@anyscalesam anyscalesam added the rllib RLlib related issues label Apr 15, 2024
@sven1977
Contributor

On the second point:

The evaluation worker set is not None, even if evaluation_num_workers=0.

This is the expected behavior as long as evaluation_interval != 0.

from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")
    .evaluation(
        evaluation_num_env_runners=0,
    )
)

algo = config.build()

print(algo.evaluation_workers is None)

This should print True. However, if I change evaluation_interval to 1 (the default is 0), RLlib will create an eval WorkerSet with num_workers=0 (only 1 local worker, no remote workers).
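
For completeness, a hedged variant of the same snippet with evaluation_interval set to 1, which should then build the eval WorkerSet (local worker only):

from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")
    .evaluation(
        # Non-zero interval -> eval WorkerSet is created even with
        # 0 remote env runners (only the local worker).
        evaluation_interval=1,
        evaluation_num_env_runners=0,
    )
)

algo = config.build()

# Now expected to print False: the eval WorkerSet exists.
print(algo.evaluation_workers is None)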

@simonsays1980 simonsays1980 added P1 Issue that should be fixed within a few weeks enhancement Request for new feature and/or capability and removed triage Needs triage (eg: priority, bug/not-bug, and owning component) enhancement Request for new feature and/or capability labels Apr 27, 2024
@simonsays1980 simonsays1980 self-assigned this Apr 27, 2024
@simonsays1980
Collaborator Author

@sven1977 Are agent and env steps now counted in a custom evaluation function with the new MetricsLogger?
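
For reference, a rough sketch of how step counting from a custom evaluation function might look with the MetricsLogger; the use of algorithm.metrics, the metric keys, and the reduce mode are assumptions here, not confirmed behavior:

from ray.rllib.utils.metrics import (
    NUM_AGENT_STEPS_SAMPLED,
    NUM_ENV_STEPS_SAMPLED,
)


def custom_eval_fn(algorithm, eval_workers):
    # Sample on the local eval worker (assumes evaluation_num_env_runners=0) ...
    batch = eval_workers.local_worker().sample()
    # ... and log env/agent steps through the algorithm's MetricsLogger so they
    # get aggregated like in the default evaluation path (assumption).
    algorithm.metrics.log_value(NUM_ENV_STEPS_SAMPLED, batch.env_steps(), reduce="sum")
    algorithm.metrics.log_value(NUM_AGENT_STEPS_SAMPLED, batch.agent_steps(), reduce="sum")
    return {}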
