Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RLlib; docs] RLlib documentation do-over (new API stack): Main index page. #48285

Merged
merged 20 commits into from
Oct 29, 2024

Conversation

sven1977
Copy link
Contributor

@sven1977 sven1977 commented Oct 27, 2024

Update, refactor, fix the main RLlib index.html page (for the new API stack).

  • Fully geared towards new API stack.
  • Simplified (only mention a few high-value features).
  • Better overview tables within the tabs for algos, environments, and features.
  • Redo RLlib overview diagram at bottom of page (also simplified).

Why are these changes needed?

Related issue number

Checks

  • I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Copy link
Collaborator

@simonsays1980 simonsays1980 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@@ -147,7 +150,7 @@
)
)

# Create the env to do inference in.
# Create a env to do inference in.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should bring in here loading the pipelines from checkpoint, too and using them.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We have another example, where we do that (the LSTM one, which requires the connector pipeline for a more sophisticated inference loop w/ state in/outs).

@peytondmurray peytondmurray removed their assignment Oct 27, 2024
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
@sven1977 sven1977 added tests-ok The tagger certifies test failures are unrelated and assumes personal liability. rllib RLlib related issues docs An issue or change related to documentation rllib-docs-or-examples Issues related to RLlib documentation or rllib/examples rllib-newstack rllib-oldstack-cleanup Issues related to cleaning up classes, utilities on the old API stack labels Oct 28, 2024
Signed-off-by: Sven Mika <sven@anyscale.io>
Copy link
Contributor

@angelinalg angelinalg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some style nits. Please correct if the rewrites are inaccurate, esp the ones changing passive voice to active voice.

@@ -147,7 +150,7 @@
)
)

# Create the env to do inference in.
# Create a env to do inference in.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
# Create a env to do inference in.
# Create an env to do inference in.

RLlib is used in production by industry leaders in many different verticals, such as
`gaming <https://www.anyscale.com/events/2021/06/22/using-reinforcement-learning-to-optimize-iap-offer-recommendations-in-mobile-games>`_,
`robotics <https://www.anyscale.com/events/2021/06/23/introducing-amazon-sagemaker-kubeflow-reinforcement-learning-pipelines-for>`_,
`finance <https://www.anyscale.com/events/2021/06/22/a-24x-speedup-for-reinforcement-learning-with-rllib-+-ray>`_,
`climate control <https://www.anyscale.com/events/2021/06/23/applying-ray-and-rllib-to-real-life-industrial-use-cases>`_,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

climate control and industrial control links point to the same link. Is that intentional?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed by merging them ...

<div class="termynal" data-termynal>
<span data-ty="input">pip install "ray[rllib]" tensorflow torch</span>
</div>
For installation on computers running Apple Silicon (such as M1),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
For installation on computers running Apple Silicon (such as M1),
For installation on computers running Apple Silicon such as M1,

<span data-ty="input">pip install "ray[rllib]" tensorflow torch</span>
</div>
For installation on computers running Apple Silicon (such as M1),
`follow instructions here. <https://docs.ray.io/en/latest/ray-overview/installation.html#m1-mac-apple-silicon-support>`_
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
`follow instructions here. <https://docs.ray.io/en/latest/ray-overview/installation.html#m1-mac-apple-silicon-support>`_
see `M1 Mac Support. <https://docs.ray.io/en/latest/ray-overview/installation.html#m1-mac-apple-silicon-support>`_

`here. <https://docs.ray.io/en/latest/ray-overview/installation.html#m1-mac-apple-silicon-support>`_
To be able to run our Atari examples, you should also install
`pip install "gym[atari]" "gym[accept-rom-license]" atari_py`.
To be able to run our Atari or MuJoCo examples, you also need to run:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
To be able to run our Atari or MuJoCo examples, you also need to run:
To run the Atari or MuJoCo examples, you also need to run:

doc/source/rllib/rllib-examples.rst Outdated Show resolved Hide resolved
doc/source/rllib/rllib-examples.rst Outdated Show resolved Hide resolved
doc/source/rllib/rllib-examples.rst Outdated Show resolved Hide resolved
doc/source/rllib/rllib-examples.rst Outdated Show resolved Hide resolved
doc/source/rllib/rllib-examples.rst Outdated Show resolved Hide resolved
sven1977 and others added 3 commits October 29, 2024 10:09
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
Signed-off-by: Sven Mika <sven@anyscale.io>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
Signed-off-by: Sven Mika <sven@anyscale.io>
…' into documentation_do_over_index_page

# Conflicts:
#	doc/source/rllib/rllib-algorithms.rst
…' into documentation_do_over_index_page

# Conflicts:
#	doc/source/rllib/rllib-algorithms.rst
sven1977 and others added 4 commits October 29, 2024 12:12
Signed-off-by: Sven Mika <sven@anyscale.io>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
Signed-off-by: Sven Mika <sven@anyscale.io>
Copy link
Contributor

@angelinalg angelinalg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for addressing the comments. I must've missed the "off-policy'ness" in the first review. It'll be good to fix that if you can.

doc/source/rllib/rllib-algorithms.rst Outdated Show resolved Hide resolved
doc/source/rllib/rllib-algorithms.rst Outdated Show resolved Hide resolved
@@ -217,13 +217,13 @@ Asynchronous Proximal Policy Optimization (APPO)
**APPO architecture:** APPO is an asynchronous variant of :ref:`Proximal Policy Optimization (PPO) <ppo>` based on the IMPALA architecture,
but using a surrogate policy loss with clipping, allowing for multiple SGD passes per collected train batch.
In a training iteration, APPO requests samples from all EnvRunners asynchronously and the collected episode
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"RLlib" was my guess and the point was just to clarify who's doing the returning, if needed. If it's obvious to the reader, just ignore my suggestion.

@sven1977 sven1977 enabled auto-merge (squash) October 29, 2024 19:48
@github-actions github-actions bot added the go add ONLY when ready to merge, run all tests label Oct 29, 2024
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
Signed-off-by: Sven Mika <sven@anyscale.io>
@sven1977 sven1977 enabled auto-merge (squash) October 29, 2024 20:18
@sven1977 sven1977 merged commit 7f52a4e into ray-project:master Oct 29, 2024
5 of 6 checks passed
@sven1977 sven1977 deleted the documentation_do_over_index_page branch October 30, 2024 14:21
Jay-ju pushed a commit to Jay-ju/ray that referenced this pull request Nov 5, 2024
JP-sDEV pushed a commit to JP-sDEV/ray that referenced this pull request Nov 14, 2024
mohitjain2504 pushed a commit to mohitjain2504/ray that referenced this pull request Nov 15, 2024
… page. (ray-project#48285)

Signed-off-by: mohitjain2504 <mohit.jain@dream11.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docs An issue or change related to documentation go add ONLY when ready to merge, run all tests rllib RLlib related issues rllib-docs-or-examples Issues related to RLlib documentation or rllib/examples rllib-newstack rllib-oldstack-cleanup Issues related to cleaning up classes, utilities on the old API stack tests-ok The tagger certifies test failures are unrelated and assumes personal liability.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants