Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[docs] simple web crawler example #31900

Merged
merged 14 commits into from
Jan 28, 2023
Merged

[docs] simple web crawler example #31900

merged 14 commits into from
Jan 28, 2023

Conversation

maxpumperla
Copy link
Contributor

We want to continue building more, and more easily accessible examples on our docs. This PR starts that initiative by also setting up the testing harness, so that we can just add more notebooks. No particular reason to start with a crawler, just turned out to be a quick one. We'll work on more relevant examples, like @richardliaw 's AIR example wish list in as a follow-up.

Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
@maxpumperla maxpumperla requested a review from a team as a code owner January 24, 2023 19:15
"In this example we'll quickly demonstrate how to build a simple web scraper in Python and\n",
"parallelize it with Ray Tasks with minimal code changes.\n",
"\n",
"To run this example locally on your machine, please first install `ray` and `beautifulsoup` with\n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's also specify the version you tested against?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

Comment on lines 99 to 108
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"CPU times: user 19.3 s, sys: 340 ms, total: 19.7 s\n",
"Wall time: 25.8 s\n"
]
},
{
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

somehow when this renders, it looks like code, so it'd be nice if we can differentiate somehow...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, good point. I made the output bold for now. We can iterate on that.

Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Copy link
Contributor

@pcmoritz pcmoritz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice, this looks great!

The ecosystem gallery looks a little bit misaligned (e.g. it would probably be better if all the github stars were at the bottom and the logos be a little more aligned) :)

Also for the use case gallery, the logos are sometimes a little bit too close to the top.

The content of the new example looks great!

Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
@richardliaw
Copy link
Contributor

tests not passing?

Signed-off-by: Max Pumperla <max.pumperla@googlemail.com>
@maxpumperla
Copy link
Contributor Author

@richardliaw should be good now, let's see.

@richardliaw richardliaw merged commit 80d13d1 into master Jan 28, 2023
@richardliaw richardliaw deleted the mp_crawler branch January 28, 2023 17:34
edoakes pushed a commit to edoakes/ray that referenced this pull request Mar 22, 2023
Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants