Move code samples to master branch #3271

andy-stark-redis · 2024-06-07T10:33:01Z

Pull Request check-list

Please make sure to review and check all of these items:

Do tests and lints pass with this change?
Do the CI tests pass with this change (enable it first in your forked repo and wait for the github action build to finish)?
Is the new or changed code fully tested?
Was the change added to CHANGES file?

NOTE: these things are not required to open a PR and can be done
afterwards / while the PR is open.

Description of change

The code samples for the main documentation pages are kept in this repo but they were previously in a separate branch (emb-examples). The plan is to move them to the master branch and run tests on them so they get checked frequently.

akx · 2024-06-12T06:18:21Z

doctests/search_vss.py

+URL = ("https://raw.githubusercontent.com/bsbodden/redis_vss_getting_started"
+       "/main/data/bikes.json"
+       )
+response = requests.get(URL, timeout=10)


Maybe this data should be included here instead of being pulled from another repo that can change at any moment. That'd drop the requests dependency.

@akx Thanks for the review - very much appreciated! I did wonder myself if it would be better to add the JSON to the code. However, I think the tutorial gets the data from a URL to illustrate the kind of task the user might want to do in real life. Also, the file on the Github repo was deliberately created for this tutorial, so it shouldn't change unless we want it to. I'd say we should leave it like this for now but I'll find out if we've got any feedback about this page and see if the users have any problems with the HTTP request.

akx · 2024-06-12T06:18:57Z

doctests/search_vss.py

+    # Optional: convert the table to Markdown using Pandas
+    queries_table = pd.DataFrame(results_list)
+    queries_table.sort_values(
+        by=["query", "score"], ascending=[True, False], inplace=True
+    )
+    queries_table["query"] = queries_table.groupby("query")["query"].transform(
+        lambda x: [x.iloc[0]] + [""] * (len(x) - 1)
+    )
+    queries_table["description"] = queries_table["description"].apply(
+        lambda x: (x[:497] + "...") if len(x) > 500 else x
+    )
+    queries_table.to_markdown(index=False)


Given that this is "optional" and nothing is actually even done with the markdown return value, this should probably be removed so pandas and tabulate aren't required to run the tests.

@akx Again, I think it's trying to show the user a real world example. The code sample doesn't use the Markdown explicitly but the output Markdown table is actually shown in the tutorial page. I think the doc text on the page could probably use some improvement but that's a separate PR on the docs repo.

to_markdown() returns the table as a string, so this would just do a lot of work to dataframe-ify things, group them, transform them... and then throw the result away. I really don't think it's a great idea to spend CI CPU cycles (and contribute to our planet burning in a minor way) for that.

I assume these are somehow extracted from something like a notebook session, where the value of the last expression might actually be shown?

OK, I've added a return value to the function and then the main code prints that out when the function is called. The doc page shows the Markdown table as rendered, but it looks like it only shows the first query currently (this is an issue with the doc page, not the code sample - I'll fix that separately).

I'm sorry, I'm not following. Why bother with that either? Is there something asserting on the output?

If there is, if the idea of these being here (and e.g. being run as part of CI?) is to test that "typical" Redis-utilizing code works, then it would break if pandas or tabulate decide to change their (human-oriented) output.

If there isn't, I don't see a point in doing the work to human-format data that no one will care about.

(On that note, running sentence-transformers in CI also sounds like a bad idea, since it will have to download the transformers models, then run embedding computation on CPU, ...)

I think you might be misunderstanding what this code is for :-) The code samples in this folder are included in the docs (eg, https://redis.io/docs/latest/develop/get-started/vector-database/). Their purpose is to explain to the user how to do stuff with Redis and to supply bits of code for copy/paste. The reason for putting them in the client repo as tests is to check that the sample code works against the current version of the client, not to act as real world example tests for the client.

@akx , these tests were living in a separate branch until now, this is a first step to put them as close as possible to the code they are documenting. Right now there is no CI in this repo for them, I guess we can add something in other PRs. The purpose so far is educational, and we want to decrease maintenance effort. I'll merge this as it is.

Move the code samples used for documentation into the master branch, so they sit next to the code they document.

andy-stark-redis and others added 7 commits May 16, 2024 11:10

DOC-3801 initial move of samples to main branch

2adafc2

Merge branch 'redis:master' into master

9ccd6af

DOC-3801 add lint suggestions

ae3779e

Merge branch 'master' of github.com:andy-stark-redis/redis-py

fe63b5d

Merge branch 'redis:master' into master

a15172b

DOC-3801 fixed bugs in list and vector db examples

9161bf5

Merge branch 'master' of github.com:andy-stark-redis/redis-py

1d10b72

andy-stark-redis self-assigned this Jun 7, 2024

Merge branch 'master' into master

843c8d5

andy-stark-redis requested a review from gerzse June 7, 2024 11:05

andy-stark-redis marked this pull request as ready for review June 7, 2024 11:06

andy-stark-redis requested a review from dmaier-redislabs as a code owner June 7, 2024 11:06

andy-stark-redis added 2 commits June 7, 2024 12:50

DOC-3801 updated CHANGES file

ac20f42

Merge branch 'master' of github.com:andy-stark-redis/redis-py

1e903ed

akx reviewed Jun 12, 2024

View reviewed changes

andy-stark-redis and others added 4 commits June 12, 2024 13:46

DOC-3801 add and print explicit return values for tables

3fe0c32

Merge branch 'master' into master

393e96e

Merge branch 'master' into master

cd86b7c

Merge branch 'master' into master

dc490af

gerzse approved these changes Jun 13, 2024

View reviewed changes

gerzse merged commit 733f800 into redis:master Jun 13, 2024
46 checks passed

gerzse added the maintenance Maintenance (CI, Releases, etc) label Jun 14, 2024

gerzse changed the title ~~DOC-3801 move code samples to master branch~~ Move code samples to master branch Jun 20, 2024

agnesnatasya pushed a commit to agnesnatasya/redis-py that referenced this pull request Jul 20, 2024

Move code samples to master branch (redis#3271)

b4d355f

Move the code samples used for documentation into the master branch, so they sit next to the code they document.

vladvildanov pushed a commit that referenced this pull request Sep 27, 2024

Move code samples to master branch (#3271)

7e3ef3c

Move the code samples used for documentation into the master branch, so they sit next to the code they document.

vladvildanov pushed a commit that referenced this pull request Sep 27, 2024

Move code samples to master branch (#3271)

4ba1b56

Move the code samples used for documentation into the master branch, so they sit next to the code they document.

vladvildanov pushed a commit that referenced this pull request Sep 27, 2024

Move code samples to master branch (#3271)

4935e99

Move the code samples used for documentation into the master branch, so they sit next to the code they document.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move code samples to master branch #3271

Move code samples to master branch #3271

andy-stark-redis commented Jun 7, 2024 •

edited

Loading

akx Jun 12, 2024

andy-stark-redis Jun 12, 2024

akx Jun 12, 2024

andy-stark-redis Jun 12, 2024

akx Jun 12, 2024

andy-stark-redis Jun 12, 2024

akx Jun 12, 2024

andy-stark-redis Jun 12, 2024

gerzse Jun 13, 2024

Move code samples to master branch #3271

Move code samples to master branch #3271

Conversation

andy-stark-redis commented Jun 7, 2024 • edited Loading

Pull Request check-list

Description of change

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

andy-stark-redis commented Jun 7, 2024 •

edited

Loading