Add embeddings #11

dabdine · 2022-11-17T20:38:03Z

Adds embeddings. Tested it locally with a client I have to ensure it works.

pullrequest

✅ This pull request was sent to the PullRequest network.

@dabdine you can click here to see the review status or cancel the code review job.

pullrequest

PullRequest Breakdown

Reviewable lines of change

+ 111
- 0

79% Go
21% Go (tests)

Type of change

Feature - These changes are adding a new feature or improvement to existing code.

pullrequest

This update looks good and complete. Have you considered writing any unit tests? It's good that you're using interfaces for dependencies to make it easier to mock things out for tests.

Andy W

Was this helpful? Yes | No

Reviewers will be notified any time you reply to their comments or commit new changes.
You can also request a full follow-up review from your PullRequest dashboard.

pullrequest · 2022-11-18T02:41:16Z

gpt3.go

+		return nil, err
+	}
+
+	output := new(EmbeddingResponse)


I usually just do output := EmbeddingRequest{} and then pass &output to the encoder but this works too

🔹 Giving Information (Nice to have)

Andy W

Yeah that's fair. When I write for public repos I try to copy the methodology used elsewhere in the same repository (this was almost identical to another method--though that also can just be refactored into a shared request method) so it follows a predictable tenor. I don't mind adjusting this though. It's typically how I would write it as well.

Yeah, I mean, both ways initialize things to their zero values but EmbeddingRequest{} makes it easier to later substitute in values to override those zero values.

Andy W

Should be all set for both tests and reference passing.

Note: I opted to add tests using the existing methodology already in the repository. However, these tests are mostly duplicative, as they're testing the same underlying code over and over. My thought is that in a separate PR, someone can refactor the identical code blocks issuing the HTTP request for the majority of the calls, then test those methods, which would eliminate the per-API tests that don't test anything other than that common business logic.

@tylermann We may want to consider adding this to the issues for this repo so it can be addressed in the future if its a real concern.

dabdine · 2022-11-18T03:01:58Z

This update looks good and complete. Have you considered writing any unit tests? It's good that you're using interfaces for dependencies to make it easier to mock things out for tests.

Andy W

Was this helpful? Yes | No

Reviewers will be notified any time you reply to their comments or commit new changes.
You can also request a full follow-up review from your PullRequest dashboard.

Yep, I'll add some tests.

pullrequest · 2022-11-19T16:02:05Z

gpt3_test.go

+			func() (interface{}, error) {
+				return client.Embeddings(ctx, gpt3.EmbeddingsRequest{})
+			},
+			"Post \"https://api.openai.com/v1/embeddings\": request error",


So does this test actually hit this URL? I typically try to isolate "unit" tests and fake out external things that might change, break or be down causing flaky tests.

🔸 Improve Test Coverage (Important)

Andy W

It does not. On line 34 in gpt3_test.go, TestRequestCreationFails sets a fake transport on the http client passed to the gpt3 client. The fake transport is a FakeRoundTripper (already defined in this code base), which just mocks a transport layer, so no network sockets are opened.

However, these test cases are not testing anything unique. You can see all the same cases are just testing an error response. The error response is exactly the same except for the URL in the error message. This implies the entire set of test cases can be simplified to a single test case that tests the constructed error message meets an expected string.

This is why I claimed in a separate comment that the test cases would benefit from a refactor. In the end, the only thing different for each API call (outside of the streaming completion request) is the struct type passed to functions that are doing the same thing. ~~There's no reason to test the golang JSON unmarshaler or HTTP client over and over~~ (edit: the specific thing you'd want to test is that unmarshaling/marshaling to the struct type works -- that struct tags are set properly). Since all the API methods use the same or copy pasted code (except for the streaming completion endpoint), the code could be refactored, we could test the refactored/reused API call method once and have the same or better confidence that the test cases cover what we need.

The main reason why I did not pick up refactoring the code base and test cases is to keep this PR clean so it's easy to review.

That makes sense and it is tricky testing HTTP calls, especially because things like http.NewRequest returns an error but only in cases where url.Parse() returns an error so it can be tricky to force a test to cover that.
I missed where you use the FakeRoundTripper, that's a great way to set up http client tests.

Andy W

tylermann

Thanks for submitting this PR! The code here looks good to me and matches existing patterns. I left one comment just on a response field that I didn't see in the docs so wanted to double check it is intended. Otherwise this looks good to merge.

tylermann · 2022-11-22T16:34:04Z

models.go

+type EmbeddingsResponse struct {
+	Object string             `json:"object"`
+	Data   []EmbeddingsResult `json:"data"`
+	Model  string             `json:"model"`


Just to double check, is this field included in the response? I don't see this in the docs here: https://beta.openai.com/docs/api-reference/embeddings/create

No! Great catch. Removed Model and added Usage. I updated the test cases as well.

pullrequest

Hey @dabdine,
Was the feedback from PullRequest helpful? Yes | No

dabdine added 6 commits November 17, 2022 09:30

Add embeddings

81aa160

Just use strings

64e84aa

Use float64 for the model, not strings

b243c0b

Change index datatype

0be1c94

Actually return the response

1703ff6

Remove userID info as this is needed in the request body

bbc9f63

pullrequest bot reviewed Nov 17, 2022

View reviewed changes

pullrequest bot reviewed Nov 18, 2022

View reviewed changes

dabdine added 3 commits November 18, 2022 09:21

Improve documentation, add tests

33e9b4f

Improve documentation

71d278d

Typo on embeddings API test case name

6e33173

dabdine mentioned this pull request Nov 18, 2022

Comparison to https://github.com/sashabaranov/go-gpt3 #9

Open

pullrequest bot reviewed Nov 19, 2022

View reviewed changes

tylermann approved these changes Nov 22, 2022

View reviewed changes

Remove model; add usage

6a0d223

tylermann merged commit 6604991 into PullRequestInc:main Nov 22, 2022

pullrequest bot reviewed Nov 22, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add embeddings #11

Add embeddings #11

dabdine commented Nov 17, 2022

pullrequest bot left a comment

pullrequest bot left a comment •

edited

Loading

pullrequest bot left a comment

pullrequest bot Nov 18, 2022

dabdine Nov 18, 2022 •

edited

Loading

pullrequest bot Nov 18, 2022

dabdine Nov 18, 2022

macaugh Nov 18, 2022

dabdine commented Nov 18, 2022

pullrequest bot Nov 19, 2022

dabdine Nov 19, 2022 •

edited

Loading

pullrequest bot Nov 20, 2022

tylermann left a comment

tylermann Nov 22, 2022

dabdine Nov 22, 2022

pullrequest bot left a comment

Add embeddings #11

Add embeddings #11

Conversation

dabdine commented Nov 17, 2022

pullrequest bot left a comment

Choose a reason for hiding this comment

pullrequest bot left a comment • edited Loading

Choose a reason for hiding this comment

PullRequest Breakdown

pullrequest bot left a comment

Choose a reason for hiding this comment

pullrequest bot Nov 18, 2022

Choose a reason for hiding this comment

🔹 Giving Information (Nice to have)

dabdine Nov 18, 2022 • edited Loading

Choose a reason for hiding this comment

pullrequest bot Nov 18, 2022

Choose a reason for hiding this comment

dabdine Nov 18, 2022

Choose a reason for hiding this comment

macaugh Nov 18, 2022

Choose a reason for hiding this comment

dabdine commented Nov 18, 2022

pullrequest bot Nov 19, 2022

Choose a reason for hiding this comment

🔸 Improve Test Coverage (Important)

dabdine Nov 19, 2022 • edited Loading

Choose a reason for hiding this comment

pullrequest bot Nov 20, 2022

Choose a reason for hiding this comment

tylermann left a comment

Choose a reason for hiding this comment

tylermann Nov 22, 2022

Choose a reason for hiding this comment

dabdine Nov 22, 2022

Choose a reason for hiding this comment

pullrequest bot left a comment

Choose a reason for hiding this comment

pullrequest bot left a comment •

edited

Loading

dabdine Nov 18, 2022 •

edited

Loading

dabdine Nov 19, 2022 •

edited

Loading