[API] Add octoai back-end #936
base: master
Conversation
Hello @haileyschoelkopf, @lintangsutawika! Could you review the PR?
Thank you for this PR! I notice that you say "no support for logits"... Logits are a crucial resource for machine learning researchers, and lacking logits will heavily restrict the tasks that can be run. Are there plans to add support for this in the future? If not, what is the overriding value that OctoAI adds? For example, are there popular models that are only available through that platform?
Hello @StellaAthena! I think you meant the loglikelihood approach, which works with logits internally. Yes, I am working on supporting it right now, and it will be done in the near future.
Hi @StellaAthena, As @vvchernov mentioned, we'll be supporting logprobs (and enabling loglikelihood evals) shortly, and we can hold this PR until that lands if you'd like. One benefit of adding OpenAI endpoint support is that, because our LLMs expose an OpenAI-compatible API, these integrations will be useful to people beyond just OctoAI's endpoints. Also, for people using fine-tuned endpoints (we have a whole roadmap for this), it's helpful to be able to run these evals against their fine-tuned models without needing to host them locally. Separately from these integrations, we also have some other improvements/fixes for lm-evaluation-harness workflows incoming.
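The portability argument above can be sketched as follows. This is a minimal illustration, not the harness's actual code; the endpoint URLs and model names are assumptions made for the example:

```python
def build_completion_request(base_url: str, model: str, prompt: str) -> dict:
    """Build an OpenAI-style completion request; only the host differs per provider."""
    return {
        "url": f"{base_url}/v1/completions",
        "json": {"model": model, "prompt": prompt, "max_tokens": 16},
    }

# The same payload shape targets either provider; only base_url changes.
openai_req = build_completion_request("https://api.openai.com", "gpt-3.5-turbo-instruct", "Hello")
octoai_req = build_completion_request("https://text.octoai.run", "octoai-model", "Hello")
```

Because the request shape is identical, any integration written against this interface works for every provider that speaks the same API.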
Okay, this is really helpful. We already support multiple websites with an OpenAI-compatible API. Right now these are all bundled together for convenience of maintenance and controlled via aliases. It looks like we can do something similar here to make support easier to maintain.

That said, OpenAI is rapidly deprecating its support for functionality other than chat completions. It's been on my radar that we might want to separate the "official" implementation (which will necessarily need to change to track their actual API) from support for a "legacy" version that represents how the API worked before they deprecated things like loglikelihood.

I assume that you're an OctoAI employee? Do you know if the current plan is to maintain consistency with the OpenAI API regardless of how it changes, or if you're planning on preserving the original version that you currently support?

@haileyschoelkopf @lintangsutawika We should ask this of goose.ai as well. It may be that the best path forward is to have a primary class for the official OpenAI API and subclasses for different frameworks that implement and extend it?
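One way the primary-class-plus-subclasses idea could look. The class names, attributes, and endpoints below are hypothetical illustrations, not the harness's actual API:

```python
class OpenAICompletionsLM:
    """Primary class tracking the official OpenAI API (hypothetical sketch)."""
    BASE_URL = "https://api.openai.com/v1"
    supports_logprobs = True  # legacy completions endpoint exposes logprobs

    def __init__(self, model: str, api_key: str):
        self.model = model
        self.api_key = api_key


class OctoAICompletionsLM(OpenAICompletionsLM):
    """Subclass reusing the request logic against OctoAI's compatible endpoint."""
    BASE_URL = "https://text.octoai.run/v1"  # assumed endpoint, for illustration
    supports_logprobs = False  # until logprob support lands on the provider side
```

A subclass only overrides what differs (endpoint, capability flags), so tracking changes in the official API happens in one place while providers keep their own "legacy" or extended behavior.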
Yes, definitely agree. vLLM and GGML are other frameworks that could be used via a self-hosted API endpoint with this. I'm not certain about goose.ai but will check. I will take a look at this PR and leave comments, though will wait until logprobs land to merge. Also of note: we have a new version release in
Hello @haileyschoelkopf! Yes, we are following the big-refactor branch. The reason the master branch is used is that the Hugging Face leaderboard uses it. Most likely, when big-refactor is released, HF will move to it, and we will too.
Yes. We will maintain API compatibility with the core functionality (e.g., most LLM providers don't support the
The idea of supporting the OpenAI API with plugins/mixins to indicate which aspects of the API are supported sounds great to us.
We've now switched to |
An OCTOAI_API_KEY is required to use it.
Add OctoAI (https://octoai.cloud/) as a back-end for models.
The greedy_until path is enabled (loglikelihood support is WIP).
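The API-key requirement above could be enforced with a helper like the following; this is a minimal sketch, and the function name is illustrative rather than taken from the PR:

```python
import os


def require_octoai_key() -> str:
    """Return the OctoAI API key, failing early with a clear message if unset."""
    key = os.environ.get("OCTOAI_API_KEY")
    if not key:
        raise RuntimeError("OCTOAI_API_KEY must be set to use the OctoAI back-end")
    return key
```

Failing at startup with an explicit message is friendlier than letting the first HTTP request return an opaque authentication error mid-eval.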