
Implement async iterator for OpenAI stream #14920

Conversation

@colin-grant-work (Contributor) commented on Feb 13, 2025

What it does

Fixes #14902 with an alternative to #14914, based on the implementation of the chunk iterator in the OpenAI client itself. I couldn't find a clean way to piggyback on that implementation while also adding messages, as we want to, but it did seem worth following. The main differences between the existing implementation and this one are:

  • The introduction of queues for reads and a cache for messages received (a sketch of this design follows the list). I think this is preferable to the current system of pinning resolve / reject in the closure, for two reasons. First, that system requires clients to operate synchronously on the data they receive, which isn't guaranteed: if a client did async work on a chunk and our iterator received more than one chunk in the meantime, those chunks would resolve the same promise as before and would be lost. Second, it assumes the client won't request more than one chunk before awaiting them; that is how a client should behave under the async-iterator protocol, but it isn't strictly required. In the current implementation, a client that did so would create a new promise for each request, but only the resolver for the last one would be retained, so the promises in the middle would be irresolvable.
  • I've converted some of the emitted calls to once calls. Internally, emitted delegates to once, so I think we should still get the same results, but please correct me if I'm wrong.
  • Added logic to dispose of our listeners when we're done.
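
To make the first point concrete, here is a minimal TypeScript sketch of the queue-based design. The class and member names are made up for illustration; this is not the actual Theia or OpenAI client code.

```ts
// Illustrative sketch of a queue-based async iterator; names are hypothetical.
class MessageStream<T> implements AsyncIterableIterator<T> {
    // Chunks that arrived before anyone called next().
    protected readonly cache: IteratorResult<T>[] = [];
    // One resolver pair per outstanding next() call that arrived before a chunk did.
    protected readonly pendingReads: Array<{
        resolve(result: IteratorResult<T>): void;
        reject(error: unknown): void;
    }> = [];
    protected done = false;
    protected error: unknown;

    /** Wired to the stream's chunk listener. */
    push(value: T): void {
        const read = this.pendingReads.shift();
        if (read) {
            read.resolve({ value, done: false });
        } else {
            this.cache.push({ value, done: false });
        }
    }

    /** Wired to the stream's 'end' listener. */
    end(): void {
        this.done = true;
        for (const read of this.pendingReads.splice(0)) {
            read.resolve({ value: undefined, done: true });
        }
    }

    /** Wired to the stream's 'error' and 'abort' listeners. */
    fail(error: unknown): void {
        this.error = error;
        for (const read of this.pendingReads.splice(0)) {
            read.reject(error);
        }
    }

    next(): Promise<IteratorResult<T>> {
        const cached = this.cache.shift();
        if (cached) {
            return Promise.resolve(cached);
        }
        if (this.error !== undefined) {
            return Promise.reject(this.error);
        }
        if (this.done) {
            return Promise.resolve({ value: undefined, done: true });
        }
        // Each next() call gets its own resolver, so multiple outstanding reads
        // cannot clobber one another the way a single pinned resolve/reject
        // pair in a closure can.
        return new Promise((resolve, reject) => this.pendingReads.push({ resolve, reject }));
    }

    [Symbol.asyncIterator](): AsyncIterableIterator<T> {
        return this;
    }
}
```

The stream's chunk listener calls push, and its end / error / abort listeners call end and fail; every next() call either drains the cache or enqueues its own resolver, so no resolver is ever overwritten and no chunk is lost.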

How to test

  1. Listener memory leak in AI system #14902 should be solved: no more warnings when interacting with OpenAI.
  2. Other operations should behave normally.

NB: as of today or yesterday, the progress display for tool calls on master is a bit different: they mostly appear once they're already complete. I don't believe that's a symptom of any change in message delivery implemented here.

Follow-ups

Breaking changes

  • This PR introduces breaking changes and requires careful review. If yes, the breaking changes section in the changelog has been updated.

Attribution

Review checklist

Reminder for reviewers

colin-grant-work force-pushed the bugfix/async-iterator-implementation branch from 8cdad62 to 86207e4 on February 13, 2025 at 22:28
- adds openai async iterator test cases
- adapts end of stream handling
- log aborts with debug severity
@sdirix (Member) left a comment

Hi @colin-grant-work! Thank you for your great work ❤️

Although the problematic conditions did not (yet) occur in the Theia codebase, an adopter who invoked the API themselves might have run into them. In general, the code is also much more cleanly encapsulated this way. So, thanks for the initiative!

I added a number of unit tests and noticed a weird issue with the 'end' handling. We always handed over finalChatCompletion; however, that is actually a promise. The code did not handle it correctly and therefore emitted an undefined before completing.

Now, I don't know why we ever used finalChatCompletion: the result of that promise is the full completion, but we had already handled the chunks as they arrived anyway. So I don't think we need to handle it at all.
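
For illustration, reusing the MessageStream sketch from the PR description, the fix amounts to wiring like the following. The event names echo the OpenAI Node client's stream, but the wiring itself is assumed for the example, not the actual Theia code:

```ts
// Illustrative wiring; `MessageStream` is the sketch from the PR description,
// and the event names are stand-ins for the OpenAI client's stream events.
function wire<T>(
    stream: {
        on(event: 'chunk', listener: (chunk: T) => void): void;
        once(event: 'end', listener: () => void): void;
    },
    out: MessageStream<T>
): void {
    stream.on('chunk', chunk => out.push(chunk));
    stream.once('end', () => {
        // Buggy variant: pushing finalChatCompletion (a Promise) here made
        // downstream code emit `undefined` before completing.
        // Since every chunk was already pushed as it arrived, the final
        // completion adds nothing: just terminate the iteration.
        out.end();
    });
}
```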

Please have a look whether you agree and are fine with the code change. Thanks!

@colin-grant-work (Contributor, Author) commented

@sdirix, thanks for adding tests for such a variety of cases. That certainly helps increase confidence that things should work correctly. I seem to have left in one log statement that I didn't mean to, so I'll remove it, but otherwise, it looks good to me.

colin-grant-work force-pushed the bugfix/async-iterator-implementation branch from be72949 to 8c4c0bf on February 14, 2025 at 16:03
colin-grant-work merged commit b0f91ae into eclipse-theia:master on Feb 14, 2025
10 of 11 checks passed
colin-grant-work deleted the bugfix/async-iterator-implementation branch on February 14, 2025 at 22:10
github-actions bot added this to the 1.59.0 milestone on Feb 14, 2025