
Consolidate SpeechModel APIs #1518

Open
wants to merge 1 commit into main

Conversation

ThomasVitale
Contributor

  • Consolidate SpeechModel APIs into the spring-ai-core module, make them null-safe, and cover them with unit tests (a sketch of the resulting contract follows below).
  • Refactor OpenAiSpeechModel APIs to implement the new consolidated APIs.
  • Delete leftover ImageResponseMetadata class in the spring-ai-openai module.

Fixes gh-1496
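
For context, a minimal sketch of what the consolidated contract could look like, assuming it follows the generic `Model`/`ModelRequest`/`ModelResponse` contracts that spring-ai-core uses for its other model types; the `Speech*` names below are illustrative, not quoted from the diff:

```java
import org.springframework.ai.model.Model;
import org.springframework.ai.model.ModelRequest;
import org.springframework.ai.model.ModelResponse;
import org.springframework.ai.model.ModelResult;

// Request: the text to synthesize (options and metadata omitted for brevity).
interface SpeechPrompt extends ModelRequest<String> {}

// A single synthesized-audio result.
interface SpeechResult extends ModelResult<byte[]> {}

// Response: one or more audio results plus response metadata.
interface SpeechResponse extends ModelResponse<SpeechResult> {}

// The consolidated contract, mirroring the shape of ChatModel/ImageModel in core;
// OpenAiSpeechModel would then implement this interface.
interface SpeechModel extends Model<SpeechPrompt, SpeechResponse> {

    @Override
    SpeechResponse call(SpeechPrompt prompt);
}
```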

@ThomasVitale
Contributor Author

Once this PR is merged, I'll have two follow-up PRs: one to update the documentation and one to add observability instrumentation to the OpenAiSpeechModel.

@markpollack markpollack self-assigned this Oct 11, 2024
@markpollack markpollack added this to the 1.0.0-M4 milestone Oct 11, 2024
@markpollack
Member

markpollack commented Oct 11, 2024

The biggest concern for me is how portable the abstractions are; we need at least two implementations to ensure that we have the correct abstraction before moving it into main. Also, most of the changes seem sort of "shallow", e.g. just changing the name of the class and the types, but aside from the name change they have the same functionality as the 'non-speech' core API.

Thoughts?
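
For comparison, a rough sketch of the image contract already in spring-ai-core, recalled from the codebase rather than quoted from it (check the repository for the authoritative definition):

```java
import org.springframework.ai.image.ImagePrompt;
import org.springframework.ai.image.ImageResponse;
import org.springframework.ai.model.Model;

// The existing core image abstraction: the proposed speech contract has the same
// shape with different request/response types, which is the "shallow" similarity
// noted above.
public interface ImageModel extends Model<ImagePrompt, ImageResponse> {

    ImageResponse call(ImagePrompt request);
}
```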

@ThomasVitale
Contributor Author

I also thought about that, so I was conservative with the changes. The main benefit I see in having the Speech APIs in Core is that it's simpler to demonstrate and try out new implementations, or even for users to implement their own custom ones, mainly because of the semantics given by the interfaces and the few conventions they come with.
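
To make that last point concrete, a user-side implementation only needs to satisfy the core contract. A minimal sketch, assuming the hypothetical `Speech*` types from the description above; the class and helpers here are invented for illustration:

```java
// Hypothetical custom implementation against the consolidated core contract;
// LocalTtsSpeechModel and its helpers are invented for illustration.
class LocalTtsSpeechModel implements SpeechModel {

    @Override
    public SpeechResponse call(SpeechPrompt prompt) {
        // Synthesize audio with some local engine and wrap it in the core response type.
        byte[] audio = synthesize(prompt.getInstructions());
        return wrap(audio);
    }

    private byte[] synthesize(String text) {
        // Engine-specific text-to-speech would go here.
        return new byte[0];
    }

    private SpeechResponse wrap(byte[] audio) {
        // Construct the core response type around the audio payload.
        throw new UnsupportedOperationException("illustrative stub");
    }
}
```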

But I don't have strong feelings about it, so I'd be fine parking this for now. I was looking into adding observability for speech models, but we can wait for that (I wouldn't do it inside the OpenAI module, so as to keep consistency with the rest of the observability implementation).

@habuma do you have any thoughts about this change?

Successfully merging this pull request may close these issues: Common types for text-to-speech