[audio] Add pcm audio websocket with dialog support #4032

GiviMAD · 2024-01-10T20:45:06Z

This PR adds a WebSocket adapter that allows transferring PCM audio to a sink and a source whose existence is tied to the WebSocket connection: the components are registered on connection and unregistered on disconnection. It also allows spawning a dialog processor instance connected to them, which is likewise torn down on disconnection.
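The connection-scoped lifecycle described above could look roughly like the following. This is only a minimal sketch, not the code in this PR: `ComponentRegistry`, `PcmConnection`, the `-sink`/`-source` id suffixes and the `Runnable` used to stop the dialog are all hypothetical stand-ins, and the real openHAB audio and voice APIs are deliberately not used.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

class ConnectionScopedAudio {

    /** Hypothetical registry of audio components; a stand-in, not the openHAB AudioManager. */
    static class ComponentRegistry {
        private final Map<String, Object> components = new ConcurrentHashMap<>();

        void register(String id, Object component) {
            components.put(id, component);
        }

        void unregister(String id) {
            components.remove(id);
        }

        boolean contains(String id) {
            return components.containsKey(id);
        }

        int size() {
            return components.size();
        }
    }

    /** Hypothetical per-connection handler: everything it registers lives only as long as the connection. */
    static class PcmConnection {
        private final String connectionId;
        private final ComponentRegistry registry;
        private Runnable dialogStopper; // non-null only while a dialog is running

        PcmConnection(String connectionId, ComponentRegistry registry) {
            this.connectionId = connectionId;
            this.registry = registry;
        }

        /** Called when the WebSocket opens: register the sink and the source for this connection. */
        void onConnect() {
            registry.register(connectionId + "-sink", new Object());
            registry.register(connectionId + "-source", new Object());
        }

        /** Remember how to stop a dialog that was spawned for this connection. */
        void startDialog(Runnable stopper) {
            this.dialogStopper = stopper;
        }

        /** Called when the WebSocket closes: stop the dialog first, then unregister the components. */
        void onClose() {
            if (dialogStopper != null) {
                dialogStopper.run();
                dialogStopper = null;
            }
            registry.unregister(connectionId + "-sink");
            registry.unregister(connectionId + "-source");
        }
    }

    public static void main(String[] args) {
        ComponentRegistry registry = new ComponentRegistry();
        PcmConnection connection = new PcmConnection("client-1", registry);
        connection.onConnect();
        connection.startDialog(() -> System.out.println("dialog stopped"));
        connection.onClose();
        System.out.println("components left: " + registry.size());
    }
}
```

Stopping the dialog before unregistering the components is a design choice of this sketch, so the dialog never holds a reference to a sink or source that is already gone.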

The PR is incomplete and untested. I'll try to add a client to the UI and develop both PRs in parallel.

Also, IDK if placing the code in the voice bundle is correct. I put it there because that bundle already requires the audio bundle. To me it makes more sense to have it in the audio bundle, but then I'd have to require both the voice and websocket bundles there.

Best regards!

kaikreuzer · 2024-01-13T09:34:55Z

> Also IDK if placing the code in the voice bundle is correct

As the websocket adapter is for audio streams in general, I would say that it should ideally be placed in the audio bundle.
The dependency on the voice manager is very small; maybe you could come up with some small hook, which could be used by the voice bundle to register the dialog processing logic into the websocket adapter?
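One possible shape for such a hook is sketched below, purely as an illustration. The `DialogHandler` interface, the `AudioWebSocketAdapter` class and their method names are hypothetical and are not taken from the PR or from openHAB. In an OSGi setup the setter pair would typically be bound as an optional, dynamic service reference, but that wiring is left out here to keep the example self-contained.

```java
import java.util.Optional;

class DialogHookSketch {

    /** Hypothetical hook that the voice bundle would implement and register. */
    interface DialogHandler {
        /** Starts a dialog on the given sink and source and returns an action that stops it. */
        Runnable startDialog(String sinkId, String sourceId);
    }

    /** Hypothetical adapter living in the audio side; it knows nothing about the voice manager. */
    static class AudioWebSocketAdapter {
        private volatile DialogHandler dialogHandler; // null while no voice bundle is present

        void setDialogHandler(DialogHandler handler) {
            this.dialogHandler = handler;
        }

        void unsetDialogHandler(DialogHandler handler) {
            if (this.dialogHandler == handler) {
                this.dialogHandler = null;
            }
        }

        /** Returns the stop action if a handler is registered, or empty if dialogs are unavailable. */
        Optional<Runnable> requestDialog(String sinkId, String sourceId) {
            DialogHandler handler = dialogHandler;
            if (handler == null) {
                return Optional.empty();
            }
            return Optional.ofNullable(handler.startDialog(sinkId, sourceId));
        }
    }

    public static void main(String[] args) {
        AudioWebSocketAdapter adapter = new AudioWebSocketAdapter();
        System.out.println("dialog available: " + adapter.requestDialog("sink", "source").isPresent());

        // The voice side plugs in its logic without the adapter depending on it at compile time.
        adapter.setDialogHandler((sinkId, sourceId) -> () -> System.out.println("dialog stopped"));
        adapter.requestDialog("sink", "source").ifPresent(Runnable::run);
    }
}
```

With this arrangement plain audio streaming keeps working when no handler is registered, and only the dialog request is refused.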

GiviMAD · 2024-01-13T18:58:10Z

> > Also IDK if placing the code in the voice bundle is correct
>
> As the websocket adapter is for audio streams in general, I would say that it should ideally be placed in the audio bundle. The dependency on the voice manager is very small; maybe you could come up with some small hook, which could be used by the voice bundle to register the dialog processing logic into the websocket adapter?

Yes, having it in the voice bundle is quite unintuitive. I hadn't thought of that solution; it seems like the correct one.

Thank you for the feedback!

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

GiviMAD · 2024-01-14T23:19:10Z

I had to refactor the code into the websocket.io bundle for it to build without modifying the BOM features, but I think it also makes sense to have it there with the other WebSocketAdapter implementations.

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

GiviMAD · 2024-01-16T18:15:58Z

I think this is ready for review.

I was able to write a POC in the UI and everything seems to work.

A quick video recorded with my mobile:

video_2024-01-16.mp4

The UI side still needs a lot of work. I'm going to create a linked issue there asking for some guidance; I wanted to have a POC first to confirm there are no big issues with the integration.

Best regards!

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

florian-h05 · 2024-11-02T09:33:17Z

@openhab/core-maintainers Would be great if this could be reviewed for openHAB 5, as it would allow us to have a voice assistant inside the UI.

GiviMAD requested a review from a team as a code owner January 10, 2024 20:45

GiviMAD mentioned this pull request Jan 13, 2024

Upgrade from webpack v4 to v5 openhab/openhab-webui#2267

Merged

[audio] Add pcm audio websocket with dialog support

17970d5

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

GiviMAD force-pushed the voice/audio_pcm_websocket branch from 11d3025 to 17970d5 Compare January 14, 2024 11:33

GiviMAD changed the title ~~[WIP] [audio/voice] Add pcm audio websocket with dialog support~~ [WIP] [audio] Add pcm audio websocket with dialog support Jan 14, 2024

GiviMAD added 4 commits January 14, 2024 14:55

improve KSEdgeService support

c5a3ec7

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

fix/spotless voice provider

be537e4

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

fix dep version

af83a81

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

refactor code into websocket io

cdc08a9

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

GiviMAD added 2 commits January 16, 2024 11:57

fixes

74d2457

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

fixes and improvements

d356b6d

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

wborn added the enhancement label Jan 16, 2024

improvements

ad559e6

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

GiviMAD changed the title ~~[WIP] [audio] Add pcm audio websocket with dialog support~~ [audio] Add pcm audio websocket with dialog support Jan 16, 2024

fix comments

2e808a4

Signed-off-by: Miguel Álvarez <miguelwork92@gmail.com>

GiviMAD mentioned this pull request Jan 17, 2024

[Main UI] Voice support openhab/openhab-webui#2275

Open

GiviMAD mentioned this pull request Jan 26, 2024

[WIP] [UI] Add voice dialog openhab/openhab-webui#2285

Open

Merge branch 'main' into voice/audio_pcm_websocket

dc9aa43
