Whisper pipeline: implement chunk streamer for long-form audio processing #1148

as-suvorov · 2024-11-05T14:23:13Z

No description provided.

ilya-lavrenov · 2024-11-05T17:32:31Z

src/cpp/include/openvino/genai/streamer_base.hpp

 class StreamerBase {
 public:
    /// @brief put is called every time new token is decoded,
    /// @return bool flag to indicate whether generation should be stopped, if return true generation stops
    virtual bool put(int64_t token) = 0;
-    
+
    /// @brief end is called at the end of generation. It can be used to flush cache if your own streamer has one
    virtual void end() = 0;

    virtual ~StreamerBase() = default;


It's better to move the destructor definition to a .cpp file and export this class.

That will help with RTTI issues on some platforms.
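A minimal sketch of what this suggestion could look like, collapsed into a single file so it compiles on its own. The names `StreamerSketch`, `CollectingStreamer`, `SKETCH_EXPORTS`, and `rtti_roundtrip_works` are hypothetical stand-ins, not names from the PR. My reading of the comment is that a class whose virtual functions are all pure or inline has no single translation unit that owns its vtable and type info, which can make `dynamic_cast` unreliable across shared-library boundaries on some toolchains. Defining the destructor out of line is one common way to give it such a home.

```cpp
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for an export macro such as OPENVINO_GENAI_EXPORTS.
#if defined(_WIN32)
#  define SKETCH_EXPORTS
#else
#  define SKETCH_EXPORTS __attribute__((visibility("default")))
#endif

// --- What would live in the public header ---
class SKETCH_EXPORTS StreamerSketch {
public:
    // Return true to stop generation, false to continue.
    virtual bool put(int64_t token) = 0;
    virtual void end() = 0;

    // Declared only; the definition is out of line.
    virtual ~StreamerSketch();
};

// --- What would live in the .cpp file of the library ---
// The destructor is the only virtual function that is neither pure nor inline,
// so the vtable and type info can be emitted in this one translation unit.
StreamerSketch::~StreamerSketch() = default;

// --- What a user of the library might write ---
class CollectingStreamer : public StreamerSketch {
public:
    bool put(int64_t token) override {
        tokens.push_back(token);
        return false;  // keep generating
    }
    void end() override { ended = true; }

    std::vector<int64_t> tokens;
    bool ended = false;
};

// Returns true if a base pointer can be cast back to the derived type via RTTI.
inline bool rtti_roundtrip_works() {
    std::unique_ptr<StreamerSketch> base = std::make_unique<CollectingStreamer>();
    base->put(42);
    base->end();
    auto* derived = dynamic_cast<CollectingStreamer*>(base.get());
    return derived != nullptr && derived->tokens.size() == 1 && derived->ended;
}
```

In a single executable the cast succeeds either way; the difference only matters when the base class and the derived class are compiled into different binaries.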

ilya-lavrenov · 2024-11-05T17:33:48Z

src/cpp/src/utils.hpp

@@ -50,6 +51,7 @@ Config from_config_json_if_exists(const std::filesystem::path& models_path, cons
 }

 ov::genai::StreamerVariant get_streamer_from_map(const ov::AnyMap& config_map);
+ov::genai::ChunkStreamerVariant get_chunk_streamer_from_map(const ov::AnyMap& config_map);


This is a Whisper-specific entity. Maybe we can move it to the Whisper files? The same applies in other places, such as py_utils.hpp.

These utils are supposed to be generic.

ilya-lavrenov · 2024-11-05T17:36:20Z

src/python/py_openvino_genai.cpp

@@ -76,6 +78,28 @@ class ConstructableStreamer: public StreamerBase {
    }
 };

+class ConstructableChunkStreamer: public ChunkStreamerBase {


Move this to src/python/py_whisper_pipeline.cpp?

ilya-lavrenov · 2024-11-05T17:38:13Z

src/cpp/include/openvino/genai/whisper_pipeline.hpp

@@ -105,4 +109,7 @@ class OPENVINO_GENAI_EXPORTS WhisperPipeline {
    WhisperGenerationConfig get_generation_config() const;
    void set_generation_config(const WhisperGenerationConfig& config);
 };
+
+OPENVINO_GENAI_EXPORTS std::pair<std::string, Any> chunk_streamer(ChunkStreamerVariant func);
+OPENVINO_GENAI_EXPORTS std::pair<std::string, Any> whisper_generation_config(const WhisperGenerationConfig& config);


I think we can overload the existing functions, such as streamer and generation_config, instead of introducing Whisper-specific ones.

Example:

openvino.genai/src/cpp/include/openvino/genai/image_generation/generation_config.hpp

Lines 99 to 100 in 2370f6a

OPENVINO_GENAI_EXPORTS
std::pair<std::string, ov::Any> generation_config(const ImageGenerationConfig& generation_config);
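The overload idea in this comment can be sketched as follows. This is an illustration only: `std::any` stands in for `ov::Any`, the two config structs and their fields are hypothetical placeholders for `ImageGenerationConfig` and `WhisperGenerationConfig`, and the property key `"generation_config"` is an assumption rather than something shown in the PR.

```cpp
#include <any>
#include <string>
#include <typeinfo>
#include <utility>

// Hypothetical placeholder config types; the fields are illustrative only.
struct ImageGenerationConfigSketch {
    int steps = 50;
};

struct WhisperGenerationConfigSketch {
    bool return_timestamps = false;
};

// One function name, overloaded per config type. The compiler selects the
// overload from the argument type, so no Whisper-specific name is needed.
inline std::pair<std::string, std::any> generation_config(const ImageGenerationConfigSketch& config) {
    return std::make_pair(std::string("generation_config"), std::any(config));
}

inline std::pair<std::string, std::any> generation_config(const WhisperGenerationConfigSketch& config) {
    return std::make_pair(std::string("generation_config"), std::any(config));
}

// A consumer can recover which config type a property carries.
inline bool holds_whisper_config(const std::pair<std::string, std::any>& property) {
    return property.first == "generation_config" &&
           property.second.type() == typeid(WhisperGenerationConfigSketch);
}
```

Both overloads produce a property under the same key, so each pipeline is expected to check for its own config type when reading the value back.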

Use chunk streamer

5b84666

as-suvorov added the category: whisper, category: Python API, and category: GenAI C++ API labels Nov 5, 2024

as-suvorov added this to the 2025.0 milestone Nov 5, 2024

github-actions bot added the category: sampling Sampling / Decoding algorithms label Nov 5, 2024

as-suvorov changed the title ~~Whisper pipeline: implement chunk streamer for long-form audio~~ Whisper pipeline: implement chunk streamer for long-form audio processing Nov 5, 2024

as-suvorov added do_not_review and removed do_not_review labels Nov 5, 2024

as-suvorov requested review from ilya-lavrenov and Wovchena November 5, 2024 16:38

as-suvorov assigned ilya-lavrenov and Wovchena Nov 5, 2024

as-suvorov marked this pull request as ready for review November 5, 2024 16:38

ilya-lavrenov reviewed Nov 5, 2024
