Debounce transcript processing 300ms for controlling event loop better #1211

beastoin · 2024-10-29T22:25:47Z

Summary by CodeRabbit

New Features
- Introduced a new WebSocket utility for handling audio data and transcripts.
- Added a WebSocket endpoint for real-time audio processing.
- Enhanced WebSocket connection management for improved robustness.
Bug Fixes
- Improved error handling and logging for WebSocket and webhook interactions, ensuring better traceability and context during failures.
Refactor
- Streamlined internal logic for audio transcription and WebSocket handling, improving efficiency and readability.

coderabbitai · 2024-10-29T22:25:54Z

Walkthrough

This pull request adds a WebSocket utility function, _websocket_util_trigger, which handles audio data and transcript segments according to each message's header type, together with a new WebSocket endpoint, websocket_endpoint_trigger; both live in backend/routers/pusher.py. The transcribe_v2.py file is modified to improve WebSocket handling for audio transcription, including better error handling and data packaging. A new function, connect_to_trigger_pusher, is added to backend/utils/pusher.py for establishing WebSocket connections. The webhook functions in webhooks.py are updated with improved error handling and logging.

Changes

File Path	Change Summary
backend/routers/pusher.py	Added `_websocket_util_trigger` and `websocket_endpoint_trigger` for handling audio data via WebSocket.
backend/routers/transcribe_v2.py	Refactored `stream_transcript` to `stream_transcript_process`, improved error handling, and updated WebSocket connection logic.
backend/utils/pusher.py	Introduced `connect_to_trigger_pusher` for establishing WebSocket connections with error handling.
backend/utils/webhooks.py	Enhanced error handling and logging for webhook requests, including timeout settings for HTTP requests.

Possibly related PRs

Disable all external triggers including realtime integration, realtime… #1178: This PR modifies the backend/routers/transcribe_v2.py file, which is also affected by the main PR. It focuses on the handling of audio data and WebSocket communication, which is relevant to the new WebSocket utility function introduced in the main PR.

Poem

🐇 In the burrow where echoes play,
New webs of sound weave night and day.
With triggers and streams, we hop along,
Transcripts and bytes in a joyful song!
Through wires we dance, with hearts so light,
Celebrating changes, all feels just right! 🌟

coderabbitai

Actionable comments posted: 8

🧹 Outside diff range and nitpick comments (6)

backend/utils/pusher.py (1)

Line range hint 1-11: Add missing websockets import

The websockets module is used but not explicitly imported. Add the import to prevent potential runtime errors.

Add this import at the top of the file:

import uuid
import os
from datetime import datetime, timezone, timedelta
from enum import Enum
+import websockets

import opuslib

🧰 Tools

🪛 Ruff

53-53: websockets may be undefined, or defined from star imports

(F405)

backend/utils/webhooks.py (3)

Line range hint 62-78: Critical: Implement debouncing and use async HTTP client.

Several issues need to be addressed:

The PR objective mentions debouncing transcript processing (300ms), but no debouncing logic is implemented.
Using synchronous requests.post in an async function can block the event loop.

Consider this implementation:

import asyncio
import logging
from typing import List

import aiohttp
from async_timeout import timeout

def debounce(wait_time: float):
    """Decorator to debounce a function"""
    def decorator(fn):
        task = None
        async def debounced(*args, **kwargs):
            nonlocal task
            if task is not None:
                task.cancel()
            task = asyncio.create_task(
                _delayed_call(wait_time, fn, *args, **kwargs)
            )
        return debounced
    return decorator

async def _delayed_call(wait_time: float, fn, *args, **kwargs):
    await asyncio.sleep(wait_time)
    return await fn(*args, **kwargs)

@debounce(wait_time=0.3)  # 300ms debounce
async def realtime_transcript_webhook(uid: str, segments: List[dict]):
    if not (toggled := user_webhook_status_db(uid, WebhookType.realtime_transcript)):
        return
    if not (webhook_url := get_user_webhook_db(uid, WebhookType.realtime_transcript)):
        return

    webhook_url = f'{webhook_url}?uid={uid}'
    async with aiohttp.ClientSession() as session:
        try:
            async with timeout(15):
                async with session.post(
                    webhook_url,
                    json={'segments': segments, 'session_id': uid},
                    headers={'Content-Type': 'application/json'}
                ) as response:
                    logging.info(
                        'Realtime transcript webhook response [url=%s]: %d',
                        webhook_url,
                        response.status
                    )
        except Exception as e:
            logging.error(
                'Realtime transcript webhook failed [url=%s]: %s',
                webhook_url,
                str(e)
            )
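One caveat with the decorator above: it keeps a single `task` in its closure, so a call for one user would cancel the pending call of another. A minimal sketch of a per-key variant follows, assuming the first argument (`uid`) identifies the stream. The name `debounce_by_key` and its `key` parameter are hypothetical and are not part of this PR.

```python
import asyncio
from typing import Any, Awaitable, Callable, Dict, Hashable


def debounce_by_key(wait_time: float, key: Callable[..., Hashable]):
    """Debounce an async function separately for each key (hypothetical helper)."""
    def decorator(fn: Callable[..., Awaitable[Any]]):
        # One pending timer per key, so different users do not cancel each other.
        pending: Dict[Hashable, asyncio.Task] = {}

        async def _delayed_call(k, args, kwargs):
            await asyncio.sleep(wait_time)
            # The quiet period is over; a later call starts a fresh timer
            # instead of cancelling the call that is about to run.
            pending.pop(k, None)
            await fn(*args, **kwargs)

        async def debounced(*args, **kwargs):
            k = key(*args, **kwargs)
            previous = pending.get(k)
            if previous is not None:
                previous.cancel()  # only the latest call within the window survives
            pending[k] = asyncio.create_task(_delayed_call(k, args, kwargs))

        return debounced
    return decorator
```

With this shape only the last call in each window runs, so a caller would need to pass the accumulated segments rather than only the newest ones, otherwise earlier segments are dropped.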

Line range hint 1-150: Consider implementing robust webhook management system.

The current webhook implementation could benefit from several architectural improvements:

Webhook Manager class to handle different webhook types consistently
Circuit breaker pattern to handle failing webhooks
Rate limiting to prevent overwhelming webhook endpoints
Retry mechanism with exponential backoff
Webhook health monitoring and automatic disable/enable

This would improve reliability and maintainability of the webhook system.

Example architecture:

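# Postponed annotations keep this sketch importable even though
# CircuitBreaker, RateLimiter and RetryStrategy are not defined here.
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum
from typing import Any, Callable, Dict, Optional
import asyncio
from datetime import datetime, timedelta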

@dataclass
class WebhookConfig:
    timeout: int
    max_retries: int
    backoff_factor: float
    circuit_breaker_threshold: int
    rate_limit_per_minute: int

class WebhookManager:
    def __init__(self):
        self._configs: Dict[WebhookType, WebhookConfig] = {}
        self._circuit_breakers: Dict[str, CircuitBreaker] = {}
        self._rate_limiters: Dict[str, RateLimiter] = {}
    
    async def send_webhook(
        self,
        webhook_type: WebhookType,
        url: str,
        payload: Any,
        retry_strategy: Optional[RetryStrategy] = None
    ) -> None:
        # Implementation details...
        raise NotImplementedError

Would you like me to create a detailed proposal for this architecture improvement?
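As an illustration of the "retry mechanism with exponential backoff" item, here is a minimal sketch. The helper name `retry_with_backoff` and its parameters are assumptions for illustration; nothing in this PR defines them.

```python
import asyncio
from typing import Awaitable, Callable, TypeVar

T = TypeVar("T")


async def retry_with_backoff(
    operation: Callable[[], Awaitable[T]],
    max_retries: int = 3,
    base_delay: float = 0.5,
    backoff_factor: float = 2.0,
) -> T:
    """Call operation, retrying on any exception with growing delays."""
    attempt = 0
    while True:
        try:
            return await operation()
        except Exception:
            if attempt >= max_retries:
                raise  # retries exhausted: surface the last error to the caller
            # Delay grows as base_delay * backoff_factor ** attempt.
            await asyncio.sleep(base_delay * (backoff_factor ** attempt))
            attempt += 1
```

A webhook sender could wrap its POST in a zero-argument coroutine function and pass it as `operation`. In practice you would probably retry only on timeouts and connection errors rather than on every exception.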

Async HTTP client and thread safety improvements needed

The codebase scan reveals several issues that confirm the review comment's concerns:

The function uses synchronous requests library despite being marked as async, which can block the event loop. All webhook-related code uses requests instead of an async HTTP client like aiohttp.
The TODO comment's concerns are valid:
- No locking mechanism exists for thread safety when handling audio bytes
- No segmentation of large audio payloads
- No URL validation for webhook endpoints
The function is called from multiple places in backend/routers/pusher.py using both asyncio.create_task() and asyncio.run_coroutine_threadsafe(), making thread safety even more critical.
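If switching to aiohttp is too large a change, one smaller option is to move the blocking call to a worker thread. The sketch below assumes Python 3.9+ (for `asyncio.to_thread`) and uses a hypothetical `blocking_post` stand-in with `time.sleep` instead of a real `requests.post`, so it runs without network access.

```python
import asyncio
import time


def blocking_post(url: str, payload: dict) -> int:
    """Stand-in for a blocking call such as requests.post (hypothetical)."""
    time.sleep(0.2)  # simulates network latency
    return 200


async def send_webhook(url: str, payload: dict) -> int:
    # Run the blocking call in a worker thread so the event loop stays free.
    return await asyncio.to_thread(blocking_post, url, payload)


async def count_ticks(stop: asyncio.Event) -> int:
    """Count how often the loop gets to run while the webhook is in flight."""
    ticks = 0
    while not stop.is_set():
        await asyncio.sleep(0.01)
        ticks += 1
    return ticks


async def demo() -> tuple:
    stop = asyncio.Event()
    ticker = asyncio.create_task(count_ticks(stop))
    status = await send_webhook("https://example.invalid/hook", {"segments": []})
    stop.set()
    return status, await ticker
```

Running `asyncio.run(demo())` should show the ticker advancing while the simulated POST is in flight. Calling `blocking_post` directly inside the coroutine would leave the tick count at zero.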

🔗 Analysis chain

Line range hint 98-113: Address TODO comment and async implementation.

The TODO comment mentions several important issues that should be addressed:

Adding a lock for thread safety
Sending shorter segments
Validating regex

Additionally, like the transcript webhook, this function should use an async HTTP client.
Would you like me to help implement these improvements and create a GitHub issue to track them?

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check if there are any other TODOs in the codebase related to webhooks
rg -i "todo.*webhook" --type py

Length of output: 33

Script:

#!/bin/bash
# Let's check for webhook-related code and async patterns
rg -A 5 "async.*webhook" --type py

# Also check for any webhook-related functions
rg -A 5 "def.*webhook" --type py

# Check for any webhook imports and HTTP client usage
rg "import.*requests|import.*aiohttp" --type py

Length of output: 11697

backend/routers/transcribe_v2.py (2)

Line range hint 2-24: Add missing imports for asyncio, time, os, json, and websockets modules

Several modules used in the code are not imported, which could lead to NameError exceptions at runtime. Please ensure all used modules are properly imported.

Apply this diff to add the missing imports:

 import uuid
 import struct
+import asyncio
+import time
+import os
+import json
+import websockets
 from datetime import datetime, timezone, timedelta
 from enum import Enum

Line range hint 417-467: Consider refactoring receive_audio function for better readability

The receive_audio function is quite lengthy and contains multiple nested conditional statements. Refactoring it into smaller helper functions or restructuring the logic can improve readability and maintainability.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between b106a9c and c9e2cf5.

📒 Files selected for processing (4)

backend/routers/pusher.py (4 hunks)
backend/routers/transcribe_v2.py (18 hunks)
backend/utils/pusher.py (1 hunks)
backend/utils/webhooks.py (4 hunks)

🧰 Additional context used

🪛 Ruff

backend/routers/pusher.py

211-211: asyncio may be undefined, or defined from star imports

(F405)

230-230: json may be undefined, or defined from star imports

(F405)

231-231: asyncio may be undefined, or defined from star imports

(F405)

232-232: asyncio may be undefined, or defined from star imports

(F405)

240-240: asyncio may be undefined, or defined from star imports

(F405)

258-258: asyncio may be undefined, or defined from star imports

(F405)

272-272: asyncio may be undefined, or defined from star imports

(F405)

275-275: asyncio may be undefined, or defined from star imports

(F405)

276-276: asyncio may be undefined, or defined from star imports

(F405)

backend/routers/transcribe_v2.py

24-24: utils.pusher.connect_to_transcript_pusher imported but unused

Remove unused import

(F401)

24-24: utils.pusher.connect_to_audio_bytes_pusher imported but unused

Remove unused import

(F401)

115-115: asyncio may be undefined, or defined from star imports

(F405)

120-120: f-string without any placeholders

Remove extraneous f prefix

(F541)

160-160: asyncio may be undefined, or defined from star imports

(F405)

237-237: asyncio may be undefined, or defined from star imports

(F405)

256-256: get_profile_audio_if_exists may be undefined, or defined from star imports

(F405)

261-261: process_audio_dg may be undefined, or defined from star imports

(F405)

265-265: process_audio_dg may be undefined, or defined from star imports

(F405)

270-270: send_initial_file_path may be undefined, or defined from star imports

(F405)

273-273: process_audio_soniox may be undefined, or defined from star imports

(F405)

279-279: process_audio_speechmatics may be undefined, or defined from star imports

(F405)

283-283: send_initial_file_path may be undefined, or defined from star imports

(F405)

313-313: asyncio may be undefined, or defined from star imports

(F405)

319-319: json may be undefined, or defined from star imports

(F405)

322-322: websockets may be undefined, or defined from star imports

(F405)

342-342: asyncio may be undefined, or defined from star imports

(F405)

351-351: websockets may be undefined, or defined from star imports

(F405)

372-372: asyncio may be undefined, or defined from star imports

(F405)

514-514: asyncio may be undefined, or defined from star imports

(F405)

515-515: asyncio may be undefined, or defined from star imports

(F405)

518-518: asyncio may be undefined, or defined from star imports

(F405)

520-520: asyncio may be undefined, or defined from star imports

(F405)

523-523: asyncio may be undefined, or defined from star imports

(F405)

backend/utils/pusher.py

53-53: websockets may be undefined, or defined from star imports

(F405)

🔇 Additional comments (3)

backend/utils/pusher.py (1)

49-58: Verify integration with debounce mechanism

The PR's objective is to implement a 300ms debounce for transcript processing, but this implementation doesn't show how it integrates with that mechanism. Please ensure this new trigger connection supports the debounce functionality.

Let's check for debounce-related code in the codebase:

🧰 Tools

🪛 Ruff

53-53: websockets may be undefined, or defined from star imports

(F405)

backend/routers/pusher.py (1)

196-293: Ensure consistent exception handling and resource cleanup in WebSocket utility

While the code handles exceptions and attempts to close the WebSocket connection properly, ensure that all exceptions are caught and resources are cleaned up to prevent potential memory leaks or dangling connections.

[approve]

To verify, you can review the exception handling blocks to make sure all possible exceptions are accounted for and the websocket.close() method is called in all exit paths.
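One way to make "close() is called on every exit path" hold by construction is an async context manager. This is a sketch only: `FakeWebSocket` is a hypothetical stand-in for the real connection object, and `closing_on_exit` is not a function from this codebase. Codes 1000 and 1011 are the standard WebSocket "normal closure" and "internal error" close codes.

```python
import asyncio
from contextlib import asynccontextmanager


class FakeWebSocket:
    """Minimal stand-in for a WebSocket connection (hypothetical)."""

    def __init__(self):
        self.closed = False
        self.close_code = None

    async def close(self, code: int = 1000):
        self.closed = True
        self.close_code = code


@asynccontextmanager
async def closing_on_exit(websocket, error_code: int = 1011):
    """Close the socket on every exit path, using error_code after a failure."""
    code = 1000
    try:
        yield websocket
    except Exception:
        code = error_code
        raise  # re-raise so callers still see the original error
    finally:
        await websocket.close(code=code)
```

The receive loop and background tasks would then run inside `async with closing_on_exit(websocket):`, so the socket is closed whether the body returns or raises.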

🧰 Tools

🪛 Ruff

211-211: asyncio may be undefined, or defined from star imports

(F405)

230-230: json may be undefined, or defined from star imports

(F405)

231-231: asyncio may be undefined, or defined from star imports

(F405)

232-232: asyncio may be undefined, or defined from star imports

(F405)

240-240: asyncio may be undefined, or defined from star imports

(F405)

258-258: asyncio may be undefined, or defined from star imports

(F405)

272-272: asyncio may be undefined, or defined from star imports

(F405)

275-275: asyncio may be undefined, or defined from star imports

(F405)

276-276: asyncio may be undefined, or defined from star imports

(F405)

backend/routers/transcribe_v2.py (1)

23-23: 🛠️ Refactor suggestion

Replace wildcard import with explicit imports from utils.stt.streaming

Using wildcard imports (from module import *) is discouraged as it can lead to namespace pollution and reduce code readability. Import specific functions or classes instead.

Apply this diff to import specific functions:

-from utils.stt.streaming import *
+from utils.stt.streaming import (
+    get_profile_audio_if_exists,
+    process_audio_dg,
+    process_audio_soniox,
+    process_audio_speechmatics,
+    send_initial_file_path,
+)

Likely invalid or redundant comment.

coderabbitai · 2024-10-30T05:02:53Z

backend/utils/pusher.py

+async def connect_to_trigger_pusher(uid: str, sample_rate: int = 8000):
+    try:
+        print("Connecting to Pusher transcripts trigger WebSocket...")
+        ws_host = PusherAPI.replace("http", "ws")
+        socket = await websockets.connect(f"{ws_host}/v1/trigger/listen?uid={uid}&sample_rate={sample_rate}")
+        print("Connected to Pusher transcripts trigger WebSocket.")
+        return socket
+    except Exception as e:
+        print(f"Exception in connect_to_transcript_pusher: {e}")
+        raise


⚠️ Potential issue

Fix incorrect log messages

The log messages and error handling reference "transcripts trigger" instead of just "trigger", and the error message references the wrong function name.

Apply these corrections:

 async def connect_to_trigger_pusher(uid: str, sample_rate: int = 8000):
     try:
-        print("Connecting to Pusher transcripts trigger WebSocket...")
+        print("Connecting to Pusher trigger WebSocket...")
         ws_host = PusherAPI.replace("http", "ws")
         socket = await websockets.connect(f"{ws_host}/v1/trigger/listen?uid={uid}&sample_rate={sample_rate}")
-        print("Connected to Pusher transcripts trigger WebSocket.")
+        print("Connected to Pusher trigger WebSocket.")
         return socket
     except Exception as e:
-        print(f"Exception in connect_to_transcript_pusher: {e}")
+        print(f"Exception in connect_to_trigger_pusher: {e}")
         raise

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

-async def connect_to_trigger_pusher(uid: str, sample_rate: int = 8000):
-    try:
-        print("Connecting to Pusher transcripts trigger WebSocket...")
-        ws_host = PusherAPI.replace("http", "ws")
-        socket = await websockets.connect(f"{ws_host}/v1/trigger/listen?uid={uid}&sample_rate={sample_rate}")
-        print("Connected to Pusher transcripts trigger WebSocket.")
-        return socket
-    except Exception as e:
-        print(f"Exception in connect_to_transcript_pusher: {e}")
-        raise
+async def connect_to_trigger_pusher(uid: str, sample_rate: int = 8000):
+    try:
+        print("Connecting to Pusher trigger WebSocket...")
+        ws_host = PusherAPI.replace("http", "ws")
+        socket = await websockets.connect(f"{ws_host}/v1/trigger/listen?uid={uid}&sample_rate={sample_rate}")
+        print("Connected to Pusher trigger WebSocket.")
+        return socket
+    except Exception as e:
+        print(f"Exception in connect_to_trigger_pusher: {e}")
+        raise

🧰 Tools

🪛 Ruff

53-53: websockets may be undefined, or defined from star imports

(F405)

coderabbitai · 2024-10-30T05:02:53Z

backend/utils/webhooks.py

+        try:
+            response = requests.post(
+                webhook_url,
+                json={
+                    'summary': summary,
+                    'uid': uid,
+                    'created_at': datetime.now().isoformat()
+                },
+                headers={'Content-Type': 'application/json'},
+                timeout=30,
+            )
+            print('day_summary_webhook:', webhook_url, response.status_code)
+        except Exception as e:
+            print(f"Error sending day summary to developer webhook: {e}")


🛠️ Refactor suggestion

Consider extracting common webhook logic to reduce duplication.

The webhook implementations share similar patterns for request handling, timeouts, and error logging. Consider extracting this to a common utility function.

async def _send_webhook(webhook_url: str, payload: dict, timeout: int = 10) -> None:
    try:
        # Note: requests.post is a blocking call, even inside an async function.
        response = requests.post(
            webhook_url,
            json=payload,
            headers={'Content-Type': 'application/json'},
            timeout=timeout,
        )
        logging.info('Webhook response [url=%s]: %d', webhook_url, response.status_code)
    except Exception as e:
        logging.error('Webhook failed [url=%s]: %s', webhook_url, str(e))

# Usage example (must be async def, because the body awaits _send_webhook):
async def day_summary_webhook(uid, summary: str):
    if not (toggled := user_webhook_status_db(uid, WebhookType.day_summary)):
        return
    if not (webhook_url := get_user_webhook_db(uid, WebhookType.day_summary)):
        return
    await _send_webhook(
        f'{webhook_url}?uid={uid}',
        {
            'summary': summary,
            'uid': uid,
            'created_at': datetime.now().isoformat()
        }
    )

coderabbitai · 2024-10-30T05:02:53Z

backend/utils/webhooks.py

+        try:
+            response = requests.post(
+                webhook_url,
+                json=memory.as_dict_cleaned_dates(),
+                headers={'Content-Type': 'application/json'},
+                timeout=30,
+            )
+            print('memory_created_webhook:', webhook_url, response.status_code)
+        except Exception as e:
+            print(f"Error sending memory created to developer webhook: {e}")


🛠️ Refactor suggestion

Consider reducing timeout and improving error logging.

While the error handling is a good addition, there are a few improvements to consider:

30 seconds is a long timeout for a webhook. Consider reducing it to 5-10 seconds to fail fast.

Replace print statements with proper logging.

+import logging
+
 def memory_created_webhook(uid, memory: Memory):
     # ...
     try:
         response = requests.post(
             webhook_url,
             json=memory.as_dict_cleaned_dates(),
             headers={'Content-Type': 'application/json'},
-            timeout=30,
+            timeout=10,
         )
-        print('memory_created_webhook:', webhook_url, response.status_code)
+        logging.info('Memory webhook response [url=%s]: %d', webhook_url, response.status_code)
     except Exception as e:
-        print(f"Error sending memory created to developer webhook: {e}")
+        logging.error('Memory webhook failed [url=%s]: %s', webhook_url, str(e))

Committable suggestion was skipped due to low confidence.

coderabbitai · 2024-10-30T05:02:53Z

backend/routers/pusher.py

+                data = await websocket.receive_bytes()
+                header_type = struct.unpack('<I', data[:4])[0]
+
+                # Transcript


⚠️ Potential issue

Add data length check before unpacking to prevent exceptions

When receiving data from the WebSocket, there is a potential for struct.unpack to raise an exception if data is less than 4 bytes in length. This can happen if the client sends insufficient data or disconnects unexpectedly.

Apply this diff to add a check before unpacking:

 while websocket_active:
     data = await websocket.receive_bytes()
+    if len(data) < 4:
+        print("Received data too short to contain header.")
+        continue
     header_type = struct.unpack('<I', data[:4])[0]

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

-data = await websocket.receive_bytes()
-header_type = struct.unpack('<I', data[:4])[0]
-
-# Transcript
+data = await websocket.receive_bytes()
+if len(data) < 4:
+    print("Received data too short to contain header.")
+    continue
+header_type = struct.unpack('<I', data[:4])[0]
+
+# Transcript
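The length check can also live in a small parsing helper, which keeps the receive loop short and makes the rule easy to unit test. This is a sketch under assumptions: `split_frame` and `HEADER_SIZE` are hypothetical names, and the meaning of the individual header values is not shown in this excerpt, so the helper leaves it to the caller.

```python
import struct
from typing import Optional, Tuple

HEADER_SIZE = 4  # matches the '<I' format: a 4-byte little-endian unsigned int


def split_frame(data: bytes) -> Optional[Tuple[int, bytes]]:
    """Return (header_type, payload), or None if the frame is too short."""
    if len(data) < HEADER_SIZE:
        return None
    (header_type,) = struct.unpack('<I', data[:HEADER_SIZE])
    return header_type, data[HEADER_SIZE:]
```

The loop would then call `split_frame(data)` and `continue` when it returns `None`.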

coderabbitai · 2024-10-30T05:02:53Z

backend/routers/pusher.py

@@ -1,4 +1,5 @@
 import uuid
+import struct


⚠️ Potential issue

Import 'asyncio' and 'json' modules to prevent NameError

The modules asyncio and json are used in the code but are not imported. This will result in a NameError when the code is executed.

Apply this diff to fix the missing imports:

 import uuid
 import struct
+import asyncio
+import json
 from datetime import datetime, timezone, timedelta
 from enum import Enum

Committable suggestion was skipped due to low confidence.

coderabbitai · 2024-10-30T05:02:53Z

backend/routers/pusher.py

+    async def send_heartbeat():
+        nonlocal websocket_active
+        nonlocal websocket_close_code
+        try:
+            while websocket_active:
+                await asyncio.sleep(20)
+                if websocket.client_state == WebSocketState.CONNECTED:
+                    await websocket.send_json({"type": "ping"})
+                else:
+                    break
+        except WebSocketDisconnect:
+            print("WebSocket disconnected")
+        except Exception as e:
+            print(f'Heartbeat error: {e}')
+            websocket_close_code = 1011
+        finally:
+            websocket_active = False
+


🛠️ Refactor suggestion

Refactor duplicated send_heartbeat functions to improve maintainability

The send_heartbeat function is duplicated across multiple WebSocket utility functions (_websocket_util_transcript, _websocket_util_audio_bytes, and _websocket_util_trigger). Refactoring this into a single function will reduce code duplication and improve maintainability.

Consider defining a shared send_heartbeat function outside the utility functions:

+async def send_heartbeat(websocket, websocket_active, websocket_close_code):
+    try:
+        while websocket_active:
+            await asyncio.sleep(20)
+            if websocket.client_state == WebSocketState.CONNECTED:
+                await websocket.send_json({"type": "ping"})
+            else:
+                break
+    except WebSocketDisconnect:
+        print("WebSocket disconnected")
+    except Exception as e:
+        print(f'Heartbeat error: {e}')
+        websocket_close_code = 1011
+    finally:
+        websocket_active = False

And then, update your utility functions to use this shared function:

 heartbeat_task = asyncio.create_task(
-    send_heartbeat()
+    send_heartbeat(websocket, websocket_active, websocket_close_code)
 )

Committable suggestion was skipped due to low confidence.
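A note on the shared-function idea: the original nested functions update `websocket_active` and `websocket_close_code` through `nonlocal`. If those values are passed as plain arguments, assignments inside the shared function only rebind its local names and the caller never sees them. A minimal sketch of one way around this uses a small mutable state object. `ConnectionState`, the `send_ping` callable, and the default close code of 1000 are assumptions for illustration, not names or values from this PR.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable


@dataclass
class ConnectionState:
    """Mutable state shared between the heartbeat and the receive loop."""
    active: bool = True
    close_code: int = 1000  # assumed default; the real initial value may differ


async def send_heartbeat(
    send_ping: Callable[[], Awaitable[None]],
    state: ConnectionState,
    interval: float = 20.0,
) -> None:
    try:
        while state.active:
            await asyncio.sleep(interval)
            await send_ping()
    except Exception as e:
        print(f'Heartbeat error: {e}')
        state.close_code = 1011  # visible to the caller via the shared object
    finally:
        state.active = False
```

Each utility function would create one `ConnectionState`, pass it to the heartbeat task, and read `state.active` and `state.close_code` in its own loop and cleanup code.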

🧰 Tools

🪛 Ruff

258-258: asyncio may be undefined, or defined from star imports

(F405)

coderabbitai · 2024-10-30T05:02:53Z

backend/routers/transcribe_v2.py

        try:
            await asyncio.sleep(delay_seconds)

            # recheck session
            memory = retrieve_in_progress_memory(uid)
            if not memory or memory['finished_at'] > finished_at:
-                print(f"_trigger_create_memory_with_delay not memory or not last session")
+                print(f"_trigger_create_memory_with_delay not memory or not last session", uid)


⚠️ Potential issue

Remove unnecessary f prefix from the print statement

The print statement is an f-string but does not contain any placeholders. Remove the f prefix to avoid confusion.

Apply this diff:

-print(f"_trigger_create_memory_with_delay not memory or not last session", uid)
+print("_trigger_create_memory_with_delay not memory or not last session", uid)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

print(f"_trigger_create_memory_with_delay not memory or not last session", uid)

print("_trigger_create_memory_with_delay not memory or not last session", uid)

🧰 Tools

🪛 Ruff

120-120: f-string without any placeholders

Remove extraneous f prefix

(F541)

coderabbitai · 2024-10-30T05:02:53Z

backend/routers/transcribe_v2.py

@@ -20,7 +21,7 @@
 from utils.stt.streaming import *
 from utils.webhooks import send_audio_bytes_developer_webhook, realtime_transcript_webhook, \
    get_audio_bytes_webhook_seconds
-from utils.pusher import connect_to_transcript_pusher, connect_to_audio_bytes_pusher
+from utils.pusher import connect_to_transcript_pusher, connect_to_audio_bytes_pusher, connect_to_trigger_pusher


⚠️ Potential issue

Remove unused imports connect_to_transcript_pusher and connect_to_audio_bytes_pusher

These imported functions are not used in the code and can be removed to clean up the import statements.

Apply this diff:

-from utils.pusher import connect_to_transcript_pusher, connect_to_audio_bytes_pusher, connect_to_trigger_pusher
+from utils.pusher import connect_to_trigger_pusher

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

from utils.pusher import connect_to_transcript_pusher, connect_to_audio_bytes_pusher, connect_to_trigger_pusher

from utils.pusher import connect_to_trigger_pusher

🧰 Tools

🪛 Ruff

24-24: utils.pusher.connect_to_transcript_pusher imported but unused

Remove unused import

(F401)

24-24: utils.pusher.connect_to_audio_bytes_pusher imported but unused

Remove unused import

(F401)

beastoin changed the title ~~Debounce transcript processing 300s for controlling event loop better~~ Debounce transcript processing 300ms for controlling event loop better Oct 29, 2024

beastoin added 4 commits October 30, 2024 10:05

Debounce transcript processing 300s for controlling event loop better

8b94451

Limit 1 connection to pusher, use /trigger/listen

cf81732

Add sample rate to trigger/listen connections

ca8e62b

Put timeout to web hook's POST request. Always set the limit to protect your system.

5709bbb

beastoin force-pushed the wuaif_external_triggers branch from 04ee1ed to 5709bbb Compare October 30, 2024 03:13

Tracking uid for /listen API

c9e2cf5

beastoin marked this pull request as ready for review October 30, 2024 04:55

coderabbitai bot reviewed Oct 30, 2024

beastoin merged commit f070bdb into main Oct 30, 2024
1 check passed

beastoin deleted the wuaif_external_triggers branch October 30, 2024 07:15

This was referenced Nov 7, 2024

Use async with pusher connects #1263

Merged

Fix pusher ws close #1264

Merged
