Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

View transcript agent #70

Open
1 task done
ashish-spext opened this issue Nov 14, 2024 · 3 comments
Open
1 task done

View transcript agent #70

ashish-spext opened this issue Nov 14, 2024 · 3 comments
Assignees
Labels
agent backend enhancement New feature or request

Comments

@ashish-spext
Copy link
Contributor

Confirm this is a new agent request

  • I've checked the current issues, and there's no record of this agent request

Describe the agent

This agent can simply get the transcript of the video and add it into the context for other agents.

Additional Context

No response

@ashish-spext ashish-spext added enhancement New feature or request backend agent labels Nov 14, 2024
@sarfarazsiddiquii
Copy link

Hi @ashish-spext, Is this issue open for contributions?
If so, can you assign it to me? I’d like to work on it.

@ashish-spext
Copy link
Contributor Author

Sure @sarfarazsiddiquii

For v1 let's implement basic agent with

  1. Spoken Index Check:

    • The agent indexes spoken words if the index does not exists.
  2. Default Mode - Text Transcription:

    • It should send the transcription text as TextContent.
  3. Timestamp Mode (Optional):

    • If the user requests transcription with timestamps, it should group the transcript into specified time range (default 2 minutes).
    • It should format the above clubbed text with timestamps and send it as TextContent.
  4. Transcript Context for LLMs:

    • The agent should return transcript dictionary in its response.

@sarfarazsiddiquii
Copy link

Hey @ashish-spext, I made a pull request regarding this issue. Let me know if any changes are required.
Also, I would love to contribute to other open issues as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
agent backend enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants