Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ability to output time tags for each word when assessing audio of multiple languages + the Quran. #24

Open
osamaramihafez opened this issue Jul 1, 2023 · 0 comments

Comments

@osamaramihafez
Copy link

osamaramihafez commented Jul 1, 2023

Salam, I ignored the preset message to ask questions on discourse.mozzilla because to my understanding that's not specific to this project, but rather the repo from which this project was forked.

I'm very new to speech recognition and have not really learned any ML (something I do plan on learning in the future). So my area of understanding doesn't really align with this project.

I'm looking for a tool that is able to timestamp a video based on the recitation of verses from the Quran. However, the videos I'm dealing with could contain two languages (English + Arabic). The idea would be to run a speech recognition tool that is able to do some timestamping based on when the speaker recites a verse of the Quran. I noticed that in one of the readme files it mentions the ability to "also output time tags for each word". I'm wondering if having a ~2hr video where verses of the Quran are only a small portion of the audio would be possible to timestamp... or does the audio have to be strictly Quran?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant