Youtube ingestion doesn't work #1648
Comments
I fixed CPU mode in #1643. However, yes, CPU ASR is very slow. I tried the same video you used in CPU vs. GPU, and CPU ASR runs for a while on my 8-core i9 system, maybe 2 minutes, so I don't think ASR is practical on CPU. After that, though, things actually hang. Logs are like this for me using CPU:
It seems to be stuck in DocTR for me. I should disable DocTR in CPU mode.
Hi,
YouTube ingestion doesn't work. It's converting to audio but 3).
[download] Destination: /tmp/gradio/6ce6b3f2-2e2a-4a1f-81e8-a58785d61295/What Is LangChain? - LangChain + ChatGPT Overview.m4a
[download] 100% of 5.92MiB in 00:00:00 at 39.03MiB/s
[FixupM4a] Correcting container of "/tmp/gradio/6ce6b3f2-2e2a-4a1f-81e8-a58785d61295/What Is LangChain? - LangChain + ChatGPT Overview.m4a"
[ExtractAudio] Not converting audio /tmp/gradio/6ce6b3f2-2e2a-4a1f-81e8-a58785d61295/What Is LangChain? - LangChain + ChatGPT Overview.m4a; file is already in target format m4a
Transcribing part /tmp/gradio/6ce6b3f2-2e2a-4a1f-81e8-a58785d61295/What Is LangChain? - LangChain + ChatGPT Overview.m4a!
Due to a bug fix in huggingface/transformers#28687 transcription using a multilingual Whisper will default to language detection followed by transcription instead of translation to English. This might be a breaking change for your use case. If you want to instead always translate your audio to English, make sure to pass language='en'.
It runs for a long time and is then killed automatically after some time.
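For reference, here is a minimal sketch of how the language hint from that warning could be passed when calling Whisper through the transformers ASR pipeline directly. This is not how h2oGPT wires it up internally; the helper names and the model name "openai/whisper-small" are assumptions for illustration only.

```python
def whisper_generate_kwargs(language="en", task="transcribe"):
    """Build the generate_kwargs that pin Whisper's language and task.

    Hypothetical helper: pinning the language skips the automatic
    language detection mentioned in the warning.
    """
    return {"language": language, "task": task}


def transcribe(audio_path, model_name="openai/whisper-small"):
    """Transcribe one audio file (model_name is an assumed example)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_name,
        chunk_length_s=30,  # split long audio into 30 s chunks
    )
    result = asr(audio_path, generate_kwargs=whisper_generate_kwargs())
    return result["text"]
```

On CPU this will still be slow for long videos; the sketch only shows where the language setting goes.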
@pseudotensor I think it's an issue with the database. Can you please confirm? Running python src/make_db.py I'm getting:
Exceptions: 0/0 []
Traceback (most recent call last):
  File "/home/anushaharish538/as/h2ogpt/src/make_db.py", line 403, in <module>
    H2O_Fire(make_db_main)
  File "/home/anushaharish538/as/h2ogpt/src/utils.py", line 73, in H2O_Fire
    fire.Fire(component=component, command=args)
  File "/home/anushaharish538/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/fire/core.py", line 141, in Fire
    component_trace = _Fire(component, args, parsed_flag_args, context, name)
  File "/home/anushaharish538/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/fire/core.py", line 475, in _Fire
    component, remaining_args = _CallAndUpdateTrace(
  File "/home/anushaharish538/miniconda3/envs/h2ogpt/lib/python3.10/site-packages/fire/core.py", line 691, in _CallAndUpdateTrace
    component = fn(*varargs, **kwargs)
  File "/home/anushaharish538/as/h2ogpt/src/make_db.py", line 389, in make_db_main
    assert len(sources) > 0 or not fail_if_no_sources, "No sources found"
AssertionError: No sources found
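To make the failing check easier to read, here is a small standalone sketch of the assertion shown in the traceback. The function name check_sources is hypothetical; only the assert expression and the fail_if_no_sources name come from the traceback above.

```python
def check_sources(sources, fail_if_no_sources=True):
    """Reproduce the check from make_db_main in isolation.

    The assertion fails only when no documents were loaded AND the
    caller asked for a hard failure in that case.
    """
    assert len(sources) > 0 or not fail_if_no_sources, "No sources found"
    return len(sources)


if __name__ == "__main__":
    print(check_sources(["doc.pdf"]))                  # one source loaded
    print(check_sources([], fail_if_no_sources=False)) # empty, but tolerated
    try:
        check_sources([])                              # empty and strict
    except AssertionError as exc:
        print("failed:", exc)
```

So the error by itself only says that the loaders returned nothing. It does not show which ingestion step produced the empty result.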
The issue is only with YouTube. PDFs are working fine.