Process doesn't exit / hangs at the end on Windows #64
Comments
Thank you for reporting this issue. I was able to reproduce it and will investigate what is blocking the termination.

Regarding model files, I am using Hugging Face and have uploaded some models for CTranslate2. You can create an account and upload your model files there. This code snippet downloads the model files and returns the directory path:

```rust
let api = hf_hub::api::sync::Api::new()?;
let repo = api.model("<your account name>/<repo>".to_string());
let mut res = None;
for f in repo.info()?.siblings {
    let path = repo.get(&f.rfilename)?;
    if res.is_none() {
        res = path.parent().map(PathBuf::from);
    }
}
// path to the directory that contains the model files
res.ok_or_else(|| anyhow!("no model files are found"))
```
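For convenience, the same logic can be wrapped into a self-contained function (a minimal sketch; the function name `download_model` and the `main` caller are mine, everything else follows the snippet above and assumes the `hf_hub` and `anyhow` crates):

```rust
use std::path::PathBuf;

use anyhow::anyhow;

/// Downloads every file of a Hugging Face model repository and returns the
/// local directory that contains them.
fn download_model(repo_id: &str) -> anyhow::Result<PathBuf> {
    let api = hf_hub::api::sync::Api::new()?;
    let repo = api.model(repo_id.to_string());
    let mut res = None;
    for f in repo.info()?.siblings {
        // Downloads the file (or reuses the cached copy) and returns its local path.
        let path = repo.get(&f.rfilename)?;
        if res.is_none() {
            res = path.parent().map(PathBuf::from);
        }
    }
    res.ok_or_else(|| anyhow!("no model files are found"))
}

fn main() -> anyhow::Result<()> {
    // The repository id is only a placeholder.
    let model_dir = download_model("<your account name>/<repo>")?;
    println!("model files are in {}", model_dir.display());
    Ok(())
}
```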
Let me know if you have any ideas about where the problem might be. I tried to debug it but couldn't find a good way to do so on Windows. If you find a way, I'd appreciate it if you could share your insights.
Thanks for the code. Eventually I created a custom downloader, since I needed progress callbacks.
It looks like the process gets blocked at https://github.com/OpenNMT/CTranslate2/blob/master/src/thread_pool.cc#L106, where the worker threads are joined. However, I'm not sure why this happens, since the worker threads appear to end correctly.
I've identified the root cause of the issue. As I initially suspected, cxx or the FFI wasn't freeing the model class properly at the end (it doesn't call the destructor). Typically, I implement Drop for FFI objects when using bindgen. While I'm not entirely sure how to do this with cxx, adding the following snippet at the end of the Whisper example resolved the issue:

```rust
unsafe {
    std::ptr::drop_in_place(model.ptr.into_raw());
}
```

Make sure to expose the ptr in whisper.rs like:

```rust
pub struct Whisper {
    model: OsString,
    pub ptr: UniquePtr<ffi::Whisper>,
}
```
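For reference, the same workaround could also be expressed as a Drop impl on that struct, so callers don't need the manual unsafe block (a hedged sketch, not how the crate actually does it; it assumes, as observed above, that drop_in_place on the raw pointer runs the C++ destructor, and it uses cxx's UniquePtr::null and into_raw to take ownership of the pointer inside drop):

```rust
impl Drop for Whisper {
    fn drop(&mut self) {
        // Take the UniquePtr out of the struct so it can be consumed here;
        // the struct is left holding a null pointer.
        let ptr = std::mem::replace(&mut self.ptr, cxx::UniquePtr::null());
        if !ptr.is_null() {
            // Explicitly run the C++ destructor. In this issue, skipping it
            // left the process hanging on exit (note that this does not free
            // the heap allocation itself).
            unsafe { std::ptr::drop_in_place(ptr.into_raw()) };
        }
    }
}
```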
Thank you for investigating this issue. Your suggestion works for me and resolves the hang, although bypassing the normal release of the model is a bit concerning. I opened PR #74 and will merge it if there appear to be no resource leaks.
After checking it again, I think that it doesn't release the memory.
Thank you for testing. So, it seems we need to solve the thread issue. Maybe these are related, even though we don't use dynamic libraries:
With the workaround implemented in #74, VRAM/RAM are not released even though Translator/Generator/Whisper are dropped. However, the RAM is released when the main process is terminated. I think this is still better than having the process blocked forever. So, I'll merge the PR.
Hi @jkawamoto, any news about this issue?
If I use CUDA, the process does not get blocked for me, but if I use the CPU, it still gets blocked. It's still a mystery why joining the threads blocks in some cases.
I tried examples/nllb.rs and it works, but when the translation finishes, the process hangs for a few minutes, and Ctrl+C doesn't exit it.
Is there some cleanup going on? Can we speed it up?

Another question: is there a place I can download the needed model folder for translation that will work for mac/windows/linux? I use facebook/nllb-200-distilled-600M. I tried to zip the folder created on mac and use it on windows, but it didn't work; I had to use transformers to create it on each platform. The goal is to have something I can use right away for a cross-platform desktop app for offline translations.

Thanks for this amazing library!

Update
Looks like it hangs on drop(t), where t is the translator instance.
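One quick way to confirm that the drop is where the time goes is to time it explicitly (a rough sketch; `t` is the translator instance from the example, only std is used):

```rust
use std::time::Instant;

let start = Instant::now();
drop(t); // the call that appears to block
println!("drop(t) returned after {:?}", start.elapsed());
```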