Dealing with blocking. #22

ghost · 2013-11-02T19:44:46Z

I was doing a few experiments with running hundreds of tasks each enqueueing and waiting on kernels before sending data back to the main task. And this ran head first into an expected problem with light weight threads. You can't block them and expect things to work correctly.

OpenCL has one way of working around this, The use of a callbacks. If you can setup a callback to do a message send when an event completes there is no need to do a event wait.

This would look something like this:

extern fn trampoline(_: cl_event, _: cl_int, arg: *libc::c_void)
{
    println(format!("trampoline"));
    let f = unsafe {
        let f: ~Cell<&fn()> = cast::transmute(arg);
        f.take()
    };

    f();
}

impl Event {
    #[fixed_stack_segment] #[inline(never)]
    fn callback(&self, cn_type :cl_uint, f: extern "C" fn(cl_event, cl_int, *libc::c_void), arg: *libc::c_void)
    {
        unsafe {
            clSetEventCallback(self.event,
                               cn_type,
                               f,
                               arg);
        }
    }

    fn rwait(&self) {
        let (p, c) : (PortOne<()>, ChanOne<()>) = oneshot();
        let c = Cell::new(c);
        let f = ~Cell::new(|| {c.take().send(())});
        unsafe {
            self.callback(CL_COMPLETE, trampoline, cast::transmute(f));
        }
        p.recv();
    }
}

I would not be writing this as an issue if that worked. The rust scheduler relies on some thread local storage to be set for it to work. So as soon as the trampoline fires an abort() occurs.

OpenCL does not offer a callback to initialize it's threads, so I can't see an easy way to set this thread local storage.

There are probably a few alternatives that could work instead of using channels. Using a pipe could work, the callback writes a byte to it, and the waiting function uses the native rust runtime read on the pipe. That sounds like an extremely round about and slow mechanise.

The text was updated successfully, but these errors were encountered:

eholk · 2013-11-04T15:59:51Z

If you can make this work with pipes, that's the way to go. The last time I measured, the overhead of sending pipe messages was very small, and this is the desired mechanism in Rust for synchronizing tasks.

Along these lines, it seems like we should be careful to make sure the Rust OpenCL context only runs from one task or thread. We make this happen now by using RUST_THREADS=1, but this is less than ideal. Unfortunately, there isn't a good way yet of making sure there is only one provider of a service in Rust.

ghost · 2013-11-04T23:05:52Z

I think the RUST_THREADS=1 is mostly caused by the clGetPlatforms api failing. From my understanding of Appendix A.2, it is safe to share the context pointer. Buffers and kernels are obviously not thread safe. But I'm surprised that the command queue is also considered unsafe.

I think the best route might be to just use ownership via ~. With how the API is designed right now it would just be a matter of not implementing the Clone trait to prevent more then one task from accessing it.

eholk · 2013-11-05T17:34:51Z

One difficulty is still making sure only one task tries to create an OpenCL
object. I guess we could use an atomic option to say whether a context
already exists and just fail otherwise. The clients could take care of
sharing it by message passing if they need to.

On Mon, Nov 4, 2013 at 6:05 PM, Colin Sherratt notifications@github.comwrote:

I think the RUST_THREADS=1 is mostly caused by the clGetPlatforms api
failing. From my understanding of Appendix A.2, it is safe to share the
context pointer. Buffers and kernels are obviously not thread safe. But I'm
surprised that the command queue is also considered unsafe.

I think the best route might be to just use ownership via ~. With how the
API is designed right now it would just be a matter of not implementing
the Clone trait to prevent more then one task from accessing it.

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/22#issuecomment-27732317
.

ghost · 2013-11-10T18:36:57Z

I played with adding a spinlock to work around this issue earlier this week. https://github.com/csherratt/rust-opencl/commit/a6258024bf76ca6001db73d52d971515e736bb1c

I'd rather not push these changes up because the fix is pretty ugly. If rust-lang/rust#9105 is resolved I think this could work.

ghost · 2014-01-14T05:37:39Z

With the new libgreen/libnative split and the fact that #9105 is complete it should be possible to resolve this issue with a little work. I might investigate this later this week. It'd be nice to remove the RUST_THREADS=1 for the make check.

eholk · 2014-12-26T19:07:03Z

Is this issue still active? I notice we have most things protected by mutexes, and cargo test works just fine for me without RUST_THREADS=1.

ghost · 2014-12-27T05:12:13Z

This is actually no longer relevant. Rust no longer supports light weight threads so what opencl does by default is the correct behavior.

ghost closed this as completed Dec 27, 2014

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dealing with blocking. #22

Dealing with blocking. #22

ghost commented Nov 2, 2013

eholk commented Nov 4, 2013

ghost commented Nov 4, 2013

eholk commented Nov 5, 2013

ghost commented Nov 10, 2013

ghost commented Jan 14, 2014

eholk commented Dec 26, 2014

ghost commented Dec 27, 2014

Dealing with blocking. #22

Dealing with blocking. #22

Comments

ghost commented Nov 2, 2013

eholk commented Nov 4, 2013

ghost commented Nov 4, 2013

eholk commented Nov 5, 2013

ghost commented Nov 10, 2013

ghost commented Jan 14, 2014

eholk commented Dec 26, 2014

ghost commented Dec 27, 2014