with_clean_env / with_original_env is not threadsafe? #4232

jjb · 2016-01-24T22:43:22Z

I have a script that fires off many thread which each run a shell commands. Here is the code which invokes the shell commands:

Bundler.with_clean_env { `#{command}` }

On one of my runs I got this error:

RuntimeError: can't add a new key into hash during iteration
                        []= at org/jruby/RubyHash.java:1007
                    replace at org/jruby/RubyHash.java:1744
          with_original_env at /Users/john/.rbenv/versions/jruby-9.0.4.0/lib/ruby/gems/shared/gems/bundler-1.11.2/lib/bundler.rb:198
             with_clean_env at /Users/john/.rbenv/versions/jruby-9.0.4.0/lib/ruby/gems/shared/gems/bundler-1.11.2/lib/bundler.rb:205

The suspicious bundler code is here. Line 198 refers to ENV.replace(ORIGINAL_ENV):

    def with_original_env
      bundled_env = ENV.to_hash
      ENV.replace(ORIGINAL_ENV)
      yield
    ensure
      ENV.replace(bundled_env.to_hash)
    end

    def with_clean_env
      with_original_env do
        ENV["MANPATH"] = ENV["BUNDLE_ORIG_MANPATH"]
        ENV.delete_if {|k, _| k[0, 7] == "BUNDLE_" }

        if ENV.key?("RUBYOPT")
          ENV["RUBYOPT"] = ENV["RUBYOPT"].sub "-rbundler/setup", ""
        end

        if ENV.key?("RUBYLIB")
          rubylib = ENV["RUBYLIB"].split(File::PATH_SEPARATOR)
          rubylib.delete(File.expand_path("..", __FILE__))
          ENV["RUBYLIB"] = rubylib.join(File::PATH_SEPARATOR)
        end

        yield
      end
    end

Subsequent runs of my script never caused the problem again.

I tried writing a script to stress test this race condition and reproduce the problem but I wasn't able to.

I'm not entirely sure where in that code (or elsewhere) the problem is originating. Is it simply that any two threads in a runtime can't iterate over and add a new key to a hash at the same time? Or does the violating key addition have to happen within the actual iteration block? If the former then I think it's simply a matter of a low-probability race condition. If the latter, then I have no idea where this could be occurring.

The text was updated successfully, but these errors were encountered:

indirect · 2016-01-24T22:48:46Z

It's impossible for those methods to be threadsafe. They modify global variables. Adding a mutex wouldn't help because that simply ensures that the threads have to take turns modifying global state. You'll need to handle the thread safety yourself if you call those methods.

indirect · 2016-01-24T22:50:28Z

Oops, after reading your ticket again, I think a mutex would prevent this particular exception. My bad, sorry.

However, the underlying problem would still be there, even though the exception would not be. If you're using these methods in threads, you need to make sure they have exclusive execution because they modify global state.

segiddins · 2016-01-24T23:17:39Z

@indirect since the methods take blocks, we could add our own mutex inside them, a BUNDLER_ENV_MUTEX of sorts

jjb · 2016-01-25T00:08:42Z

the underlying problem would still be there, even though the exception would not be. If you're using these methods in threads, you need to make sure they have exclusive execution because they modify global state

ahh, gotcha. I just rewrote my script so that there is only one invocation of Bundler.with_clean_env in which everything is run, and that works fine.

indirect · 2016-01-25T02:39:03Z

@segiddins think we should? the mutex will be held the entire time the subcommand is running, which effectively undoes threading....

indirect · 2016-01-25T02:40:13Z

I guess we could release the mutex by forking, setting the env back, and then releasing the lock? That sounds... much more complicated than what we have now. 😝

njam · 2016-02-15T09:12:53Z

Maybe there should be a Bundler.clean_env method that just returns the clean/original environment?

Then other ruby applications could retrieve this and pass it on for when launching new processes (e.g. with Process.exec(env, command)), and we wouldn't change the state of any environment variables in the parent process.

(there is Bundler.ORIGINAL_ENV - but is it public API? And what is the difference between "clean_env" and "original_env"?)

segiddins · 2016-02-15T15:19:49Z

I think a Bundler.clean_env method would be fine?

njam · 2016-02-15T15:35:10Z

Agree!
And what is the difference between "clean_env" and "original_env"? Shouldn't there be just 1 mechanism that resets the environment to how it was before bundler modified it? If the previous environment contained bundler-related variables then I think there's no reliable way to revert those back.
(sorry for the off topic)

indirect · 2016-02-15T21:36:01Z

At one time, the difference was that one went up one layer of env, while the other removed all Bundler values. I'm not sure if that's still true today. A Bundler.clean_env method seems great to me.

Introduce "Bundler.clean_env" Via #4232 Which should return the environment as it was *before* bundler modified it. Should this use "original_env" or "clean_env" or both? > At one time, the difference was that one went up one layer of env, while the other removed all Bundler values. I'm not sure if that's still true today. You can assign this ticket to me if you want.

RochesterinNYC · 2016-03-05T16:30:15Z

Closed by #4303.

njam mentioned this issue Feb 16, 2016

Introduce "Bundler.clean_env" #4303

Merged

RochesterinNYC closed this as completed Mar 5, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

with_clean_env / with_original_env is not threadsafe? #4232

with_clean_env / with_original_env is not threadsafe? #4232

jjb commented Jan 24, 2016

indirect commented Jan 24, 2016

indirect commented Jan 24, 2016

segiddins commented Jan 24, 2016

jjb commented Jan 25, 2016

indirect commented Jan 25, 2016

indirect commented Jan 25, 2016

njam commented Feb 15, 2016

segiddins commented Feb 15, 2016

njam commented Feb 15, 2016

indirect commented Feb 15, 2016

RochesterinNYC commented Mar 5, 2016

with_clean_env / with_original_env is not threadsafe? #4232

with_clean_env / with_original_env is not threadsafe? #4232

Comments

jjb commented Jan 24, 2016

indirect commented Jan 24, 2016

indirect commented Jan 24, 2016

segiddins commented Jan 24, 2016

jjb commented Jan 25, 2016

indirect commented Jan 25, 2016

indirect commented Jan 25, 2016

njam commented Feb 15, 2016

segiddins commented Feb 15, 2016

njam commented Feb 15, 2016

indirect commented Feb 15, 2016

RochesterinNYC commented Mar 5, 2016