[Grace Kelly] Exceptions in `call_runner` cause thread pool to hang #1214

drewbanin · 2019-01-05T19:39:03Z

Issue

Issue description

With dbt's new approach to multithreading, exceptions raised in runner.call_runner cause the thread pool to hang. I'm having a hard time finding the expected behavior of a thread pool when a worker raises an exception, but it appears that 1) the exception message is swallowed and 2) the pool ends up blocking on the next get() call.

I can think of two ways to work around this:

methods called by call_runner should never raise
a wrapper function around call_runner should be provided which logs exceptions but does not raise them

This is the general idea behind the bug, but the specific situation where is related to the raise_on_first_error attribute of certain node runners. The CompileRunner has this value set to True, so exceptions during compilation cause an exception to be raise in the call_runner method. No other runners have raise_on_first_error enabled, so I'm only able to reproduce this bug with dbt compile (and dbt docs generate, which invokes dbt compile).

In addition to ensuring that call_runner never raises (or handling the exceptions in the pool, if that's possible), we should do something about raise_on_first_error. I'm unsure how useful this attribute is in its current incarnation, and I think I'd be ok with just removing it. The other option is to keep that attribute, and then flip some flag that ultimately terminates the pool from the main thread.

Results

The thread pool blocked on a get() call after an exception was raised in dbt compile.

System information

dbt version: 0.12.2-rc1
python version: 3.7.1
python path: /Users/drew/fishtown/dbt/env/bin/python3
os info: Darwin-18.2.0-x86_64-i386-64bit

Steps to reproduce

This bug manifests when a CompilationError exists at "compile time". Crucially though, most compile-time errors are also parse-time errors. Here's an example of a model that will succeed at parse-time (allowing dbt to proceed to compilation), but that will fail at compile time;

-- models/my_model.sql
{% if execute %}
    select * from {{ ref('notfound') }}
{% else %}
    select 1 as id
{% endif %}

Then run:

$ dbt compile

The text was updated successfully, but these errors were encountered:

…-errors move cleanup into the executing thread (#1214)

drewbanin added the bug Something isn't working label Jan 5, 2019

drewbanin added this to the Grace Kelly milestone Jan 5, 2019

drewbanin mentioned this issue Jan 5, 2019

(fixes 1214) Do not raise an exception in call_runner #1215

Closed

beckjake mentioned this issue Jan 7, 2019

move cleanup into the executing thread (#1214) #1218

Merged

beckjake added a commit that referenced this issue Jan 7, 2019

Merge pull request #1218 from fishtown-analytics/fix/lets-not-hang-on…

7179d13

…-errors move cleanup into the executing thread (#1214)

drewbanin closed this as completed Jan 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Grace Kelly] Exceptions in `call_runner` cause thread pool to hang #1214

[Grace Kelly] Exceptions in `call_runner` cause thread pool to hang #1214

drewbanin commented Jan 5, 2019

[Grace Kelly] Exceptions in call_runner cause thread pool to hang #1214

[Grace Kelly] Exceptions in call_runner cause thread pool to hang #1214

Comments

drewbanin commented Jan 5, 2019

Issue

Issue description

Results

System information

Steps to reproduce

[Grace Kelly] Exceptions in `call_runner` cause thread pool to hang #1214

[Grace Kelly] Exceptions in `call_runner` cause thread pool to hang #1214