[TASK][JNI] Investigate train of `null_count` after `explode` #11923

abellina · 2022-10-14T16:22:33Z

While analyzing an nsys trace for a Spark job with deeply nested tables, we see an explode kernel call that is followed by a train of null_count calls, which end in is_valid.

After we call cudf::explode we build up a table and construct Java ColumnVector objects. I think the construction of these objects is what triggers the null_count calls.

This task is to confirm that the columns missing a null count are coming from the explode kernels. If they are, it would be great if explode could compute the null count as part of that kernel.

In this screenshot, it is the ~20ms at the end after explode:


GregoryKimball · 2022-10-21T20:17:55Z

I'd like to cross-reference this issue with #11968. It's likely that the null_count appearances in profiles will change as we refactor null_count for compatibility with user-provided streams.

jrhemstad · 2022-10-25T14:52:45Z

> it would be great if explode could compute null count as part of that kernel.

I'm not following. explode returns a table:

cudf/cpp/include/cudf/lists/explode.hpp

Line 72 in 7d173c9

std::unique_ptr<table> explode(

Are you then constructing a column_view for each of the lists that were exploded?

If so, then yeah, you're going to have a problem with computing the null count of each of those column_views individually.

To make that efficient, you'd have to do what we do in cudf::split where we compute the individual null counts in bulk with a single segmented_null_count:

cudf/cpp/include/cudf/detail/null_mask.hpp

Lines 186 to 204 in 9c06330

    
    /**
     * @brief Given a validity bitmask, counts the number of null elements (unset
     * bits) in every range `[indices[2*i], indices[(2*i)+1])` (where 0 <= i <
     * indices.size() / 2).
     *
     * If `bitmask == nullptr`, all elements are assumed to be valid and a vector of
     * length `indices.size()` containing all zeros is returned.
     *
     * @throws cudf::logic_error if `indices.size() % 2 != 0`
     * @throws cudf::logic_error if `indices[2*i] < 0 or indices[2*i] > indices[(2*i)+1]`
     *
     * @param[in] bitmask Validity bitmask residing in device memory.
     * @param[in] indices A host_span of indices specifying ranges to count the number of null elements.
     * @param[in] stream CUDA stream used for device memory operations and kernel launches.
     * @return A vector storing the number of null elements in each specified range.
     */
    std::vector<size_type> segmented_null_count(bitmask_type const* bitmask,
                                                host_span<size_type const> indices,
                                                rmm::cuda_stream_view stream);
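To make the range semantics in that doc comment concrete, here is a minimal host-only sketch of what a segmented null count computes. It is not the libcudf implementation (which runs on the device and takes a stream). The name `segmented_null_count_host`, the helper `bit_is_set`, and the use of `std::vector` instead of `host_span` are hypothetical, and the sketch assumes 32-bit mask words with least-significant-bit-first ordering. It returns one count per range.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Assumption: 32-bit mask words and 32-bit signed sizes.
using bitmask_type = std::uint32_t;
using size_type    = std::int32_t;

// Hypothetical helper: true if bit `i` is set, i.e. element `i` is valid.
// Assumes least-significant-bit-first ordering within each word.
inline bool bit_is_set(bitmask_type const* bitmask, size_type i)
{
  assert(bitmask != nullptr);
  return ((bitmask[i / 32] >> (i % 32)) & 1u) != 0u;
}

// Hypothetical host-side analogue: counts unset bits in every half-open range
// [indices[2*i], indices[2*i+1]) and returns one count per range.
std::vector<size_type> segmented_null_count_host(bitmask_type const* bitmask,
                                                 std::vector<size_type> const& indices)
{
  if (indices.size() % 2 != 0) {
    throw std::invalid_argument("indices must hold begin/end pairs");
  }
  std::vector<size_type> counts(indices.size() / 2, 0);
  for (std::size_t i = 0; i < counts.size(); ++i) {
    size_type const begin = indices[2 * i];
    size_type const end   = indices[2 * i + 1];
    if (begin < 0 || begin > end) { throw std::invalid_argument("invalid range"); }
    if (bitmask == nullptr) { continue; }  // no mask: every element is valid
    for (size_type bit = begin; bit < end; ++bit) {
      if (!bit_is_set(bitmask, bit)) { ++counts[i]; }
    }
  }
  return counts;
}
```

The point of the bulk form is that all ranges are handled in one pass (one kernel launch in the device version), rather than one null_count per column_view.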

abellina added feature request New feature or request Needs Triage Need team to review and classify Spark Functionality that helps Spark RAPIDS labels Oct 14, 2022

GregoryKimball added this to the Enable streams milestone Oct 21, 2022

GregoryKimball added libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue and removed feature request New feature or request Needs Triage Need team to review and classify labels Oct 21, 2022

GregoryKimball removed this from the Enable streams milestone Oct 21, 2022

ttnghia mentioned this issue Aug 2, 2024

[FEA] Support batch construction of strings columns #16486

Closed

GregoryKimball added this to libcudf Oct 23, 2024

GregoryKimball moved this to Needs owner in libcudf Oct 23, 2024
