refactor(rust): Refactor compute kernels in polars-arrow to avoid using gather #19669

nameexhaustion · 2024-11-06T22:23:40Z

Before we can move the gather logic to polars-compute we need to remove all uses of it in polars-arrow, as it will no longer be accessible in polars-arrow after the move.

nameexhaustion · 2024-11-06T22:29:55Z

crates/polars-arrow/src/compute/cast/mod.rs

        }
-        let take_values = unsafe {
-            crate::compute::take::take_unchecked(list.values().as_ref(), &indices.freeze())


Casting nullable List -> FixedSizeList, used a gather to ensure the width of the null slots - have updated this to use Growable instead.

nameexhaustion · 2024-11-06T22:31:41Z

crates/polars-arrow/src/legacy/kernels/fixed_size_list.rs

    }

    let values = arr.values();
-    // SAFETY:
-    // the indices we generate are in bounds
-    unsafe { Ok(take_unchecked(&**values, &take_by)) }


list.get() / array.get() were building selection indices and then calling gather with them - I've re-written them to use loops instead.

nameexhaustion · 2024-11-06T23:46:45Z

py-polars/tests/unit/operations/namespaces/array/test_array.py

        out = s.arr.get(100, null_on_oob=False)

+    with pytest.raises(ComputeError, match="get index -3 is out of bounds"):


drive-by - print the oob index in error message

codecov · 2024-11-07T00:39:08Z

Codecov Report

Attention: Patch coverage is 91.92547% with 13 lines in your changes missing coverage. Please review.

Project coverage is 79.73%. Comparing base (8335f75) to head (5dff00a).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...polars-arrow/src/legacy/kernels/fixed_size_list.rs	87.37%	13 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main   #19669   +/-   ##
=======================================
  Coverage   79.72%   79.73%           
=======================================
  Files        1542     1542           
  Lines      212208   212232   +24     
  Branches     2449     2449           
=======================================
+ Hits       169182   169220   +38     
+ Misses      42472    42458   -14     
  Partials      554      554

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ritchie46 · 2024-11-07T12:09:07Z

There might be some performance implications in those rewrites. It's hard to tell. I think we actually should move the cast to polars-compute as well. Then the dependency problem is resolved.

…ng gather

nameexhaustion · 2024-11-07T23:58:13Z

There might be some performance implications in those rewrites. It's hard to tell. I think we actually should move the cast to polars-compute as well. Then the dependency problem is resolved.

I tried to move the casting code, but I don't think it's possible as the cast is currently used by ArrowArray::new() in the polars-arrow crate -

polars/crates/polars-arrow/src/ffi/array.rs

Lines 111 to 114 in 3cdb7c2

    
           let variadic_buffer_sizes = if needs_variadic_buffer_sizes { 
        
               #[cfg(feature = "compute_cast")] 
        
               { 
        
                   let arr = crate::compute::cast::cast_unchecked(

From benchmarking, the PR as it is improves list->array casting performance, while regressing on list.get() / array.get() performance -

# DF
shape: (20_000_000, 2)
┌──────────┬─────────────────────────────────┐
│ i64      ┆ list                            │
│ ---      ┆ ---                             │
│ i64      ┆ list[i64]                       │
╞══════════╪═════════════════════════════════╡
│ 6765403  ┆ [6765403, 6765403, … 6765403]   │
│ 16059030 ┆ [16059030, 16059030, … 1605903… │
...
# This PR
cast list->array 0.7143477080389857
list.get(i64) 0.2533301250077784
arr.get(col(indices)) 0.860774208791554
# 1.12.0
cast list->array 1.0722814579494298
list.get(i64) 0.2003235830925405
arr.get(col(indices)) 0.5676086670719087

I think for list.get() / array.get(), switching to the growable introduced dynamic dispatch for every row, is the cause of the performance regression. On the other hand cast() performance improved as it was already dynamic dispatch, but we removed the extra step of materializing selection indices.

I think, maybe I can leave the cast in polars-arrow, but spend some time resolving the list.get() performance, before continuing with moving the gather() to polars-compute?

github-actions bot added internal An internal refactor or improvement rust Related to Rust Polars labels Nov 6, 2024

nameexhaustion commented Nov 6, 2024

View reviewed changes

nameexhaustion marked this pull request as ready for review November 7, 2024 02:39

nameexhaustion requested review from ritchie46, c-peters, alexander-beedie, MarcoGorelli, reswqa and orlp as code owners November 7, 2024 02:39

nameexhaustion added 9 commits November 8, 2024 08:59

refactor(rust): Refactor compute kernels in polars-arrow to avoid usi…

b8d4b60

…ng gather

c

3223228

c

20d0b1d

c

e869def

c

8971c10

c

1630aa2

c

4ca943b

c

2815c94

c

5dff00a

nameexhaustion force-pushed the arrow-compute-no-take branch from 7e5705e to 5dff00a Compare November 7, 2024 22:18

nameexhaustion marked this pull request as draft November 7, 2024 22:49

c

5cb0ae4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(rust): Refactor compute kernels in polars-arrow to avoid using gather #19669

refactor(rust): Refactor compute kernels in polars-arrow to avoid using gather #19669

nameexhaustion commented Nov 6, 2024 •

edited

Loading

nameexhaustion Nov 6, 2024 •

edited

Loading

nameexhaustion Nov 6, 2024

nameexhaustion Nov 6, 2024

codecov bot commented Nov 7, 2024 •

edited

Loading

ritchie46 commented Nov 7, 2024

nameexhaustion commented Nov 7, 2024

		out = s.arr.get(100, null_on_oob=False)

		with pytest.raises(ComputeError, match="get index -3 is out of bounds"):

refactor(rust): Refactor compute kernels in polars-arrow to avoid using gather #19669

Are you sure you want to change the base?

refactor(rust): Refactor compute kernels in polars-arrow to avoid using gather #19669

Conversation

nameexhaustion commented Nov 6, 2024 • edited Loading

nameexhaustion Nov 6, 2024 • edited Loading

Choose a reason for hiding this comment

nameexhaustion Nov 6, 2024

Choose a reason for hiding this comment

nameexhaustion Nov 6, 2024

Choose a reason for hiding this comment

codecov bot commented Nov 7, 2024 • edited Loading

Codecov Report

ritchie46 commented Nov 7, 2024

nameexhaustion commented Nov 7, 2024

nameexhaustion commented Nov 6, 2024 •

edited

Loading

nameexhaustion Nov 6, 2024 •

edited

Loading

codecov bot commented Nov 7, 2024 •

edited

Loading