Fix: ViewType gc on huge batch would produce bad output #8694
Conversation
| .map(|i| unsafe { self.copy_view_to_buffer(i, &mut data_buf) }) | ||
| .collect(); | ||
| for view in self.views() { | ||
| let len = *view as u32; |
This part is quite slow, but it is correct. I can make it faster (by handling the values via grouping or batching) if required.
I am not sure how you would make this much faster - I think the code needs to find the locations to split in any event
Even if the total buffer size is greater than i32::MAX, a single input buffer may be much smaller than i32::MAX, so this could find the split points batch-by-batch rather than adding small buffers one by one?
I see -- you are saying you could potentially optimize by looking at each input buffer and gcing it individually.
That would be interesting, though it would probably take a lot of care to make it fast.
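The splitting logic being discussed can be sketched roughly as follows. This is an illustrative simplification, not the PR's actual code: `GcCopyGroup` is the type the PR introduces, but `split_into_groups`, its fields' exact semantics, and the `MAX_INLINE_VIEW_LEN` constant here are assumptions for the sketch.

```rust
// Simplified sketch: walk the view lengths and start a new group whenever
// adding the next non-inlined value would push the group's data buffer
// past the cap (i32::MAX bytes in the real code, the Arrow offset limit).

const MAX_INLINE_VIEW_LEN: u32 = 12; // values <= 12 bytes are stored inline in the view

#[derive(Debug, PartialEq)]
struct GcCopyGroup {
    total_buffer_bytes: usize, // bytes of non-inlined data in this group
    total_len: usize,          // number of views covered by this group
}

fn split_into_groups(view_lens: &[u32], max_bytes: usize) -> Vec<GcCopyGroup> {
    let mut groups = Vec::new();
    let mut cur_bytes = 0usize;
    let mut cur_len = 0usize;
    for &len in view_lens {
        // Inlined values contribute no bytes to the data buffer.
        let bytes = if len > MAX_INLINE_VIEW_LEN { len as usize } else { 0 };
        if bytes > 0 && cur_bytes + bytes > max_bytes {
            groups.push(GcCopyGroup { total_buffer_bytes: cur_bytes, total_len: cur_len });
            cur_bytes = 0;
            cur_len = 0;
        }
        cur_bytes += bytes;
        cur_len += 1;
    }
    groups.push(GcCopyGroup { total_buffer_bytes: cur_bytes, total_len: cur_len });
    groups
}

fn main() {
    // Two 60-byte values overflow a 100-byte cap, so a second group starts.
    let groups = split_into_groups(&[60, 5, 60], 100);
    assert_eq!(groups.len(), 2);
    assert_eq!(groups[0], GcCopyGroup { total_buffer_bytes: 60, total_len: 2 });
    assert_eq!(groups[1], GcCopyGroup { total_buffer_bytes: 60, total_len: 1 });
}
```

The point of the single pass is that the split locations must be found anyway; the grouping only decides how the copied data is distributed across output buffers.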
@alamb Besides, I hit this bug when I had a 4GiB StringViewArray. arrow-rs treats the offset as u32; however, the Arrow standard uses i32, so I limit it to 2GiB. There are other places that use …
| }; | ||
| vec![gc_copy_group] | ||
| }; | ||
| assert!(gc_copy_groups.len() <= i32::MAX as usize); |
This assertion can be removed; I just want to ensure it would pass.
Maybe change it to a debug assert here.
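For reference, `debug_assert!` checks the invariant in debug and test builds but compiles to nothing in release builds, so it costs nothing in production. A minimal illustration (the function name here is hypothetical):

```rust
fn checked_group_count(groups_len: usize) -> usize {
    // Checked only in debug builds; release builds skip this entirely.
    debug_assert!(groups_len <= i32::MAX as usize);
    groups_len
}

fn main() {
    assert_eq!(checked_group_count(3), 3);
}
```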
alamb
left a comment
Thank you @mapleFU -- this is a good find. I left some comments, let me know what you think
cc @zhuqi-lucas perhaps you have some thoughts
The MIRI test is probably failing due to the massive memory use in https://github.com/apache/arrow-rs/actions/runs/18818674867/job/53690752815?pr=8694. I suggest we don't run that test under miri by disabling it with something like `#[cfg_attr(miri, ignore)] // Takes too long`.
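The attribute is applied on the test itself; a small sketch (the test name and body here are hypothetical, not the PR's actual test):

```rust
#[cfg(test)]
mod tests {
    #[test]
    // Skipped when running `cargo miri test`, executed normally otherwise.
    #[cfg_attr(miri, ignore)] // Takes too long under miri
    fn gc_huge_batch() {
        let big = vec![0u8; 1 << 20];
        assert_eq!(big.len(), 1 << 20);
    }
}

fn main() {}
```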
🤖: Benchmark completed
zhuqi-lucas
left a comment
Nice finding, thank you!
| } | ||
| &groups | ||
| } else { | ||
| one_group.as_slice() |
I've added a `one_group` for the gc-copy-group on the stack; hopefully this avoids an allocation:

```rust
let one_group = [GcCopyGroup {
    total_buffer_bytes: total_large,
    total_len: len,
}];
```
@alamb I've run the new code on macOS (M4 Pro). I don't know whether the performance difference comes from an unstable environment or something else.

This patch: …
main: …
alamb
left a comment
Thank you @mapleFU - I went through this again and it looks good to me.
I would like to run the benchmarks one more time before merging this (I have queued them up on my machine so hopefully they'll post to this PR in a few hours)
🤖: Benchmark completed
| if len > MAX_INLINE_VIEW_LEN { | ||
| if current_length + len > i32::MAX as u32 { | ||
| // Start a new group | ||
| groups.push(GcCopyGroup { |
I think you can preallocate `groups` (with `Vec::with_capacity`) or use `.collect`
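Preallocating with `Vec::with_capacity` avoids repeated grow-and-copy steps during the push loop; a minimal sketch (the function and its capacity argument are illustrative, not the PR's code):

```rust
// One allocation up front instead of repeated reallocations as the
// Vec grows; push never has to reallocate when the estimate is right.
fn build_groups(n: usize) -> Vec<usize> {
    let mut groups = Vec::with_capacity(n);
    for i in 0..n {
        groups.push(i);
    }
    groups
}

fn main() {
    let g = build_groups(4);
    assert_eq!(g, vec![0, 1, 2, 3]);
    assert!(g.capacity() >= 4);
}
```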
| for view_idx in current_view_idx..current_view_idx + gc_copy_group.total_len { | ||
| let view = | ||
| unsafe { self.copy_view_to_buffer(view_idx, group_idx as i32, &mut data_buf) }; | ||
| views_buf.push(view); |
This can use Vec::collect
views_buf is preallocated once; would Vec::collect allocate more than it currently does?
I think using collect doesn't allocate additional memory, but it can be faster than push because collect does the capacity check once where push has to check capacity on each call
However, since both Vecs are being modified at the same time, I couldn't figure out a way to use collect here -- I could use extend
I also think collect is possible here; I'll do it after work tonight.
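One way to use `collect` even though two Vecs are produced at once is to collect tuples and `unzip` them; a sketch with hypothetical element types (not the PR's actual view/data layout):

```rust
fn main() {
    // Each iteration yields a (view, byte) pair; unzip splits the
    // pairs into two Vecs in a single pass over the iterator.
    let (views, bytes): (Vec<u128>, Vec<u8>) =
        (0u8..4).map(|i| (i as u128 * 10, i)).unzip();
    assert_eq!(views, vec![0, 10, 20, 30]);
    assert_eq!(bytes, vec![0, 1, 2, 3]);
}
```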
| views_buf.push(view); | ||
| } | ||
| data_blocks.push(Buffer::from_vec(data_buf)); |
This could use collect as well.
alamb
left a comment
I went over this PR again and I think the benchmarks show it slowing down GC, which is not good. I have some ideas on how to get the performance back (along the lines of what @Dandandan suggested)
Here is a proposal to avoid the performance drop in this PR:

🤖: Benchmark completed

The performance results seem to confirm it avoids the performance regression.
Try and improve GC performance
mapleFU
left a comment
(Let's continue here)
| let data_blocks = vec![data_block]; | ||
| (views_buf, data_blocks) | ||
| } else { | ||
| // slow path, need to split into multiple buffers |
Would it be better if I extracted this into a new function?
I think so!
| (views_buf, data_blocks) | ||
| }; | ||
| // 5) Wrap up buffers |
Should this change?
to what?
| let views_buf: Vec<u128> = (0..len) | ||
| .map(|i| unsafe { self.copy_view_to_buffer(i, &mut data_buf) }) | ||
| .collect(); | ||
| let mut groups = Vec::with_capacity(total_large / (i32::MAX as usize) + 1); |
I think this might not be good; the allocation count on this slow path might not matter?
I think you are referring to this comment.
I didn't quite follow the concern, see mapleFU#1 (comment)
However, that being said, we can revert this part too -- I don't feel strongly about it, and I agree that the allocation likely doesn't matter on the slow path.
alamb
left a comment
I think this PR is now ready to go and doesn't regress performance. I started another benchmark just to make sure
Thanks @mapleFU -- I think we can hone it a bit more if you want (extract the slow path as a function, for example), but we could also do that as a follow-on.
🤖: Benchmark completed
I've just changed two lines here; it's ready to merge for me now.
🤖: Benchmark completed
Thanks again @mapleFU -- sorry this one took so long, but I think it is quite great now.
Which issue does this PR close?
Rationale for this change
Previously, gc() would produce a single buffer. However, for buffer sizes greater than 2GiB it would be buggy, since the buffer offset is a 4-byte signed integer.

What changes are included in this PR?

Add a GcCopyGroup type and do gc per group.
Are these changes tested?
Yes
Are there any user-facing changes?
gc would produce more buffers