-
- Downloads
[SPARK-14320][SQL] Make ColumnarBatch.Row mutable
## What changes were proposed in this pull request? In order to leverage a data structure like `AggregateHashMap` (https://github.com/apache/spark/pull/12055) to speed up aggregates with keys, we need to make `ColumnarBatch.Row` mutable. ## How was this patch tested? Unit test in `ColumnarBatchSuite`. Also, tested via `BenchmarkWholeStageCodegen`. Author: Sameer Agarwal <sameer@databricks.com> Closes #12103 from sameeragarwal/mutable-row.
Showing
- sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/AggregateHashMap.java 7 additions, 4 deletions...ache/spark/sql/execution/vectorized/AggregateHashMap.java
- sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ColumnVector.java 12 additions, 0 deletions...g/apache/spark/sql/execution/vectorized/ColumnVector.java
- sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ColumnarBatch.java 93 additions, 1 deletion.../apache/spark/sql/execution/vectorized/ColumnarBatch.java
- sql/core/src/test/scala/org/apache/spark/sql/execution/BenchmarkWholeStageCodegen.scala 2 additions, 3 deletions...ache/spark/sql/execution/BenchmarkWholeStageCodegen.scala
- sql/core/src/test/scala/org/apache/spark/sql/execution/vectorized/ColumnarBatchSuite.scala 21 additions, 0 deletions...e/spark/sql/execution/vectorized/ColumnarBatchSuite.scala
Loading
Please register or sign in to comment