-
- Downloads
[SPARK-15259] Sort time metric should not include spill and record insertion time
## What changes were proposed in this pull request? After SPARK-14669 it seems the sort time metric includes both spill and record insertion time. This makes it not very useful since the metric becomes close to the total execution time of the node. We should track just the time spent for in-memory sort, as before. ## How was this patch tested? Verified metric in the UI, also unit test on UnsafeExternalRowSorter. cc davies Author: Eric Liang <ekl@databricks.com> Author: Eric Liang <ekhliang@gmail.com> Closes #13035 from ericl/fix-metrics.
Showing
- core/src/main/java/org/apache/spark/util/collection/unsafe/sort/UnsafeExternalSorter.java 13 additions, 0 deletions...ark/util/collection/unsafe/sort/UnsafeExternalSorter.java
- core/src/main/java/org/apache/spark/util/collection/unsafe/sort/UnsafeInMemorySorter.java 11 additions, 0 deletions...ark/util/collection/unsafe/sort/UnsafeInMemorySorter.java
- core/src/test/java/org/apache/spark/util/collection/unsafe/sort/UnsafeExternalSorterSuite.java 20 additions, 0 deletions...til/collection/unsafe/sort/UnsafeExternalSorterSuite.java
- sql/catalyst/src/main/java/org/apache/spark/sql/execution/UnsafeExternalRowSorter.java 7 additions, 0 deletions...g/apache/spark/sql/execution/UnsafeExternalRowSorter.java
- sql/core/src/main/scala/org/apache/spark/sql/execution/SortExec.scala 2 additions, 7 deletions.../main/scala/org/apache/spark/sql/execution/SortExec.scala
Please register or sign in to comment