-
- Downloads
[SPARK-1683] Track task read metrics.
This commit adds a new metric in TaskMetrics to record the input data size and displays this information in the UI. An earlier version of this commit also added the read time, which can be useful for diagnosing straggler problems, but unfortunately that change introduced a significant performance regression for jobs that don't do much computation. In order to track read time, we'll need to do sampling. The screenshots below show the UI with the new "Input" field, which I added to the stage summary page, the executor summary page, and the per-stage page.    Author: Kay Ousterhout <kayousterhout@gmail.com> Closes #962 from kayousterhout/read_metrics and squashes the following commits: f13b67d [Kay Ousterhout] Correctly format input bytes on executor page 8b70cde [Kay Ousterhout] Added comment about potential inaccuracy of bytesRead d1016e8 [Kay Ousterhout] Udated SparkListenerSuite test 8461492 [Kay Ousterhout] Miniscule style fix ae04d99 [Kay Ousterhout] Remove input metrics for parallel collections 719f19d [Kay Ousterhout] Style fixes bb6ec62 [Kay Ousterhout] Small fixes 869ac7b [Kay Ousterhout] Updated Json tests 44a0301 [Kay Ousterhout] Fixed accidentally added line 4bd0568 [Kay Ousterhout] Added input source, renamed Hdfs to Hadoop. f27e535 [Kay Ousterhout] Updates based on review comments and to fix rebase bf41029 [Kay Ousterhout] Updated Json tests to pass 0fc33e0 [Kay Ousterhout] Added explicit backward compatibility test 4e52925 [Kay Ousterhout] Added Json output and associated tests. 365400b [Kay Ousterhout] [SPARK-1683] Track task read metrics.
Showing
- core/src/main/scala/org/apache/spark/CacheManager.scala 6 additions, 4 deletionscore/src/main/scala/org/apache/spark/CacheManager.scala
- core/src/main/scala/org/apache/spark/executor/TaskMetrics.scala 29 additions, 0 deletions...rc/main/scala/org/apache/spark/executor/TaskMetrics.scala
- core/src/main/scala/org/apache/spark/rdd/BlockRDD.scala 1 addition, 1 deletioncore/src/main/scala/org/apache/spark/rdd/BlockRDD.scala
- core/src/main/scala/org/apache/spark/rdd/HadoopRDD.scala 15 additions, 0 deletionscore/src/main/scala/org/apache/spark/rdd/HadoopRDD.scala
- core/src/main/scala/org/apache/spark/rdd/NewHadoopRDD.scala 13 additions, 0 deletionscore/src/main/scala/org/apache/spark/rdd/NewHadoopRDD.scala
- core/src/main/scala/org/apache/spark/scheduler/JobLogger.scala 12 additions, 3 deletions...src/main/scala/org/apache/spark/scheduler/JobLogger.scala
- core/src/main/scala/org/apache/spark/storage/BlockManager.scala 39 additions, 24 deletions...rc/main/scala/org/apache/spark/storage/BlockManager.scala
- core/src/main/scala/org/apache/spark/storage/ThreadingTest.scala 1 addition, 1 deletion...c/main/scala/org/apache/spark/storage/ThreadingTest.scala
- core/src/main/scala/org/apache/spark/ui/exec/ExecutorsPage.scala 4 additions, 0 deletions...c/main/scala/org/apache/spark/ui/exec/ExecutorsPage.scala
- core/src/main/scala/org/apache/spark/ui/exec/ExecutorsTab.scala 5 additions, 0 deletions...rc/main/scala/org/apache/spark/ui/exec/ExecutorsTab.scala
- core/src/main/scala/org/apache/spark/ui/jobs/ExecutorSummary.scala 1 addition, 0 deletions...main/scala/org/apache/spark/ui/jobs/ExecutorSummary.scala
- core/src/main/scala/org/apache/spark/ui/jobs/ExecutorTable.scala 2 additions, 0 deletions...c/main/scala/org/apache/spark/ui/jobs/ExecutorTable.scala
- core/src/main/scala/org/apache/spark/ui/jobs/JobProgressListener.scala 7 additions, 8 deletions.../scala/org/apache/spark/ui/jobs/JobProgressListener.scala
- core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala 35 additions, 6 deletionscore/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala
- core/src/main/scala/org/apache/spark/ui/jobs/StageTable.scala 7 additions, 0 deletions.../src/main/scala/org/apache/spark/ui/jobs/StageTable.scala
- core/src/main/scala/org/apache/spark/util/JsonProtocol.scala 19 additions, 1 deletioncore/src/main/scala/org/apache/spark/util/JsonProtocol.scala
- core/src/test/scala/org/apache/spark/CacheManagerSuite.scala 3 additions, 1 deletioncore/src/test/scala/org/apache/spark/CacheManagerSuite.scala
- core/src/test/scala/org/apache/spark/scheduler/SparkListenerSuite.scala 1 addition, 0 deletions...scala/org/apache/spark/scheduler/SparkListenerSuite.scala
- core/src/test/scala/org/apache/spark/storage/BlockManagerSuite.scala 61 additions, 23 deletions...st/scala/org/apache/spark/storage/BlockManagerSuite.scala
- core/src/test/scala/org/apache/spark/util/JsonProtocolSuite.scala 88 additions, 14 deletions.../test/scala/org/apache/spark/util/JsonProtocolSuite.scala
Loading
Please register or sign in to comment