-
- Downloads
Initial work towards scheduler refactoring:
- Replace use of hostPort vs host in Task.preferredLocations with a TaskLocation class that contains either an executorId and a host or just a host. This is part of a bigger effort to eliminate hostPort based data structures and just use executorID, since the hostPort vs host stuff is confusing (and not checkable with static typing, leading to ugly debug code), and hostPorts are not provided by Mesos. - Replaced most hostPort-based data structures and fields as above. - Simplified ClusterTaskSetManager to deal with preferred locations in a more concise way and generally be more concise. - Updated the way ClusterTaskSetManager handles racks: instead of enqueueing a task to a separate queue for all the hosts in the rack, which would create lots of large queues, have one queue per rack name. - Removed non-local fallback stuff in ClusterScheduler that tried to launch less-local tasks on a node once the local ones were all assigned. This change didn't work because many cluster schedulers send offers for just one node at a time (even the standalone and YARN ones do so as nodes join the cluster one by one). Thus, lots of non-local tasks would be assigned even though a node with locality for them would be able to receive tasks just a short time later. - Renamed MapOutputTracker "generations" to "epochs".
Showing
- core/src/hadoop2-yarn/scala/spark/scheduler/cluster/YarnClusterScheduler.scala 0 additions, 7 deletions.../scala/spark/scheduler/cluster/YarnClusterScheduler.scala
- core/src/main/scala/spark/MapOutputTracker.scala 31 additions, 31 deletionscore/src/main/scala/spark/MapOutputTracker.scala
- core/src/main/scala/spark/RDD.scala 2 additions, 2 deletionscore/src/main/scala/spark/RDD.scala
- core/src/main/scala/spark/executor/Executor.scala 8 additions, 4 deletionscore/src/main/scala/spark/executor/Executor.scala
- core/src/main/scala/spark/rdd/BlockRDD.scala 3 additions, 4 deletionscore/src/main/scala/spark/rdd/BlockRDD.scala
- core/src/main/scala/spark/rdd/ZippedPartitionsRDD.scala 2 additions, 0 deletionscore/src/main/scala/spark/rdd/ZippedPartitionsRDD.scala
- core/src/main/scala/spark/scheduler/DAGScheduler.scala 41 additions, 38 deletionscore/src/main/scala/spark/scheduler/DAGScheduler.scala
- core/src/main/scala/spark/scheduler/DAGSchedulerEvent.scala 1 addition, 3 deletionscore/src/main/scala/spark/scheduler/DAGSchedulerEvent.scala
- core/src/main/scala/spark/scheduler/ResultTask.scala 6 additions, 9 deletionscore/src/main/scala/spark/scheduler/ResultTask.scala
- core/src/main/scala/spark/scheduler/ShuffleMapTask.scala 6 additions, 9 deletionscore/src/main/scala/spark/scheduler/ShuffleMapTask.scala
- core/src/main/scala/spark/scheduler/Task.scala 2 additions, 2 deletionscore/src/main/scala/spark/scheduler/Task.scala
- core/src/main/scala/spark/scheduler/TaskLocation.scala 32 additions, 0 deletionscore/src/main/scala/spark/scheduler/TaskLocation.scala
- core/src/main/scala/spark/scheduler/TaskSchedulerListener.scala 1 addition, 1 deletion...rc/main/scala/spark/scheduler/TaskSchedulerListener.scala
- core/src/main/scala/spark/scheduler/cluster/ClusterScheduler.scala 75 additions, 208 deletions...main/scala/spark/scheduler/cluster/ClusterScheduler.scala
- core/src/main/scala/spark/scheduler/cluster/ClusterTaskSetManager.scala 170 additions, 340 deletions...scala/spark/scheduler/cluster/ClusterTaskSetManager.scala
- core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala 8 additions, 7 deletions.../spark/scheduler/cluster/StandaloneSchedulerBackend.scala
- core/src/main/scala/spark/scheduler/cluster/TaskInfo.scala 1 addition, 3 deletionscore/src/main/scala/spark/scheduler/cluster/TaskInfo.scala
- core/src/main/scala/spark/scheduler/cluster/TaskLocality.scala 32 additions, 0 deletions...src/main/scala/spark/scheduler/cluster/TaskLocality.scala
- core/src/main/scala/spark/scheduler/cluster/TaskSetManager.scala 2 additions, 11 deletions...c/main/scala/spark/scheduler/cluster/TaskSetManager.scala
- core/src/main/scala/spark/scheduler/cluster/WorkerOffer.scala 1 addition, 2 deletions.../src/main/scala/spark/scheduler/cluster/WorkerOffer.scala
Loading
Please register or sign in to comment