- Dec 13, 2013
-
-
Reynold Xin authored
Added a comment about ActorRef and ActorSelection difference.
-
Prashant Sharma authored
-
Reynold Xin authored
Review comments on the PR for scala 2.10 migration.
-
Prashant Sharma authored
-
- Dec 12, 2013
-
-
Patrick Wendell authored
Disabled yarn 2.2 in sbt and mvn build and added a message in the sbt build.
-
Prashant Sharma authored
-
Patrick Wendell authored
Scala 2.10 migration This PR migrates spark to scala 2.10. Summary of changes apart from scala 2.10 migration: (has no implications for user.) 1. Migrated Akka to 2.2.3. Does not use remote death watch for it has a bug, where it tries to send message to dead node infinitely. Uses an indestructible actorsystem which tolerates errors only on executors. (Might be useful for user.) 4. New configuration settings introduced: System.getProperty("spark.akka.heartbeat.pauses", "600") System.getProperty("spark.akka.failure-detector.threshold", "300.0") System.getProperty("spark.akka.heartbeat.interval", "1000") Defaults for these are fairly large to only disable Failure detector that comes with akka. The reason for doing so is we have our own failure detector like mechanism in place and then this is just an overhead on top of that + it leads to a lot of false positives. But with these properties it is possible to enable them. A good use case for enabling it could be when someone wants spark to be sensitive (in a controllable manner ofc.) to GC pauses/Network lags and quickly evict executors that experienced it. More information is included in configuration.md Once we have the SPARK-544 merged, I had like to deprecate atleast these akka properties and may be others too. This PR is duplicate of #221(where all the discussion happened.) for that one pointed to master this one points to scala-2.10 branch.
-
- Dec 11, 2013
-
-
Prashant Sharma authored
-
- Dec 10, 2013
-
-
Prashant Sharma authored
-
Prashant Sharma authored
Conflicts: core/pom.xml core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala pom.xml project/SparkBuild.scala streaming/pom.xml yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocationHandler.scala
-
Prashant Sharma authored
-
Patrick Wendell authored
README incorrectly suggests build sources spark-env.sh This is misleading because the build doesn't source that file. IMO it's better to force people to specify build environment variables on the command line always, like we do in every example, so I'm just removing this doc.
-
Patrick Wendell authored
This is misleading because the build doesn't source that file. IMO it's better to force people to specify build environment variables on the command line always, like we do in every example.
-
Prashant Sharma authored
-
- Dec 09, 2013
-
-
Patrick Wendell authored
Add missing license headers I found this when doing further audits on the 0.8.1 release candidate.
-
Patrick Wendell authored
-
Prashant Sharma authored
-
- Dec 08, 2013
-
-
Patrick Wendell authored
[Deb] fix package of Spark classes adding org.apache prefix in scripts embeded in .deb
-
Patrick Wendell authored
Update broken links and add HDP 2.0 version string I ran a link checker on the UI and found several broken links.
-
Patrick Wendell authored
-
Patrick Wendell authored
-
Patrick Wendell authored
-
- Dec 07, 2013
-
-
Patrick Wendell authored
SPARK-917 Improve API links in nav bar
-
Patrick Wendell authored
Correct spellling error in configuration.md
-
Patrick Wendell authored
-
Aaron Davidson authored
-
Prashant Sharma authored
Incorporated Patrick's feedback comment on #211 and made maven build/dep-resolution atleast a bit faster.
-
- Dec 06, 2013
-
-
Patrick Wendell authored
Formatting fix This is a single-line change. The diff appears larger here due to github being out of sync.
-
Patrick Wendell authored
-
Patrick Wendell authored
Adding disclaimer for shuffle file consolidation
-
Patrick Wendell authored
Minor doc fixes and updating README
-
Patrick Wendell authored
-
Patrick Wendell authored
-
Patrick Wendell authored
Updated documentation about the YARN v2.2 build process
-
Ali Ghodsi authored
-
Ali Ghodsi authored
-
Matei Zaharia authored
stageId <--> jobId mapping in DAGScheduler Okay, I think this one is ready to go -- or at least it's ready for review and discussion. It's a carry-over of https://github.com/mesos/spark/pull/842 with updates for the newer job cancellation functionality. The prior discussion still applies. I've actually changed the job cancellation flow a bit: Instead of ``cancelTasks`` going to the TaskScheduler and then ``taskSetFailed`` coming back to the DAGScheduler (resulting in ``abortStage`` there), the DAGScheduler now takes care of figuring out which stages should be cancelled, tells the TaskScheduler to cancel tasks for those stages, then does the cleanup within the DAGScheduler directly without the need for any further prompting by the TaskScheduler. I know of three outstanding issues, each of which can and should, I believe, be handled in follow-up pull requests: 1) https://spark-project.atlassian.net/browse/SPARK-960 2) JobLogger should be re-factored to eliminate duplication 3) Related to 2), the WebUI should also become a consumer of the DAGScheduler's new understanding of the relationship between jobs and stages so that it can display progress indication and the like grouped by job. Right now, some of this information is just being sent out as part of ``SparkListenerJobStart`` messages, but more or different job <--> stage information may need to be exported from the DAGScheduler to meet listeners needs. Except for the eventQueue -> Actor commit, the rest can be cherry-picked almost cleanly into branch-0.8. A little merging is needed in MapOutputTracker and the DAGScheduler. Merged versions of those files are in https://github.com/markhamstra/incubator-spark/tree/aba2b40ce04ee9b7b9ea260abb6f09e050142d43 Note that between the recent Actor change in the DAGScheduler and the cleaning up of DAGScheduler data structures on job completion in this PR, some races have been introduced into the DAGSchedulerSuite. Those tests usually pass, and I don't think that better-behaved code that doesn't directly inspect DAGScheduler data structures should be seeing any problems, but I'll work on fixing DAGSchedulerSuite as either an addition to this PR or as a separate request. UPDATE: Fixed the race that I introduced. Created a JIRA issue (SPARK-965) for the one that was introduced with the switch to eventProcessorActor in the DAGScheduler.
-
Matei Zaharia authored
Change the name of input argument in ClusterScheduler#initialize from context to backend. The SchedulerBackend used to be called ClusterSchedulerContext so just want to make small change of the input param in the ClusterScheduler#initialize to reflect this.
-
Matei Zaharia authored
Added logging of scheduler delays to UI This commit adds two metrics to the UI: 1) The time to get task results, if they're fetched remotely 2) The scheduler delay. When the scheduler starts getting overwhelmed (because it can't keep up with the rate at which tasks are being submitted), the result is that tasks get delayed on the tail-end: the message from the worker saying that the task has completed ends up in a long queue and takes a while to be processed by the scheduler. This commit records that delay in the UI so that users can tell when the scheduler is becoming the bottleneck.
-
Matei Zaharia authored
Memoize preferred locations in ZippedPartitionsBaseRDD so preferred location computation doesn't lead to exponential explosion. This was a problem in GraphX where we have a whole chain of RDDs that are ZippedPartitionsRDD's, and the preferred locations were taking eternity to compute. (cherry picked from commit e36fe55a) Signed-off-by:
Reynold Xin <rxin@apache.org>
-