- Oct 05, 2013
-
-
Patrick Wendell authored
-
- Oct 03, 2013
-
-
Matei Zaharia authored
fixed a wildcard bug in make-distribution.sh; ask sbt to check local maven repo in project/SparkBuild.scala (1) fixed a wildcard bug in make-distribution.sh: with the wildcard * in quotes, this cp command failed. it worked after moving the wildcard out quotes. (2) ask sbt to check local maven repo in SparkBuild.scala: To build Spark (0.9.0-SNAPSHOT) with the HEAD of mesos (0.15.0), I must do "make maven-install" under mesos/build, which publishes the java .jar file under ~/.m2. However, when building Spark (after pointing mesos to version 0.15.0), sbt uses ivy which by default only checks ~/.ivy2. This change is to tell sbt to also check ~/.m2.
-
Matei Zaharia authored
Update README: updated the link
-
Matei Zaharia authored
Allow users to set the application name for Spark on Yarn
-
tgravescs authored
-
- Oct 02, 2013
-
-
Matei Zaharia authored
Send Task results through the block manager when larger than Akka frame size (fixes SPARK-669). This change requires adding an extra failure mode: tasks can complete successfully, but the result gets lost or flushed from the block manager before it's been fetched. This change also moves the deserialization of tasks into a separate thread, so it's no longer part of the DAG scheduler's tight loop. This should improve scheduler throughput, particularly when tasks are sending back large results. Thanks Josh for writing the original version of this patch! This is duplicated from the mesos/spark repo: https://github.com/mesos/spark/pull/835
-
tgravescs authored
-
- Oct 01, 2013
-
-
Du Li authored
-
Du Li authored
-
CruncherBigData authored
-
Kay Ousterhout authored
-
Kay Ousterhout authored
-
- Sep 30, 2013
-
-
Kay Ousterhout authored
Conflicts: core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterScheduler.scala core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterTaskSetManager.scala core/src/main/scala/org/apache/spark/scheduler/local/LocalTaskSetManager.scala
-
Kay Ousterhout authored
-
- Sep 26, 2013
-
-
Reynold Xin authored
Remove -optimize flag
-
Reynold Xin authored
Bug fix in master build
-
Reynold Xin authored
Improved organization of scheduling packages. This commit does not change any code -- only file organization. Please let me know if there was some masterminded strategy behind the existing organization that I failed to understand! There are two components of this change: (1) Moving files out of the cluster package, and down a level to the scheduling package. These files are all used by the local scheduler in addition to the cluster scheduler(s), so should not be in the cluster package. As a result of this change, none of the files in the local package reference files in the cluster package. (2) Moving the mesos package to within the cluster package. The mesos scheduling code is for a cluster, and represents a specific case of cluster scheduling (the Mesos-related classes often subclass cluster scheduling classes). Thus, the most logical place for it seems to be within the cluster package. The one thing about the scheduling code that seems a little funny to me is the naming of the SchedulerBackends. The StandaloneSchedulerBackend is not just for Standalone mode, but instead is used by Mesos coarse grained mode and Yarn, and the backend that *is* just for Standalone mode is instead called SparkDeploySchedulerBackend. I didn't change this because I wasn't sure if there was a reason for this naming that I'm just not aware of.
-
Reynold Xin authored
EC2 SSH improvements
-
Reynold Xin authored
-
Reynold Xin authored
Add mapPartitionsWithIndex
-
Patrick Wendell authored
-
Reynold Xin authored
some minor fixes to MemoryStore This is a repeat of #5, moved to its own branch in my repo. This makes all updates to on ; it skips on synchronizing the reads where it can get away with it.
-
Patrick Wendell authored
Smarter take/limit implementation.
-
- Sep 25, 2013
-
-
Kay Ousterhout authored
This commit does not change any code -- only file organization. There are two components of this change: (1) Moving files out of the cluster package, and down a level to the scheduling package. These files are all used by the local scheduler in addition to the cluster scheduler(s), so should not be in the cluster package. As a result of this change, none of the files in the local package reference files in the cluster package. (2) Moving the mesos package to within the cluster package. The mesos scheduling code is for a cluster, and represents a specific case of cluster scheduling (the Mesos-related classes often subclass cluster scheduling classes). Thus, the most logical place for it is within the cluster package.
-
- Sep 24, 2013
-
-
Patrick Wendell authored
-
Patrick Wendell authored
-
- Sep 23, 2013
-
-
Holden Karau authored
-
Reynold Xin authored
Fix spacing so java.io.tmpdir doesn't run on with SPARK_JAVA_OPTS
-
Y.CORP.YAHOO.COM\tgraves authored
-
Reynold Xin authored
-
Reynold Xin authored
-
- Sep 22, 2013
-
-
Kay Ousterhout authored
This change requires adding an extra failure mode: tasks can complete successfully, but the result gets lost or flushed from the block manager before it's been fetched.
-
Holden Karau authored
-
Reynold Xin authored
Refactor FairSchedulableBuilder
-
jerryshao authored
-
jerryshao authored
-
jerryshao authored
1. Configuration can be read from classpath if not set explicitly. 2. Add missing close handler.
-
Reynold Xin authored
Fix PR926 local properties issues in Spark Streaming like scenarios
-
Reynold Xin authored
Add "org.apache." prefix to packages in spark-class
-