- Dec 16, 2010
-
-
Mosharaf Chowdhury authored
- Config option related to pipelining has been removed. - Summary: Basic -> Pipelining / Parallel -> NO pipelining
-
- Dec 15, 2010
-
-
Mosharaf Chowdhury authored
- Renamed some ParallelLocalFileShuffle config options for clarity.
-
Mosharaf Chowdhury authored
- Renamed parallel-pull version to ParallelLocalFileShuffle - Note that setting max-concurrent connections to 1 in ParallelLocalFileShuffle should essentially be the same as BasicLocalFileShuffle
-
- Dec 07, 2010
-
-
Mosharaf Chowdhury authored
-
- Dec 04, 2010
-
-
Mosharaf Chowdhury authored
-
- Dec 02, 2010
-
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
Enabling/disabling HTTP pipelining is a config option now. Performance tradeoffs are not obvious yet.
-
- Nov 28, 2010
-
-
Mosharaf Chowdhury authored
- Added GroupByTest.scala for benchmarking.
-
- Nov 13, 2010
-
-
Matei Zaharia authored
-
- Nov 12, 2010
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Nov 09, 2010
-
-
Matei Zaharia authored
-
- Nov 08, 2010
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Nov 06, 2010
-
-
Matei Zaharia authored
phase of DfsShuffle.
-
- Nov 04, 2010
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
root authored
Fixed a small bug in DFS shuffle -- the number of reduce tasks was not being set based on numOutputSplits
-
Matei Zaharia authored
-
Matei Zaharia authored
well as versions of the shuffle operations that set the number of splits automatically.
-
Matei Zaharia authored
-
Matei Zaharia authored
don't all hit the same nodes at the same time.
-
- Nov 03, 2010
-
-
Matei Zaharia authored
-
- Nov 02, 2010
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Oct 24, 2010
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Oct 23, 2010
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Oct 19, 2010
-
-
Matei Zaharia authored
These tasks were being subjected to delay scheduling but then counted as having been launched on a preferred node. The solution is to have a separate queue for them and treat them as preferred during scheduling.
-
Matei Zaharia authored
-
- Oct 18, 2010
-
-
Mosharaf Chowdhury authored
Conflicts: src/scala/spark/SparkContext.scala Using the latest one from Matei.
-
- Oct 17, 2010
-
-
Matei Zaharia authored
-
- Oct 16, 2010
-
-
Matei Zaharia authored
are used to create an initial copy of each config file if the user does not have one. This way, users won't accidentally commit their changes to config files to git.
-
Matei Zaharia authored
-
Matei Zaharia authored
use this. Closes #12.
-
Matei Zaharia authored
-
Matei Zaharia authored
Simplified UnionRDD slightly and added a SparkContext.union method for efficiently union-ing a large number of RDDs
-
Matei Zaharia authored
optional constructor parameter, so that the scheduler is guaranteed that a Spark home has been set when it first builds its executor arg.
-
Matei Zaharia authored
SparkContext and have the master node serve those to workers.
-