- Feb 13, 2013
  - Tathagata Das authored
  - Tathagata Das authored
    Added filter functionality to reduceByKeyAndWindow with inverse. Consolidated reduceByKeyAndWindow's many functions into a smaller number of functions with optional parameters.
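The reduce-with-inverse idea behind this change can be sketched outside Spark: rather than re-reducing the entire window on every slide, the previous window's result is updated by reducing in the batch that entered and "inverse-reducing" out the batch that left, and an optional filter then drops keys whose aggregates are no longer needed (e.g. counts that hit zero). A minimal Python sketch with plain dictionaries; this is not the Spark API, and all names here are illustrative:

```python
def slide_window(prev, entering, leaving, reduce_fn, inv_reduce_fn,
                 filter_fn=lambda k, v: True):
    """Incrementally update per-key window aggregates.

    prev      -- {key: aggregate} for the previous window position
    entering  -- {key: aggregate} for the batch that entered the window
    leaving   -- {key: aggregate} for the batch that left the window
    """
    out = dict(prev)
    for k, v in entering.items():        # fold in the new batch
        out[k] = reduce_fn(out[k], v) if k in out else v
    for k, v in leaving.items():         # "subtract" the departed batch
        out[k] = inv_reduce_fn(out[k], v)
    # Drop keys the filter rejects, e.g. counts that fell to zero.
    return {k: v for k, v in out.items() if filter_fn(k, v)}

# Windowed word counts: "b" slides out entirely and is filtered away.
prev     = {"a": 3, "b": 1}
entering = {"a": 2, "c": 1}
leaving  = {"b": 1}
window = slide_window(prev, entering, leaving,
                      lambda x, y: x + y, lambda x, y: x - y,
                      filter_fn=lambda k, v: v > 0)
```

The filter is what keeps the state from growing without bound: without it, keys whose counts reach zero would linger in the window state forever.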
  - Tathagata Das authored
    Changed the scheduler and file input stream to fix bugs in driver fault tolerance. Added MasterFailureTest to rigorously test master fault tolerance with a file input stream.
- Feb 10, 2013
  - Tathagata Das authored
    Fixed bugs in FileInputDStream and Scheduler that occasionally failed to reprocess old files after recovering from master failure. Completely modified spark.streaming.FailureTest to test multiple master failures using the file input stream.
  - Tathagata Das authored
- Feb 09, 2013
  - Tathagata Das authored
- Feb 07, 2013
  - Tathagata Das authored
  - Tathagata Das authored
  - Tathagata Das authored
  - Tathagata Das authored
    Removing offset management code that is non-existent in Kafka 0.7.0+.
  - Tathagata Das authored
    StateDStream changes to give updateStateByKey consistent behavior.
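The contract being made consistent here can be illustrated with a toy version of the updateStateByKey idea: an update function receives the new values for a key together with its previous state, runs even for keys that got no new data so state can be aged out, and removes a key by returning None. A Python sketch of that contract; this is not the actual StateDStream code, and the helper names are illustrative:

```python
def update_state(state, new_values_by_key, update_fn):
    """Apply update_fn(new_values, old_state) to every key that has
    either new data or existing state; returning None drops the key."""
    keys = set(state) | set(new_values_by_key)
    next_state = {}
    for k in keys:
        s = update_fn(new_values_by_key.get(k, []), state.get(k))
        if s is not None:          # None means "forget this key"
            next_state[k] = s
    return next_state

# Running counts that expire once a key receives no new values.
def counter(new_values, old):
    if not new_values:
        return None                # age out idle keys
    return (old or 0) + sum(new_values)

state = update_state({"a": 2, "b": 5}, {"a": [1, 1], "c": [7]}, counter)
```

The important consistency property is that the update function is invoked for every keyed state on every batch, not only for keys that happened to receive data, so expiry logic like the `None` return above behaves predictably.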
- Feb 05, 2013
  - Matei Zaharia authored
    Inline mergePair to look more like the narrow dep branch.
  - Matei Zaharia authored
    Handle Terminated to avoid endless DeathPactExceptions.
  - Stephen Haberman authored
    Conflicts: core/src/main/scala/spark/deploy/worker/Worker.scala
  - Matei Zaharia authored
    Increase DriverSuite timeout.
  - Stephen Haberman authored
    Credit to Roland Kuhn, Akka's tech lead, for pointing out this very obvious fix: StandaloneExecutorBackend.preStart's catch block would never get hit, because all of the operations in preStart are asynchronous. So the System.exit in the catch block was skipped, and instead Akka was sending Terminated messages which, since we didn't handle them, turned into DeathPactExceptions, which started an infinite postRestart/preStart loop.
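The bug pattern described above, a catch block that can never fire because the guarded work is scheduled asynchronously, is easy to reproduce in any async framework. A Python asyncio sketch of the same shape (illustrative only, not the Akka/Scala code): the failure surfaces on the task object, never in pre_start's except clause, so some supervisor (Akka's Terminated message in the original) has to handle it.

```python
import asyncio

caught_in_pre_start = False

async def remote_op():
    raise RuntimeError("connection failed")        # fails asynchronously

async def pre_start():
    global caught_in_pre_start
    try:
        task = asyncio.ensure_future(remote_op())  # only *schedules* the work
        await asyncio.sleep(0)                     # let the task actually run
    except RuntimeError:
        caught_in_pre_start = True                 # dead code, like the Scala catch
    return task

async def main():
    task = await pre_start()
    # The exception is attached to the task itself; retrieving it here
    # plays the role of the supervisor handling Terminated.
    return task.exception()

error = asyncio.run(main())
```

Leaving such a failure unhandled is what produced the postRestart/preStart loop: each restart re-ran the same async operation, which failed again the same way.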
  - Stephen Haberman authored
  - Stephen Haberman authored
    No functionality changes; I think this is just more consistent given mergePair isn't called multiple times or recursively. Also added a comment to explain the usual case of having two parent RDDs.
  - Matei Zaharia authored
    Streaming constructor which takes JavaSparkContext.
  - Patrick Wendell authored
    It's sometimes helpful to directly pass a JavaSparkContext and take advantage of the various constructors available for that.
- Feb 04, 2013
  - Matei Zaharia authored
  - Matei Zaharia authored
- Feb 03, 2013
  - Matei Zaharia authored
    Fix exit status in PySpark unit tests; fix/optimize PySpark's RDD.take().
  - Josh Rosen authored
  - Matei Zaharia authored
    Add spark.executor.memory to differentiate executor memory from spark-shell.
  - Matei Zaharia authored
    RDDInfo available from SparkContext.
  - Matei Zaharia authored
    Once we find a split with no block, we don't have to look for more.
  - Matei Zaharia authored
    Fix createActorSystem not actually using the systemName parameter.
  - Matei Zaharia authored
  - Josh Rosen authored
  - Josh Rosen authored
- Feb 02, 2013
  - Matei Zaharia authored
  - Matei Zaharia authored
    Tests for DAGScheduler.
  - Stephen Haberman authored
  - Charles Reiss authored
    Conflicts: core/src/main/scala/spark/scheduler/DAGScheduler.scala
  - Stephen Haberman authored
  - Stephen Haberman authored
  - Stephen Haberman authored
  - Stephen Haberman authored
    This meant all system names were "spark", which worked, but didn't lead to the most intuitive log output. This fixes createActorSystem to use the passed system name, and refactors Master/Worker to encapsulate their system/actor names instead of having the clients guess at them. Note that the driver system name, "spark", is left as is, and is still repeated a few times, but that seems like a separate issue.
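The refactor described can be sketched in a few lines: each daemon owns its system and actor names and exposes the full actor path, so clients ask for the path instead of hard-coding "spark". A Python sketch; the URI merely mimics the shape of an Akka remote actor path, and the class and names are illustrative, not the actual Master/Worker code:

```python
class ActorSystemId:
    """Holds a daemon's system/actor names so clients never guess them."""
    def __init__(self, system_name, host, port, actor_name):
        self.system_name = system_name
        self.host = host
        self.port = port
        self.actor_name = actor_name

    def to_uri(self):
        # Shaped like an Akka remote actor path: system@host:port/user/name.
        return (f"akka://{self.system_name}@{self.host}:{self.port}"
                f"/user/{self.actor_name}")

# Distinct system names make log lines attributable to the right daemon.
master = ActorSystemId("sparkMaster", "host1", 7077, "Master")
worker = ActorSystemId("sparkWorker", "host2", 7078, "Worker")
```

With one authoritative source for each name, a rename touches a single place, and log output shows "sparkMaster"/"sparkWorker" rather than an undifferentiated "spark".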
  - Charles Reiss authored