- Feb 10, 2012
  - Matei Zaharia authored
  - Matei Zaharia authored
  - Matei Zaharia authored: … and made DAGScheduler automatically set SparkEnv.
- Feb 06, 2012
  - Matei Zaharia authored
  - Matei Zaharia authored
  - Matei Zaharia authored
  - Matei Zaharia authored
  - Matei Zaharia authored
- Jan 31, 2012
  - Matei Zaharia authored
  - Matei Zaharia authored
- Jan 30, 2012
  - Matei Zaharia authored: Added immutable map registration in the Kryo serializer.
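
A hedged sketch of what registering immutable maps with Kryo looks like in early Spark. The KryoRegistrator hook shown here is the user-facing way to add registrations; the commit itself registers the classes inside the serializer, and the exact classes registered are an assumption:

    import com.esotericsoftware.kryo.Kryo
    import spark.KryoRegistrator

    // Register the concrete classes immutable Maps deserialize into, so Kryo
    // doesn't fall back to writing out full class names for each object.
    // (Which classes the commit actually registers is an assumption here.)
    class MapRegistrator extends KryoRegistrator {
      override def registerClasses(kryo: Kryo) {
        kryo.register(Map.empty[Any, Any].getClass)   // Map.EmptyMap
        kryo.register(Map(1 -> 1).getClass)           // Map.Map1
        kryo.register(Map(1 -> 1, 2 -> 2).getClass)   // Map.Map2
      }
    }

    // Enabled with:
    //   System.setProperty("spark.kryo.registrator", "MapRegistrator")
    // before creating the SparkContext.
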
- Jan 26, 2012
  - Hiral Patel authored
- Jan 13, 2012
  - Matei Zaharia authored: Made improvements to takeSample; also renamed SparkLocalKMeans to SparkKMeans.
  - Matei Zaharia authored
  - Matei Zaharia authored
- Jan 09, 2012
  - Edison Tung authored: Fixed the bugs detailed in the diff. One of the bugs had already been fixed in my local copy; I had forgotten to commit it.
- Jan 05, 2012
  - Matei Zaharia authored: Fixes #105.
- Dec 15, 2011
  - Matei Zaharia authored
- Dec 14, 2011
  - Matei Zaharia authored
  - Matei Zaharia authored
- Dec 02, 2011
  - Matei Zaharia authored
- Dec 01, 2011
  - Charles Reiss authored
  - Charles Reiss authored
  - Matei Zaharia authored: … (you can no longer iterate over a Source multiple times; see the sketch after this list).
  - Edison Tung authored
  - Edison Tung authored: Math.min takes 2 args, not 1. This was not committed earlier for some reason.
  - Edison Tung authored
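
The parenthetical above refers to scala.io.Source being consumable only once. A minimal sketch of the safe pattern (the file name is illustrative):

    import scala.io.Source

    object SourceOnce {
      def main(args: Array[String]) {
        // A Source is an Iterator underneath: a second pass over the same
        // Source sees nothing. Materialize the lines for multiple passes.
        val lines = Source.fromFile("input.txt").getLines().toArray
        println(lines.count(_.nonEmpty))   // first pass over the array
        println(lines.maxBy(_.length))     // second pass still works
      }
    }
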
- Nov 30, 2011
  - Matei Zaharia authored: … merge results into, rather than requiring a new object allocation for each element merged. Fixes #95. (See the sketch after this list.)
  - Matei Zaharia authored
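
A hedged sketch of the allocation-saving pattern this entry describes, written outside any Spark API (all names here are illustrative): the merge step mutates and returns the existing per-key buffer instead of allocating a new object for every element merged.

    import scala.collection.mutable
    import scala.collection.mutable.ArrayBuffer

    object MergeInto {
      def main(args: Array[String]) {
        val pairs = Seq("a" -> 1, "b" -> 2, "a" -> 3)
        val acc = mutable.Map.empty[String, ArrayBuffer[Int]]
        for ((k, v) <- pairs) acc.get(k) match {
          case Some(buf) => buf += v                // merge into existing buffer
          case None      => acc(k) = ArrayBuffer(v) // allocate once per key
        }
        println(acc) // Map(b -> ArrayBuffer(2), a -> ArrayBuffer(1, 3))
      }
    }
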
- Nov 21, 2011
  - Edison Tung authored: The takeSamples method takes a specified number of samples from the RDD and returns them in an array (usage sketch after this list).
  - Edison Tung authored: LocalKMeans runs locally with a randomly generated dataset; SparkLocalKMeans takes an input file and runs KMeans on it.
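
A hedged usage sketch of the sampling entry above, assuming the method's eventual takeSample(withReplacement, num, seed) form (the commit introduces it as takeSamples, and the exact signature of this era may differ):

    import spark.SparkContext

    object TakeSampleDemo {
      def main(args: Array[String]) {
        val sc = new SparkContext("local", "TakeSampleDemo")
        val rdd = sc.parallelize(1 to 1000)
        // Draw 10 elements without replacement, with a fixed seed;
        // the result is a local Array[Int], not an RDD.
        val sample = rdd.takeSample(false, 10, 42)
        println(sample.mkString(", "))
        sc.stop()
      }
    }
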
- Nov 13, 2011
  - Ankur Dave authored: The first time they appear, exceptions are printed in full, including a stack trace. After that, they are printed in abbreviated form. They are periodically reprinted in full; the reprint interval defaults to 5 seconds and is configurable using the property spark.logging.exceptionPrintInterval. (A sketch of this pattern follows the list.)
  - Ankur Dave authored: When a task throws an exception, the Spark executor previously just logged it to a local file on the slave and exited. This commit causes Spark to also report the exception back to the driver using a Mesos status update, so the user doesn't have to look through a log file on the slave. Here's what the reporting currently looks like:

        # ./run spark.examples.ExceptionHandlingTest master@203.0.113.1:5050
        [...]
        11/10/26 21:04:13 INFO spark.SimpleJob: Lost TID 1 (task 0:1)
        11/10/26 21:04:13 INFO spark.SimpleJob: Loss was due to java.lang.Exception: Testing exception handling
        [...]
        11/10/26 21:04:16 INFO spark.SparkContext: Job finished in 5.988547328 s
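
A minimal sketch of the periodic-reprint idea from the first entry above (illustrative names, not the actual executor code): print the full stack trace the first time an exception is seen and at most once per interval thereafter, and the short form in between.

    import scala.collection.mutable

    object ExceptionPrinter {
      // Interval comes from the property named in the commit message;
      // the 5-second default matches the described behavior.
      private val intervalMs =
        System.getProperty("spark.logging.exceptionPrintInterval", "5000").toLong
      private val lastFullPrint = mutable.Map.empty[String, Long]

      def report(e: Throwable) {
        val key = e.getClass.getName + ": " + e.getMessage
        val now = System.currentTimeMillis()
        lastFullPrint.get(key) match {
          case Some(t) if now - t < intervalMs =>
            println(key)                 // abbreviated form
          case _ =>
            e.printStackTrace()          // full form, including stack trace
            lastFullPrint(key) = now     // remember when we last printed it
        }
      }
    }
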
- Nov 09, 2011
  - Matei Zaharia authored
- Nov 08, 2011
  - Matei Zaharia authored
  - Matei Zaharia authored
  - Ankur Dave authored
  - Matei Zaharia authored
  - Matei Zaharia authored: Before, the cleaner attempted to clone $outer objects that were classes (as opposed to nested closures) and preserve only their used fields. That was bad because it would miss fields that are accessed indirectly by methods, and in general it would confuse user code. Now we keep a reference to those objects without cloning them. This is not perfect, because the user still needs to be careful about what they carry along into closures, but it works better in some cases that seemed confusing before. We need to improve the documentation on what variables get passed along with a closure, and possibly add some debugging tools for it as well. Fixes #71 -- that code now works in the REPL. (An illustration follows the list.)
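
A hedged illustration of the capture problem the last entry describes (the class and field names are hypothetical): referencing a field of an enclosing class from a closure pulls in the whole $outer instance, while copying the field to a local val first keeps the closure small.

    // Hypothetical example, not code from the commit.
    class Pipeline(val multiplier: Int) {
      val hugeCache = new Array[Byte](64 * 1024 * 1024)

      // n => n * multiplier really means n => n * this.multiplier, so the
      // closure captures `this` ($outer) and hugeCache travels with it.
      def badScale(nums: Seq[Int]) = nums.map(n => n * multiplier)

      // Copying the field into a local val means the closure captures only
      // an Int; there is no $outer reference left for the cleaner to handle.
      def goodScale(nums: Seq[Int]) = {
        val m = multiplier
        nums.map(n => n * m)
      }
    }
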
- Nov 07, 2011
  - Matei Zaharia authored