Commits · 2484b846788ca2ef5f0cebd625154a0333d50797 · cs525-sp18-g07 / spark

Oct 05, 2013
- Bumping EC2 default version in master to `0.8.0`. · 2484b846
  Patrick Wendell authored 11 years ago
  
  2484b846
Oct 03, 2013

Merge pull request #26 from Du-Li/master · 232765f7

Matei Zaharia authored 11 years ago

fixed a wildcard bug in make-distribution.sh; ask sbt to check local
maven repo in project/SparkBuild.scala

(1) fixed a wildcard bug in make-distribution.sh:
With the wildcard * inside quotes, the cp command failed. It worked after
moving the wildcard outside the quotes.

(2) ask sbt to check local maven repo in SparkBuild.scala:
To build Spark (0.9.0-SNAPSHOT) with the HEAD of mesos (0.15.0), I must
do "make maven-install" under mesos/build, which publishes the java .jar
file under ~/.m2. However, when building Spark (after pointing mesos to
version 0.15.0), sbt uses ivy which by default only checks ~/.ivy2. This
change is to tell sbt to also check ~/.m2.
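
The quoting issue in (1) can be illustrated outside the shell. The following is a minimal Python sketch, not the actual make-distribution.sh change: `copy_candidates` is a hypothetical helper that models what cp would receive. An unquoted wildcard is expanded to the matching files before cp runs, while a quoted wildcard reaches cp as a literal name that matches no file.

```python
import glob
import os
import tempfile


def copy_candidates(pattern, expand):
    """Return the paths a copy command would receive for `pattern`.

    expand=True models an unquoted wildcard (expanded before cp runs).
    expand=False models a quoted wildcard (cp gets the literal string).
    """
    return sorted(glob.glob(pattern)) if expand else [pattern]


with tempfile.TemporaryDirectory() as jars:
    # Create two empty files standing in for the jars to be copied.
    for name in ("a.jar", "b.jar"):
        open(os.path.join(jars, name), "w").close()

    pattern = os.path.join(jars, "*.jar")
    quoted = copy_candidates(pattern, expand=False)
    unquoted = copy_candidates(pattern, expand=True)

    # The literal "*.jar" path does not exist, so a copy of it would fail.
    quoted_exists = [os.path.exists(p) for p in quoted]
    unquoted_names = [os.path.basename(p) for p in unquoted]

print(quoted_exists, unquoted_names)
```

In the quoted case the single candidate is the pattern itself, which is why the copy fails. In the unquoted case each matching file is passed separately.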

232765f7

Merge pull request #25 from CruncherBigData/master · 405e69bb
Matei Zaharia authored 11 years ago
```
Update README: updated the link
```
405e69bb
Merge pull request #28 from tgravescs/sparYarnAppName · 49dbfccf
Matei Zaharia authored 11 years ago
```
Allow users to set the application name for Spark on Yarn
```
49dbfccf
Add default value to usage statement · c021b8c2
tgravescs authored 11 years ago

c021b8c2

Oct 02, 2013

Merge pull request #10 from kayousterhout/results_through-bm · e597ea34

Matei Zaharia authored 11 years ago

Send Task results through the block manager when larger than Akka frame size (fixes SPARK-669).

This change requires adding an extra failure mode: tasks can complete
successfully, but the result gets lost or flushed from the block manager
before it's been fetched.

This change also moves the deserialization of tasks into a separate thread, so it's no longer part of the DAG scheduler's tight loop. This should improve scheduler throughput, particularly when tasks are sending back large results.

Thanks Josh for writing the original version of this patch!

This is duplicated from the mesos/spark repo: https://github.com/mesos/spark/pull/835
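
The idea described in this message can be sketched as follows. This is an illustrative Python model only, not Spark's implementation: `FRAME_SIZE`, `BlockManager`, `ResultLost`, `send_result`, and `fetch_result` are all hypothetical names. Small results travel directly in the message, large results are stored in a block store and only their id is sent, and the fetch can fail if the block was dropped in the meantime.

```python
import pickle

FRAME_SIZE = 64  # hypothetical byte limit standing in for the Akka frame size


class ResultLost(Exception):
    """The task succeeded, but its stored result is gone before the fetch."""


class BlockManager:
    """Toy in-memory block store whose blocks may be evicted at any time."""

    def __init__(self):
        self._blocks = {}

    def put(self, block_id, data):
        self._blocks[block_id] = data

    def get(self, block_id):
        return self._blocks.get(block_id)

    def evict(self, block_id):
        self._blocks.pop(block_id, None)


def send_result(task_id, value, block_manager):
    """Send small results directly; store large ones and send only the id."""
    payload = pickle.dumps(value)
    if len(payload) <= FRAME_SIZE:
        return ("direct", payload)
    block_id = f"taskresult_{task_id}"
    block_manager.put(block_id, payload)
    return ("indirect", block_id)


def fetch_result(message, block_manager):
    """Deserialize a result, fetching it from the block store if needed."""
    kind, body = message
    if kind == "direct":
        return pickle.loads(body)
    payload = block_manager.get(body)
    if payload is None:
        # The extra failure mode: the result was lost after the task finished.
        raise ResultLost(body)
    return pickle.loads(payload)


bm = BlockManager()
small = send_result(1, 7, bm)                  # fits in the frame
large = send_result(2, list(range(1000)), bm)  # too big, goes via the store
lost = send_result(3, list(range(1000)), bm)
bm.evict(lost[1])                              # simulate the block being flushed
```

Here `fetch_result(large, bm)` returns the full list, while `fetch_result(lost, bm)` raises `ResultLost`, which a scheduler would have to treat as a failure even though the task itself completed.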

e597ea34

Allow users to set the application name for Spark on Yarn · bc3b20ab
tgravescs authored 11 years ago

bc3b20ab

Oct 01, 2013
- ask ivy/sbt to check local maven repo under ~/.m2 · 9fd6bba6
  Du Li authored 11 years ago
  
  9fd6bba6
- fixed a bug of using wildcard in quotes · 0d19f00e
  Du Li authored 11 years ago
  
  0d19f00e
- Update README · c85f7205
  CruncherBigData authored 11 years ago
  
  c85f7205
- Added additional unit test for repeated task failures · 0dcad2ed
  Kay Ousterhout authored 11 years ago
  
  0dcad2ed
- Fixed compilation errors and broken test. · dea4677c
  Kay Ousterhout authored 11 years ago
  
  dea4677c
Sep 30, 2013

Merge remote-tracking branch 'upstream/master' into results_through-bm · 8deda427

Kay Ousterhout authored 11 years ago

Conflicts:
core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterScheduler.scala
core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterTaskSetManager.scala
core/src/main/scala/org/apache/spark/scheduler/local/LocalTaskSetManager.scala

8deda427

Addressed Matei's code review comments · 58b764b7
Kay Ousterhout authored 11 years ago

58b764b7

Sep 26, 2013

Merge pull request #17 from rxin/optimize · 714fdabd
Reynold Xin authored 11 years ago
```
Remove -optimize flag
```
714fdabd
Merge pull request #16 from pwendell/master · 13eced72
Reynold Xin authored 11 years ago
```
Bug fix in master build
```
13eced72

Merge pull request #14 from kayousterhout/untangle_scheduler · 70a0b993

Reynold Xin authored 11 years ago

Improved organization of scheduling packages.

This commit does not change any code -- only file organization.
Please let me know if there was some masterminded strategy behind
the existing organization that I failed to understand!

There are two components of this change:
(1) Moving files out of the cluster package, and down
a level to the scheduling package. These files are all used by
the local scheduler in addition to the cluster scheduler(s), so
should not be in the cluster package. As a result of this change,
none of the files in the local package reference files in the
cluster package.

(2) Moving the mesos package to within the cluster package.
The mesos scheduling code is for a cluster, and represents a
specific case of cluster scheduling (the Mesos-related classes
often subclass cluster scheduling classes). Thus, the most logical
place for it seems to be within the cluster package.

The one thing about the scheduling code that seems a little funny to me
is the naming of the SchedulerBackends. The StandaloneSchedulerBackend
is not just for Standalone mode, but instead is used by Mesos coarse grained
mode and Yarn, and the backend that *is* just for Standalone mode is instead
called SparkDeploySchedulerBackend. I didn't change this because I wasn't
sure if there was a reason for this naming that I'm just not aware of.

70a0b993

Merge pull request #670 from jey/ec2-ssh-improvements · 76677b8f
Reynold Xin authored 11 years ago
```
EC2 SSH improvements
```
76677b8f
Removed scala -optimize flag. · 3f283278
Reynold Xin authored 11 years ago

3f283278
Merge pull request #930 from holdenk/master · c514cd15
Reynold Xin authored 11 years ago
```
Add mapPartitionsWithIndex
```
c514cd15
Bug fix in master build · e2ff59af
Patrick Wendell authored 11 years ago

e2ff59af

Merge pull request #7 from wannabeast/memorystore-fixes · 560ee5c9

Reynold Xin authored 11 years ago

some minor fixes to MemoryStore

This is a repeat of #5, moved to its own branch in my repo.

This makes all updates to on ; it skips on synchronizing the reads where it can get away with it.

560ee5c9

Merge pull request #9 from rxin/limit · 6566a19b
Patrick Wendell authored 11 years ago
```
Smarter take/limit implementation.
```
6566a19b

Sep 25, 2013

Improved organization of scheduling packages. · d85fe41b

Kay Ousterhout authored 11 years ago

This commit does not change any code -- only file organization.

There are two components of this change:
(1) Moving files out of the cluster package, and down
a level to the scheduling package. These files are all used by
the local scheduler in addition to the cluster scheduler(s), so
should not be in the cluster package. As a result of this change,
none of the files in the local package reference files in the
cluster package.

(2) Moving the mesos package to within the cluster package.
The mesos scheduling code is for a cluster, and represents a
specific case of cluster scheduling (the Mesos-related classes
often subclass cluster scheduling classes). Thus, the most logical
place for it is within the cluster package.

d85fe41b

Sep 24, 2013
- Merge remote-tracking branch 'apache-github/pr/13' into HEAD · 9d34838b
  Patrick Wendell authored 11 years ago
  
  9d34838b
- Update build version in master · 6079721f
  Patrick Wendell authored 11 years ago
  
  6079721f
Sep 23, 2013
- Fix formatting :) · 0cef6835
  Holden Karau authored 11 years ago
  
  0cef6835
- Merge remote-tracking branch 'pr/12' · 7220e8f9
  Reynold Xin authored 11 years ago
  
  Fix spacing so java.io.tmpdir doesn't run on with SPARK_JAVA_OPTS
  7220e8f9
- Fix spacing so that the java.io.tmpdir doesn't run on with SPARK_JAVA_OPTS · a314b307
  Y.CORP.YAHOO.COM\tgraves authored 11 years ago
  
  a314b307
- Merge branch 'master' of https://git-wip-us.apache.org/repos/asf/incubator-spark · 0d2e5c3e
  Reynold Xin authored 11 years ago
  
  0d2e5c3e
- Merge branch 'master' of github.com:markhamstra/incubator-spark · ff540a01
  Reynold Xin authored 11 years ago
  
  ff540a01
- Merge branch 'master' of github.com:mesos/spark · f4dc9d37
  Reynold Xin authored 11 years ago
  
  f4dc9d37
Sep 22, 2013
- Send Task results through the block manager when larger than Akka frame size. · c75eb14f
  Kay Ousterhout authored 11 years ago
  
  This change requires adding an extra failure mode: tasks can complete successfully, but the result gets lost or flushed from the block manager before it's been fetched.
  c75eb14f
- Switch indent from 2 to 4 spaces · 7fe0b0ff
  Holden Karau authored 11 years ago
  
  7fe0b0ff
- Merge pull request #928 from jerryshao/fairscheduler-refactor · 834686b1
  Reynold Xin authored 11 years ago
  
  Refactor FairSchedulableBuilder
  834686b1
- Change Exception to NoSuchElementException and minor style fix · 77e9da1f
  jerryshao authored 11 years ago
  
  77e9da1f
- Remove infix style and others · 85024acd
  jerryshao authored 11 years ago
  
  85024acd
- Refactor FairSchedulableBuilder: · 5850f599
  jerryshao authored 11 years ago
  
  1. Configuration can be read from classpath if not set explicitly.
  2. Add missing close handler.
  5850f599
- Merge pull request #937 from jerryshao/localProperties-fix · a2ea069a
  Reynold Xin authored 11 years ago
  
  Fix PR926 local properties issues in Spark Streaming-like scenarios
  a2ea069a
- Merge pull request #941 from ilikerps/master · f06f2da2
  Reynold Xin authored 11 years ago
  
  Add "org.apache." prefix to packages in spark-class
  f06f2da2