- Jan 07, 2014
-
-
Mark Hamstra authored
-
Reynold Xin authored
MLlib-16 bugfix Bug fix: https://spark-project.atlassian.net/browse/MLLIB-16 Hi, I fixed the bug and added a test suite for `GradientDescent`. There are two checks in the test case. First, the final loss must be lower than the initial one. Second, the trend of the loss sequence should be decreasing, i.e., at least 80% of iterations have lower losses than their prior iterations. Thanks!
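The two checks described in the message can be sketched in plain Java (the names below are illustrative, not the actual Spark test code, which operates on `GradientDescent`'s loss history):

```java
import java.util.List;

// Sketch of the two convergence checks: (1) the final loss must be lower
// than the initial one, and (2) at least a given fraction of iterations
// (80% in the test) must have a lower loss than the previous iteration.
class LossTrendCheck {
    static boolean finalLossImproved(List<Double> losses) {
        return losses.get(losses.size() - 1) < losses.get(0);
    }

    static boolean mostlyDecreasing(List<Double> losses, double fraction) {
        int decreasing = 0;
        for (int i = 1; i < losses.size(); i++) {
            if (losses.get(i) < losses.get(i - 1)) decreasing++;
        }
        return decreasing >= fraction * (losses.size() - 1);
    }
}
```

The 80% threshold tolerates the occasional uphill step that stochastic gradient descent produces, while still failing a run whose loss does not trend downward.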
-
Reynold Xin authored
Add comments about SPARK_WORKER_DIR. This env variable seems to be forgotten, but in many cases we need to set it; e.g. on EC2, we have to move the large application log files from the EBS volume to the ephemeral storage.
-
CodingCat authored
-
Reynold Xin authored
Suggested small changes to Java code for slightly more standard style, encapsulation, and in some cases performance. Sorry if this is too abrupt or not a welcome set of changes, but I thought I'd see if I could contribute a little. I'm a Java developer and just getting seriously into Spark, so I thought I'd suggest a number of small changes to the couple of Java parts of the code to make them a little tighter, more standard, and even a bit faster. Feel free to take all, some, or none of this. Happy to explain any of it.
-
Reynold Xin authored
spark -> org.apache.spark Changed the package name spark to org.apache.spark, which was missing in some of the files.
-
Sean Owen authored
-
Patrick Wendell authored
Conf improvements There are two new features. 1. Allow users to set arbitrary akka configurations via spark conf. 2. Allow configuration to be printed in logs for diagnosis.
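The first feature, passing arbitrary Akka settings through a generic config, can be sketched as a simple prefix filter (illustrative only; this is not the actual SparkConf API):

```java
import java.util.Map;
import java.util.TreeMap;

// Sketch: forward every "akka."-prefixed key from a generic key/value
// config verbatim into the settings handed to the underlying Akka system,
// so users can tune Akka without Spark enumerating each option.
class ConfPassthrough {
    static Map<String, String> akkaSettings(Map<String, String> conf) {
        Map<String, String> out = new TreeMap<>();
        for (Map.Entry<String, String> e : conf.entrySet()) {
            if (e.getKey().startsWith("akka.")) {
                out.put(e.getKey(), e.getValue());
            }
        }
        return out;
    }
}
```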
-
Reynold Xin authored
Add a script to download sbt if it is not present on the system. As per the discussion on the dev mailing list, this script will use the system sbt if present, or otherwise attempt to install the sbt launcher. The fallback error message, in the event it fails, instructs the user to install sbt. While the URLs it fetches from aren't controlled by the Spark project directly, they are stable and the current authoritative sources.
-
Holden Karau authored
-
Prashant Sharma authored
-
Prashant Sharma authored
-
Prashant Sharma authored
-
Holden Karau authored
-
prabeesh authored
-
- Jan 06, 2014
-
-
Patrick Wendell authored
Update stop-slaves.sh. The most recent version changed the directory structure, but the "sbin/stop-all.sh" script was not updated accordingly. This mistake means "sbin/stop-all.sh" can't stop the slave nodes.
-
sproblvem authored
-
Patrick Wendell authored
Fix test breaking downstream builds. This wasn't detected in the pull-request builder because it manually sets SPARK_HOME. I'm going to change that (it shouldn't do this) to make it like the other builds.
-
Patrick Wendell authored
-
Patrick Wendell authored
Made Java options be applied during tests so that they become self-explanatory.
-
Patrick Wendell authored
SPARK-1005 Ning upgrade
-
Patrick Wendell authored
Clarify spark.cores.max in docs It controls the count of cores across the cluster, not on a per-machine basis.
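The cluster-wide semantics can be shown with a bit of illustrative arithmetic (a sketch of the documented behavior, not Spark code): on a cluster of 4 workers with 4 cores each, `spark.cores.max=8` grants 8 cores total, not 8 per machine.

```java
// Illustrative only: spark.cores.max caps the total number of cores an
// application may claim across the whole cluster, not per worker.
class CoresMax {
    static int coresGranted(int workers, int coresPerWorker, int coresMax) {
        return Math.min(workers * coresPerWorker, coresMax);
    }
}
```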
-
Patrick Wendell authored
Change protobuf version for yarn alpha back to 2.4.1 The maven build for yarn-alpha uses the wrong protobuf version and hence the generated assembly jar doesn't work with Hadoop 0.23. Removing the setting for the yarn-alpha profile since the default protobuf version is 2.4.1 at the top of the pom file.
-
Patrick Wendell authored
Fix handling of empty SPARK_EXAMPLES_JAR. Currently, if SPARK_EXAMPLES_JAR is left unset you get a null pointer exception when running the examples (at least on Spark on YARN). The null now gets turned into a string of "null" when it's put into the SparkConf, so addJar no longer properly ignores it. This fixes that so that it can be left unset.
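The "null"-string pitfall is easy to reproduce in plain Java (a sketch, not Spark's actual code; the method names are hypothetical): stringifying a null reference yields the four characters "null", which defeats a later null check, so unset values have to be filtered out before conversion.

```java
// Demonstrates the pitfall: String.valueOf on a null reference produces
// the literal text "null", so a later == null check no longer matches.
class NullJarPitfall {
    static String unsafe(String examplesJar) {
        return String.valueOf((Object) examplesJar); // null becomes "null"
    }

    // Sketch of the fix: treat unset, empty, and "null" values as absent.
    static boolean shouldAddJar(String examplesJar) {
        return examplesJar != null
            && !examplesJar.isEmpty()
            && !examplesJar.equals("null");
    }
}
```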
-
Thomas Graves authored
-
Andrew Ash authored
-
Sean Owen authored
-
Thomas Graves authored
-
Prashant Sharma authored
-
Prashant Sharma authored
-
Xusen Yin authored
-
Holden Karau authored
-
Patrick Wendell authored
Quiet ERROR-level Akka logs. This fixes an issue I've seen where Akka logs a bunch of things at ERROR level when connecting to a standalone cluster, even in the normal case. I noticed that even when lifecycle logging was disabled, the Netty code inside of Akka still logged away via Akka's EndpointWriter class. There are also some other log streams, which I think are new in Akka 2.2.1, that I've disabled. Finally, I added some better logging to the standalone client. This makes it clearer what is going on when a connection failure occurs. Previously it never explicitly said if a connection attempt had failed. The commit messages here have some more detail.
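Akka's own logging knobs differ, but the underlying idea, raising a chatty logger's threshold so routine connection noise is suppressed while other loggers stay intact, can be sketched with `java.util.logging` (an analogy only, not the Akka configuration this commit touches):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Handler;
import java.util.logging.Level;
import java.util.logging.LogRecord;
import java.util.logging.Logger;

// Sketch: silence one noisy named logger while leaving others alone,
// analogous to quieting the EndpointWriter output described above.
class QuietLogger {
    static final List<String> captured = new ArrayList<>();

    static Logger make(String name, Level threshold) {
        Logger logger = Logger.getLogger(name);
        logger.setUseParentHandlers(false);
        logger.setLevel(threshold); // Level.OFF discards everything
        logger.addHandler(new Handler() {
            @Override public void publish(LogRecord r) {
                if (isLoggable(r)) captured.add(r.getMessage());
            }
            @Override public void flush() {}
            @Override public void close() {}
        });
        return logger;
    }
}
```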
-
Holden Karau authored
-
Holden Karau authored
-
Holden Karau authored
-
- Jan 05, 2014
-
-
Patrick Wendell authored
-
Xusen Yin authored
-
Reynold Xin authored
Removing SPARK_EXAMPLES_JAR in the code. This rewrites all of the examples to use the `SparkContext.jarOfClass` mechanism for loading the examples jar. This is necessary for environments like YARN and standalone mode, where example programs will be submitted from inside the cluster rather than at the client using `./spark-example`. This still leaves SPARK_EXAMPLES_JAR in place in the shell scripts for setting up the classpath if `./spark-example` is run.
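The jar-of-class idea can be sketched in plain Java via the class's protection domain (a sketch of the general technique; `SparkContext.jarOfClass` is the actual API the commit uses). Classes loaded by the JDK's bootstrap loader have no code source, so the lookup is naturally optional:

```java
import java.security.CodeSource;
import java.util.Optional;

// Sketch: find the jar (or classes directory) a class was loaded from,
// if any. JDK bootstrap classes such as java.lang.String have none.
class JarLocator {
    static Optional<String> jarOf(Class<?> cls) {
        CodeSource src = cls.getProtectionDomain().getCodeSource();
        return src == null
            ? Optional.empty()
            : Optional.of(src.getLocation().toString());
    }
}
```

Calling `JarLocator.jarOf(MyExample.class)` from inside a packaged application yields the path of the jar that contains it, which is exactly what lets the examples locate their own jar without SPARK_EXAMPLES_JAR.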
-
Reynold Xin authored
Fall back to zero-arg constructor for Serializer initialization if there is no constructor that accepts SparkConf. This maintains backward compatibility with older serializers implemented by users.
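The fallback pattern can be sketched with plain reflection (illustrative; the real code instantiates `Serializer` subclasses and the config class is SparkConf, for which `Conf` below is a stand-in):

```java
import java.lang.reflect.Constructor;

// Sketch: prefer a one-arg constructor taking a config object; if the
// class only offers a zero-arg constructor, fall back to it so that
// older user-written implementations keep working.
class SerializerFactory {
    public static class Conf {} // stand-in for the real config class

    static <T> T instantiate(Class<T> cls, Conf conf) {
        try {
            try {
                return cls.getConstructor(Conf.class).newInstance(conf);
            } catch (NoSuchMethodException e) {
                return cls.getConstructor().newInstance(); // zero-arg fallback
            }
        } catch (ReflectiveOperationException e) {
            throw new RuntimeException(e);
        }
    }

    // Two example user classes: one pre-dating the config parameter,
    // one accepting it.
    public static class OldStyle {
        public OldStyle() {}
    }
    public static class NewStyle {
        public final Conf conf;
        public NewStyle(Conf conf) { this.conf = conf; }
    }
}
```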
-