Commits · 7015f6f0e7889616db9595b4e68b5a5b5ffe921a · cs525-sp18-g07 / spark

Apr 25, 2017
- Preparing development version 2.1.2-SNAPSHOT · 8460b090
  Patrick Wendell authored 7 years ago
  
  8460b090
- Preparing Spark release v2.1.1-rc4 · 267aca5b
  Patrick Wendell authored 7 years ago
  
  View commits for tag v2.1.1 v2.1.1
  
  267aca5b
Apr 14, 2017
- Preparing development version 2.1.2-SNAPSHOT · 2a3e50e2
  Patrick Wendell authored 7 years ago
  
  2a3e50e2
- Preparing Spark release v2.1.1-rc3 · 2ed19cff
  Patrick Wendell authored 7 years ago
  
  2ed19cff
Mar 28, 2017
- Preparing development version 2.1.2-SNAPSHOT · 4964dbed
  Patrick Wendell authored 7 years ago
  
  4964dbed
- Preparing Spark release v2.1.1-rc2 · 02b165dc
  Patrick Wendell authored 7 years ago
  
  02b165dc
Mar 21, 2017
- Preparing development version 2.1.2-SNAPSHOT · c4d2b833
  Patrick Wendell authored 8 years ago
  
  c4d2b833
- Preparing Spark release v2.1.1-rc1 · 30abb95c
  Patrick Wendell authored 8 years ago
  
  30abb95c
Dec 15, 2016
- Preparing development version 2.1.1-SNAPSHOT · 483624c2
  Patrick Wendell authored 8 years ago
  
  483624c2
- Preparing Spark release v2.1.0-rc5 · cd0a0836
  Patrick Wendell authored 8 years ago
  
  View commits for tag v2.1.0 v2.1.0
  
  cd0a0836
- Preparing development version 2.1.1-SNAPSHOT · 62a6577b
  Patrick Wendell authored 8 years ago
  
  62a6577b
- Preparing Spark release v2.1.0-rc4 · ec317265
  Patrick Wendell authored 8 years ago
  
  ec317265
- Preparing development version 2.1.1-SNAPSHOT · a7364a82
  Patrick Wendell authored 8 years ago
  
  a7364a82
- Preparing Spark release v2.1.0-rc3 · ef2ccf94
  Patrick Wendell authored 8 years ago
  
  ef2ccf94
Dec 08, 2016
- Preparing development version 2.1.1-SNAPSHOT · 48aa6775
  Patrick Wendell authored 8 years ago
  
  48aa6775
- Preparing Spark release v2.1.0-rc2 · 08071749
  Patrick Wendell authored 8 years ago
  
  08071749
Nov 28, 2016
- Preparing development version 2.1.1-SNAPSHOT · 75d73d13
  Patrick Wendell authored 8 years ago
  
  75d73d13
- Preparing Spark release v2.1.0-rc1 · 80aabc0b
  Patrick Wendell authored 8 years ago
  
  80aabc0b
Jul 19, 2016

[SPARK-16535][BUILD] In pom.xml, remove groupId which is redundant definition... · 21a6dd2a

Xin Ren authored 8 years ago

[SPARK-16535][BUILD] In pom.xml, remove groupId which is redundant definition and inherited from the parent

https://issues.apache.org/jira/browse/SPARK-16535

## What changes were proposed in this pull request?

When I scan through the pom.xml of sub projects, I found this warning as below and attached screenshot
```
Definition of groupId is redundant, because it's inherited from the parent
```
![screen shot 2016-07-13 at 3 13 11 pm](https://cloud.githubusercontent.com/assets/3925641/16823121/744f893e-4916-11e6-8a52-042f83b9db4e.png)

I've tried to remove some of the lines with groupId definition, and the build on my local machine is still ok.
```
<groupId>org.apache.spark</groupId>
```
As I just find now `<maven.version>3.3.9</maven.version>` is being used in Spark 2.x, and Maven-3 supports versionless parent elements: Maven 3 will remove the need to specify the parent version in sub modules. THIS is great (in Maven 3.1).

ref: http://stackoverflow.com/questions/3157240/maven-3-worth-it/3166762#3166762

## How was this patch tested?

I've tested by re-building the project, and build succeeded.

Author: Xin Ren <iamshrek@126.com>

Closes #14189 from keypointt/SPARK-16535.

21a6dd2a

Jul 11, 2016

[SPARK-16477] Bump master version to 2.1.0-SNAPSHOT · ffcb6e05

Reynold Xin authored 8 years ago

## What changes were proposed in this pull request?
After SPARK-16476 (committed earlier today as #14128), we can finally bump the version number.

## How was this patch tested?
N/A

Author: Reynold Xin <rxin@databricks.com>

Closes #14130 from rxin/SPARK-16477.

ffcb6e05

Apr 02, 2016

[MINOR][DOCS] Use multi-line JavaDoc comments in Scala code. · 4a6e78ab

Dongjoon Hyun authored 8 years ago

## What changes were proposed in this pull request?

This PR aims to fix all Scala-Style multiline comments into Java-Style multiline comments in Scala codes.
(All comment-only changes over 77 files: +786 lines, −747 lines)

## How was this patch tested?

Manual.

Author: Dongjoon Hyun <dongjoon@apache.org>

Closes #12130 from dongjoon-hyun/use_multiine_javadoc_comments.

4a6e78ab

Mar 16, 2016

[SPARK-13920][BUILD] MIMA checks should apply to @Experimental and @DeveloperAPI APIs · 3c578c59

Dongjoon Hyun authored 9 years ago

## What changes were proposed in this pull request?

We are able to change `Experimental` and `DeveloperAPI` API freely but also should monitor and manage those API carefully. This PR for [SPARK-13920](https://issues.apache.org/jira/browse/SPARK-13920) enables MiMa check and adds filters for them.

## How was this patch tested?

Pass the Jenkins tests (including MiMa).

Author: Dongjoon Hyun <dongjoon@apache.org>

Closes #11751 from dongjoon-hyun/SPARK-13920.

3c578c59

Mar 11, 2016

[SPARK-13294][PROJECT INFRA] Remove MiMa's dependency on spark-class / Spark assembly · 6ca990fb

Josh Rosen authored 9 years ago

This patch removes the need to build a full Spark assembly before running the `dev/mima` script.

- I modified the `tools` project to remove a direct dependency on Spark, so `sbt/sbt tools/fullClasspath` will now return the classpath for the `GenerateMIMAIgnore` class itself plus its own dependencies.
   - This required me to delete two classes full of dead code that we don't use anymore
- `GenerateMIMAIgnore` now uses [ClassUtil](http://software.clapper.org/classutil/) to find all of the Spark classes rather than our homemade JAR traversal code. The problem in our own code was that it didn't handle folders of classes properly, which is necessary in order to generate excludes with an assembly-free Spark build.
- `./dev/mima` no longer runs through `spark-class`, eliminating the need to reason about classpath ordering between `SPARK_CLASSPATH` and the assembly.

Author: Josh Rosen <joshrosen@databricks.com>

Closes #11178 from JoshRosen/remove-assembly-in-run-tests.

6ca990fb

Mar 03, 2016

[HOT-FIX] Recover some deprecations for 2.10 compatibility. · 02b7677e

Dongjoon Hyun authored 9 years ago

## What changes were proposed in this pull request?

#11479 [SPARK-13627] broke 2.10 compatibility: [2.10-Build](https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Compile/job/spark-master-compile-maven-scala-2.10/292/console)
At this moment, we need to support both 2.10 and 2.11.
This PR recovers some deprecated methods which were replace by [SPARK-13627].

## How was this patch tested?

Jenkins build: Both 2.10, 2.11.

Author: Dongjoon Hyun <dongjoon@apache.org>

Closes #11488 from dongjoon-hyun/hotfix_compatibility_with_2.10.

02b7677e

Mar 02, 2016

[SPARK-13627][SQL][YARN] Fix simple deprecation warnings. · 9c274ac4

Dongjoon Hyun authored 9 years ago

## What changes were proposed in this pull request?

This PR aims to fix the following deprecation warnings.
  * MethodSymbolApi.paramss--> paramLists
  * AnnotationApi.tpe -> tree.tpe
  * BufferLike.readOnly -> toList.
  * StandardNames.nme -> termNames
  * scala.tools.nsc.interpreter.AbstractFileClassLoader -> scala.reflect.internal.util.AbstractFileClassLoader
  * TypeApi.declarations-> decls

## How was this patch tested?

Check the compile build log and pass the tests.
```
./build/sbt
```

Author: Dongjoon Hyun <dongjoon@apache.org>

Closes #11479 from dongjoon-hyun/SPARK-13627.

9c274ac4

Jan 30, 2016

[SPARK-6363][BUILD] Make Scala 2.11 the default Scala version · 289373b2

Josh Rosen authored 9 years ago

This patch changes Spark's build to make Scala 2.11 the default Scala version. To be clear, this does not mean that Spark will stop supporting Scala 2.10: users will still be able to compile Spark for Scala 2.10 by following the instructions on the "Building Spark" page; however, it does mean that Scala 2.11 will be the default Scala version used by our CI builds (including pull request builds).

The Scala 2.11 compiler is faster than 2.10, so I think we'll be able to look forward to a slight speedup in our CI builds (it looks like it's about 2X faster for the Maven compile-only builds, for instance).

After this patch is merged, I'll update Jenkins to add new compile-only jobs to ensure that Scala 2.10 compilation doesn't break.

Author: Josh Rosen <joshrosen@databricks.com>

Closes #10608 from JoshRosen/SPARK-6363.

289373b2

Jan 08, 2016

[SPARK-4819] Remove Guava's "Optional" from public API · 659fd9d0

Sean Owen authored 9 years ago

Replace Guava `Optional` with (an API clone of) Java 8 `java.util.Optional` (edit: and a clone of Guava `Optional`)

See also https://github.com/apache/spark/pull/10512

Author: Sean Owen <sowen@cloudera.com>

Closes #10513 from srowen/SPARK-4819.

659fd9d0

Dec 31, 2015

[SPARK-3873][STREAMING] Import order fixes for streaming. · efb10cc9

Marcelo Vanzin authored 9 years ago

Also included a few miscelaneous other modules that had very few violations.

Author: Marcelo Vanzin <vanzin@cloudera.com>

Closes #10532 from vanzin/SPARK-3873-streaming.

efb10cc9

Dec 19, 2015
- Bump master version to 2.0.0-SNAPSHOT. · f496031b
  Reynold Xin authored 9 years ago
  
  Author: Reynold Xin <rxin@databricks.com> Closes #10387 from rxin/version-bump.
  f496031b
Nov 17, 2015

[SPARK-11732] Removes some MiMa false positives · fa603e08

Timothy Hunter authored 9 years ago

This adds an extra filter for private or protected classes. We only filter for package private right now.

Author: Timothy Hunter <timhunter@databricks.com>

Closes #9697 from thunterdb/spark-11732.

fa603e08

Sep 15, 2015
- Update version to 1.6.0-SNAPSHOT. · 09b7e7c1
  Reynold Xin authored 9 years ago
  
  Author: Reynold Xin <rxin@databricks.com> Closes #8350 from rxin/1.6.
  09b7e7c1
Aug 25, 2015

[SPARK-9613] [CORE] Ban use of JavaConversions and migrate all existing uses to JavaConverters · 69c9c177

Sean Owen authored 9 years ago

Replace `JavaConversions` implicits with `JavaConverters`

Most occurrences I've seen so far are necessary conversions; a few have been avoidable. None are in critical code as far as I see, yet.

Author: Sean Owen <sowen@cloudera.com>

Closes #8033 from srowen/SPARK-9613.

69c9c177

Jul 16, 2015

[SPARK-9015] [BUILD] Clean project import in scala ide · b536d5dc

Jan Prach authored 9 years ago

Cleanup maven for a clean import in scala-ide / eclipse.

* remove groovy plugin which is really not needed at all
* add-source from build-helper-maven-plugin is not needed as recent version of scala-maven-plugin do it automatically
* add lifecycle-mapping plugin to hide a few useless warnings from ide

Author: Jan Prach <jendap@gmail.com>

Closes #7375 from jendap/clean-project-import-in-scala-ide and squashes the following commits:

c4b4c0f [Jan Prach] fix whitespaces
5a83e07 [Jan Prach] Revert "remove java compiler warnings from java tests"
312007e [Jan Prach] scala-maven-plugin itself add scala sources by default
f47d856 [Jan Prach] remove spark-1.4-staging repository
c8a54db [Jan Prach] remove java compiler warnings from java tests
999a068 [Jan Prach] remove some maven warnings in scala ide
80fbdc5 [Jan Prach] remove groovy and gmavenplus plugin

b536d5dc

Jul 14, 2015

[SPARK-8962] Add Scalastyle rule to ban direct use of Class.forName; fix existing uses · 11e5c372

Josh Rosen authored 9 years ago

This pull request adds a Scalastyle regex rule which fails the style check if `Class.forName` is used directly.  `Class.forName` always loads classes from the default / system classloader, but in a majority of cases, we should be using Spark's own `Utils.classForName` instead, which tries to load classes from the current thread's context classloader and falls back to the classloader which loaded Spark when the context classloader is not defined.

<!-- Reviewable:start -->
[<img src="https://reviewable.io/review_button.png" height=40 alt="Review on Reviewable"/>](https://reviewable.io/reviews/apache/spark/7350)
<!-- Reviewable:end -->

Author: Josh Rosen <joshrosen@databricks.com>

Closes #7350 from JoshRosen/ban-Class.forName and squashes the following commits:

e3e96f7 [Josh Rosen] Merge remote-tracking branch 'origin/master' into ban-Class.forName
c0b7885 [Josh Rosen] Hopefully fix the last two cases
d707ba7 [Josh Rosen] Fix uses of Class.forName that I missed in my first cleanup pass
046470d [Josh Rosen] Merge remote-tracking branch 'origin/master' into ban-Class.forName
62882ee [Josh Rosen] Fix uses of Class.forName or add exclusion.
d9abade [Josh Rosen] Add stylechecker rule to ban uses of Class.forName

11e5c372

Jul 10, 2015

[SPARK-7977] [BUILD] Disallowing println · e14b545d

Jonathan Alter authored 9 years ago

Author: Jonathan Alter <jonalter@users.noreply.github.com>

Closes #7093 from jonalter/SPARK-7977 and squashes the following commits:

ccd44cc [Jonathan Alter] Changed println to log in ThreadingSuite
7fcac3e [Jonathan Alter] Reverting to println in ThreadingSuite
10724b6 [Jonathan Alter] Changing some printlns to logs in tests
eeec1e7 [Jonathan Alter] Merge branch 'master' of github.com:apache/spark into SPARK-7977
0b1dcb4 [Jonathan Alter] More println cleanup
aedaf80 [Jonathan Alter] Merge branch 'master' of github.com:apache/spark into SPARK-7977
925fd98 [Jonathan Alter] Merge branch 'master' of github.com:apache/spark into SPARK-7977
0c16fa3 [Jonathan Alter] Replacing some printlns with logs
45c7e05 [Jonathan Alter] Merge branch 'master' of github.com:apache/spark into SPARK-7977
5c8e283 [Jonathan Alter] Allowing println in audit-release examples
5b50da1 [Jonathan Alter] Allowing printlns in example files
ca4b477 [Jonathan Alter] Merge branch 'master' of github.com:apache/spark into SPARK-7977
83ab635 [Jonathan Alter] Fixing new printlns
54b131f [Jonathan Alter] Merge branch 'master' of github.com:apache/spark into SPARK-7977
1cd8a81 [Jonathan Alter] Removing some unnecessary comments and printlns
b837c3a [Jonathan Alter] Disallowing println

e14b545d

Jun 03, 2015

[SPARK-7801] [BUILD] Updating versions to SPARK 1.5.0 · 2c4d550e

Patrick Wendell authored 9 years ago

Author: Patrick Wendell <patrick@databricks.com>

Closes #6328 from pwendell/spark-1.5-update and squashes the following commits:

2f42d02 [Patrick Wendell] A few more excludes
4bebcf0 [Patrick Wendell] Update to RC4
61aaf46 [Patrick Wendell] Using new release candidate
55f1610 [Patrick Wendell] Another exclude
04b4f04 [Patrick Wendell] More issues with transient 1.4 changes
36f549b [Patrick Wendell] [SPARK-7801] [BUILD] Updating versions to SPARK 1.5.0

2c4d550e

May 01, 2015

[SPARK-4550] In sort-based shuffle, store map outputs in serialized form · 0a2b15ce

Sandy Ryza authored 9 years ago

Refer to the JIRA for the design doc and some perf results.

I wanted to call out some of the more possibly controversial changes up front:
* Map outputs are only stored in serialized form when Kryo is in use.  I'm still unsure whether Java-serialized objects can be relocated.  At the very least, Java serialization writes out a stream header which causes problems with the current approach, so I decided to leave investigating this to future work.
* The shuffle now explicitly operates on key-value pairs instead of any object.  Data is written to shuffle files in alternating keys and values instead of key-value tuples.  `BlockObjectWriter.write` now accepts a key argument and a value argument instead of any object.
* The map output buffer can hold a max of Integer.MAX_VALUE bytes.  Though this wouldn't be terribly difficult to change.
* When spilling occurs, the objects that still in memory at merge time end up serialized and deserialized an extra time.

Author: Sandy Ryza <sandy@cloudera.com>

Closes #4450 from sryza/sandy-spark-4550 and squashes the following commits:

8c70dd9 [Sandy Ryza] Fix serialization
9c16fe6 [Sandy Ryza] Fix a couple tests and move getAutoReset to KryoSerializerInstance
6c54e06 [Sandy Ryza] Fix scalastyle
d8462d8 [Sandy Ryza] SPARK-4550

0a2b15ce

Apr 03, 2015

[SPARK-6428] Turn on explicit type checking for public methods. · 82701ee2

Reynold Xin authored 9 years ago

This builds on my earlier pull requests and turns on the explicit type checking in scalastyle.

Author: Reynold Xin <rxin@databricks.com>

Closes #5342 from rxin/SPARK-6428 and squashes the following commits:

7b531ab [Reynold Xin] import ordering
2d9a8a5 [Reynold Xin] jl
e668b1c [Reynold Xin] override
9b9e119 [Reynold Xin] Parenthesis.
82e0cf5 [Reynold Xin] [SPARK-6428] Turn on explicit type checking for public methods.

82701ee2

Apr 02, 2015

[SPARK-6627] Some clean-up in shuffle code. · 6562787b

Patrick Wendell authored 9 years ago

Before diving into review #4450 I did a look through the existing shuffle
code to learn how it works. Unfortunately, there are some very
confusing things in this code. This patch makes a few small changes
to simplify things. It is not easily to concisely describe the changes
because of how convoluted the issues were, but they are fairly small
logically:

1. There is a trait named `ShuffleBlockManager` that only deals with
one logical function which is retrieving shuffle block data given shuffle
block coordinates. This trait has two implementors FileShuffleBlockManager
and IndexShuffleBlockManager. Confusingly the vast majority of those
implementations have nothing to do with this particular functionality.
So I've renamed the trait to ShuffleBlockResolver and documented it.
2. The aforementioned trait had two almost identical methods, for no good
reason. I removed one method (getBytes) and modified callers to use the
other one. I think the behavior is preserved in all cases.
3. The sort shuffle code uses an identifier "0" in the reduce slot of a
BlockID as a placeholder. I made it into a constant since it needs to
be consistent across multiple places.

I think for (3) there is actually a better solution that would avoid the
need to do this type of workaround/hack in the first place, but it's more
complex so I'm punting it for now.

Author: Patrick Wendell <patrick@databricks.com>

Closes #5286 from pwendell/cleanup and squashes the following commits:

c71fbc7 [Patrick Wendell] Open interface back up for testing
f36edd5 [Patrick Wendell] Code review feedback
d1c0494 [Patrick Wendell] Style fix
a406079 [Patrick Wendell] [HOTFIX] Some clean-up in shuffle code.

6562787b

Mar 20, 2015

[SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT. · a7456459

Marcelo Vanzin authored 10 years ago

Author: Marcelo Vanzin <vanzin@cloudera.com>

Closes #5056 from vanzin/SPARK-6371 and squashes the following commits:

63220df [Marcelo Vanzin] Merge branch 'master' into SPARK-6371
6506f75 [Marcelo Vanzin] Use more fine-grained exclusion.
178ba71 [Marcelo Vanzin] Oops.
75b2375 [Marcelo Vanzin] Exclude VertexRDD in MiMA.
a45a62c [Marcelo Vanzin] Work around MIMA warning.
1d8a670 [Marcelo Vanzin] Re-group jetty exclusion.
0e8e909 [Marcelo Vanzin] Ignore ml, don't ignore graphx.
cef4603 [Marcelo Vanzin] Indentation.
296cf82 [Marcelo Vanzin] [SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT.

a7456459