- Apr 18, 2014
Reynold Xin authored
Author: Reynold Xin <rxin@apache.org>
Closes #444 from rxin/pyspark and squashes the following commits:
fc11356 [Reynold Xin] Made the PySpark shell version checking compatible with Python 2.6.
571830b [Reynold Xin] Fixed broken pyspark shell.
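For context, a minimal sketch of a Python 2.6-compatible version check of the kind described here; this is illustrative only, not the actual shell.py code, and it assumes the check simply aborts when the interpreter is not Python 2:

```python
import sys

# Illustrative only: index into sys.version_info instead of using
# sys.version_info.major, which is unavailable on Python 2.6.
if sys.version_info[0] != 2:
    print("Error: PySpark requires Python 2.x; found %s" % sys.version.split()[0])
    sys.exit(1)
```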
- Apr 16, 2014
AbhishekKr authored
Python alternative for https://github.com/apache/spark/pull/392; managed from shell.py
Author: AbhishekKr <abhikumar163@gmail.com>
Closes #399 from abhishekkr/pyspark_shell and squashes the following commits:
134bdc9 [AbhishekKr] pyspark require Python2, failing if system default is Py3 from shell.py
- Apr 10, 2014
Ivan Wick authored
The Mesos backend uses this property when setting up a slave process. It is similarly set in the Scala repl (org.apache.spark.repl.SparkILoop), but I couldn't find anything analogous for pyspark.
Author: Ivan Wick <ivanwick+github@gmail.com>
This patch had conflicts when merged, resolved by Committer: Matei Zaharia <matei@databricks.com>
Closes #311 from ivanwick/master and squashes the following commits:
da0c3e4 [Ivan Wick] Set spark.executor.uri from environment variable (needed by Mesos)
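A hedged sketch of the behavior described, assuming the shell reads a SPARK_EXECUTOR_URI environment variable and forwards it as the spark.executor.uri property (variable and app setup are illustrative, not the exact shell code):

```python
import os
from pyspark import SparkConf, SparkContext

# Illustrative sketch: forward SPARK_EXECUTOR_URI from the environment into
# the Spark configuration so the Mesos backend can fetch the Spark package
# when it launches executors on each slave.
conf = SparkConf()
if "SPARK_EXECUTOR_URI" in os.environ:
    conf.set("spark.executor.uri", os.environ["SPARK_EXECUTOR_URI"])
sc = SparkContext(conf=conf)
```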
- Apr 07, 2014
Aaron Davidson authored
This is the default mode for running spark-shell and pyspark, intended to allow users running spark for the first time to see the performance benefits of using multiple cores, while not breaking backwards compatibility for users who use "local" mode and expect exactly 1 core.
Author: Aaron Davidson <aaron@databricks.com>
Closes #182 from aarondav/110 and squashes the following commits:
a88294c [Aaron Davidson] Rebased changes for new spark-shell
a9f393e [Aaron Davidson] SPARK-1099: Introduce local[*] mode to infer number of cores
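A small illustration of the difference (the shell sets the master for you; the app name here is hypothetical):

```python
from pyspark import SparkContext

# "local" runs Spark with exactly one worker thread; "local[*]" asks Spark to
# use as many worker threads as there are logical cores on the machine.
sc = SparkContext(master="local[*]", appName="LocalStarExample")
print(sc.defaultParallelism)  # typically the number of available cores
sc.stop()
```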
- Feb 08, 2014
Mark Hamstra authored
Version number to 1.0.0-SNAPSHOT
Since 0.9.0-incubating is done and out the door, we shouldn't be building 0.9.0-incubating-SNAPSHOT anymore. @pwendell
Author: Mark Hamstra <markhamstra@gmail.com>
== Merge branch commits ==
commit 1b00a8a7c1a7f251b4bb3774b84b9e64758eaa71
Author: Mark Hamstra <markhamstra@gmail.com>
Date: Wed Feb 5 09:30:32 2014 -0800
    Version number to 1.0.0-SNAPSHOT
- Jan 02, 2014
Prashant Sharma authored
- Dec 24, 2013
Andrew Ash authored
- Sep 24, 2013
Patrick Wendell authored
- Sep 07, 2013
Aaron Davidson authored
Aaron Davidson authored
The sc.StorageLevel -> StorageLevel pathway is a bit janky, but otherwise the shell would have to call a private method of SparkContext. Having StorageLevel available in sc also doesn't seem like the end of the world. There may be a better solution, though.
As for creating the StorageLevel object itself, this seems to be the best way in Python 2 for creating singleton, enum-like objects: http://stackoverflow.com/questions/36932/how-can-i-represent-an-enum-in-python
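The enum-like pattern referenced above, sketched roughly; this mirrors the general idea, not necessarily the exact fields PySpark uses:

```python
class StorageLevel(object):
    """Illustrative enum-like class: each storage level is a singleton
    instance attached to the class as an attribute."""

    def __init__(self, use_disk, use_memory, deserialized, replication=1):
        self.use_disk = use_disk
        self.use_memory = use_memory
        self.deserialized = deserialized
        self.replication = replication

# Singleton instances serve as the "enum members".
StorageLevel.DISK_ONLY = StorageLevel(True, False, False)
StorageLevel.MEMORY_ONLY = StorageLevel(False, True, True)
StorageLevel.MEMORY_AND_DISK = StorageLevel(True, True, True)
```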
- Sep 06, 2013
Aaron Davidson authored
It uses reflection... I am not proud of that fact, but it at least ensures compatibility (sans refactoring of the StorageLevel stuff).
- Sep 01, 2013
Matei Zaharia authored
- Aug 12, 2013
Andre Schumacher authored
Now ADD_FILES uses a comma as file name separator.
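For illustration, roughly how a comma-separated ADD_FILES variable might be consumed; the names and context creation here are assumptions, not the exact shell.py code:

```python
import os
from pyspark import SparkContext

# Illustrative: split the comma-separated ADD_FILES list and ship those files
# to the cluster when the shell creates its SparkContext.
add_files = os.environ.get("ADD_FILES")
py_files = add_files.split(",") if add_files else []
sc = SparkContext(master="local[*]", appName="PySparkShell", pyFiles=py_files)
```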
- Jul 16, 2013
Matei Zaharia authored
- Jan 30, 2013
Patrick Wendell authored
Also adds a line to the docs explaining how to use it.
- Jan 20, 2013
Matei Zaharia authored
- Jan 01, 2013
Josh Rosen authored
Expand the PySpark programming guide.
Josh Rosen authored
- Dec 28, 2012
Josh Rosen authored
- Bundle Py4J binaries, since it's hard to install
- Uses Spark's `run` script to launch the Py4J gateway, inheriting the settings in spark-env.sh
With these changes, (hopefully) nothing more than running `sbt/sbt package` will be necessary to run PySpark.
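A rough sketch of the launch-and-connect pattern described; the script path, server arguments, and port handshake are assumptions, not the exact java_gateway code:

```python
import os
import subprocess
from py4j.java_gateway import JavaGateway, GatewayClient

# Illustrative: start a Py4J GatewayServer through Spark's `run` script so it
# inherits spark-env.sh settings, then connect on the port it reports.
spark_home = os.environ.get("SPARK_HOME", ".")
proc = subprocess.Popen(
    [os.path.join(spark_home, "run"), "py4j.GatewayServer", "0"],
    stdout=subprocess.PIPE)
port = int(proc.stdout.readline())  # assume the server prints its listening port first
gateway = JavaGateway(GatewayClient(port=port))
```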
- Dec 27, 2012
Josh Rosen authored
Suggested by / based on code from @MLnick
- Oct 19, 2012
Josh Rosen authored