Commit 7cd490ef authored by root

Clarify that PySpark is not supported on Windows

parent ec31e68d
@@ -17,10 +17,9 @@ There are a few key differences between the Python and Scala APIs:
 * Python is dynamically typed, so RDDs can hold objects of different types.
 * PySpark does not currently support the following Spark features:
     - Special functions on RDDs of doubles, such as `mean` and `stdev`
-    - `lookup`
+    - `lookup`, `sample` and `sort`
     - `persist` at storage levels other than `MEMORY_ONLY`
-    - `sample`
-    - `sort`
+    - Execution on Windows -- this is slated for a future release
 In PySpark, RDDs support the same methods as their Scala counterparts but take Python functions and return Python collection types.
 Short functions can be passed to RDD methods using Python's [`lambda`](http://www.diveintopython.net/power_of_introspection/lambda_functions.html) syntax:
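
The guide's own example for this sentence lies outside the hunk and is not reproduced here. As a rough illustration of what "passing short functions with `lambda`" looks like, the sketch below uses a plain Python list as a stand-in for an RDD so that it runs without a Spark installation. The variable names and sample data are made up for illustration. The commented PySpark calls assume an existing `SparkContext` named `sc`, which is not part of this commit.

```python
# Plain-Python stand-in: a list plays the role of an RDD so this runs without Spark.
words = ["spark", "python", "scala", "rdd"]

# Short functions written inline with lambda and passed to map/filter.
lengths = list(map(lambda w: len(w), words))
long_words = list(filter(lambda w: len(w) > 4, words))

# Assumed PySpark equivalents, given a SparkContext named `sc` (hypothetical here):
#   rdd = sc.parallelize(words)
#   rdd.map(lambda w: len(w)).collect()
#   rdd.filter(lambda w: len(w) > 4).collect()
print(lengths)
print(long_words)
```

The list-based version only mirrors the calling style. It does not reproduce Spark's lazy, distributed evaluation.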