Holden Karau authored
Upgrade to Py4j0.9

Author: Holden Karau <holden@pigscanfly.ca>
Author: Holden Karau <holden@us.ibm.com>

Closes #8615 from holdenk/SPARK-10447-upgrade-pyspark-to-py4j0.9.
e18b571c

Welcome to Spark Python API Docs!

Contents:

Core classes:

:class:`pyspark.SparkContext`

Main entry point for Spark functionality.

:class:`pyspark.RDD`

A Resilient Distributed Dataset (RDD), the basic abstraction in Spark.

:class:`pyspark.streaming.StreamingContext`

Main entry point for Spark Streaming functionality.

:class:`pyspark.streaming.DStream`

A Discretized Stream (DStream), the basic abstraction in Spark Streaming.

:class:`pyspark.sql.SQLContext`

Main entry point for DataFrame and SQL functionality.

:class:`pyspark.sql.DataFrame`

A distributed collection of data grouped into named columns.
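As a rough illustration of how the core classes listed above fit together, the following sketch builds an RDD from a local list, then wraps it in a DataFrame. It is not taken from the Spark documentation. It assumes a local Spark installation whose version still provides ``SQLContext``. The function names ``word_lengths`` and ``run_demo``, the application name, and the column names are hypothetical. The streaming classes are left out to keep the example short.

```python
def word_lengths(sc, words):
    """Pair each word with its length using the RDD API.

    `sc` is expected to be a pyspark.SparkContext.
    """
    rdd = sc.parallelize(words)            # distribute a local list as an RDD
    return rdd.map(lambda w: (w, len(w)))  # transformation; evaluated lazily


def run_demo():
    """Run the sketch against a local Spark master (assumes pyspark is installed)."""
    # Imported here so word_lengths can be read and exercised without Spark.
    from pyspark import SparkContext
    from pyspark.sql import SQLContext

    # "local[2]" runs Spark in-process with two worker threads;
    # the application name is a hypothetical placeholder.
    sc = SparkContext("local[2]", "core-classes-demo")
    try:
        pairs = word_lengths(sc, ["spark", "rdd", "dataframe"])
        print(pairs.collect())  # action: brings the results back to the driver

        # SQLContext is the DataFrame/SQL entry point named in the list above.
        sqlContext = SQLContext(sc)
        df = sqlContext.createDataFrame(pairs, ["word", "length"])
        df.filter(df["length"] > 3).show()
    finally:
        sc.stop()  # release the context even if something above fails
```

Calling ``run_demo()`` from a Python session with Spark available should print the collected pairs and then the filtered table. Keeping the RDD logic in a function that receives the context makes it easier to reuse with an existing ``SparkContext``.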

Indices and tables

* :ref:`search`