Welcome to Spark Python API Docs!
===================================

Contents:

.. toctree::
   :maxdepth: 2

   pyspark
   pyspark.sql
   pyspark.streaming
   pyspark.ml
   pyspark.mllib


Core classes:
---------------

:class:`pyspark.SparkContext`

Main entry point for Spark functionality.

:class:`pyspark.RDD`

A Resilient Distributed Dataset (RDD), the basic abstraction in Spark.

:class:`pyspark.streaming.StreamingContext`

Main entry point for Spark Streaming functionality.

:class:`pyspark.streaming.DStream`

A Discretized Stream (DStream), the basic abstraction in Spark Streaming.

:class:`pyspark.sql.SQLContext`

Main entry point for DataFrame and SQL functionality.

:class:`pyspark.sql.DataFrame`

A distributed collection of data grouped into named columns.
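
As a minimal sketch of how the first two classes fit together, the snippet below builds an RDD from a local collection through a ``SparkContext``, then applies two transformations and one action. The helper name ``squares_of_evens``, the ``local[2]`` master URL, and the application name are illustrative assumptions, not part of the API, and the sketch assumes a local PySpark installation.

```python
def squares_of_evens(sc, numbers):
    """Return the squares of the even values in `numbers`.

    `sc` is expected to be a pyspark.SparkContext. This helper is
    hypothetical and exists only for this illustration.
    """
    rdd = sc.parallelize(numbers)              # local collection -> RDD
    evens = rdd.filter(lambda x: x % 2 == 0)   # lazy transformation
    squares = evens.map(lambda x: x * x)       # lazy transformation
    return squares.collect()                   # action: results return to the driver


def main():
    # Imported here so the helper above can be defined without Spark present.
    from pyspark import SparkContext

    # "local[2]" (two local worker threads) and the app name are assumptions.
    sc = SparkContext("local[2]", "core-classes-sketch")
    try:
        print(squares_of_evens(sc, range(10)))
    finally:
        sc.stop()  # release the context when done
```

Calling ``main()`` where PySpark is installed should run the job locally. ``filter`` and ``map`` only describe the computation; nothing is evaluated until ``collect`` is called.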

Indices and tables
==================