Apache Spark

Original author(s): Matei Zaharia
Developer(s): Apache Spark
Initial release: May 26, 2014
Stable release: 3.0.0 / June 18, 2020
Repository: Spark Repository
Written in: Scala[1]
Operating system: Microsoft Windows, macOS, Linux
Available in: Scala, Java, SQL, Python, R
Type: Data analytics, machine learning algorithms
License: Apache License 2.0
Website: spark.apache.org

Apache Spark is an open-source distributed general-purpose cluster-computing framework. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Originally developed at the University of California, Berkeley's AMPLab, the Spark codebase was later donated to the Apache Software Foundation, which has maintained it since.

  1. "Spark Release 2.0.0". "MLlib in R: SparkR now offers MLlib APIs [..] Python: PySpark now offers many more MLlib algorithms"
