Originally created at U.C. Berkeley’s AMPLab in 2009, Apache Spark is a “lightning-fast unified analytics engine” designed for large-scale data processing. It works with cluster computing platforms ...