Skip to content

Latest commit

 

History

History
30 lines (20 loc) · 548 Bytes

running-spark.md

File metadata and controls

30 lines (20 loc) · 548 Bytes

Sample Lab : Running Spark v2

Screencast

Sandbox spark Screencast

Checkout our Sandbox channel for more videos.

Script

In terminal

  $   cd
  # unpack
  $   tar xvf files/spark-2.1.0-bin-hadoop2.7.tgz
  $   mv  spark-2.1.0-bin-hadoop2.7  spark

  # start spark shell
  $  ~/spark/bin/spark-shell

The following commands are in Spark Shell

  // load file
  > val f = spark.read.textFile("data/spark-commits/spark-commits.log")
  > f.count