24 November 2015

foreach() Example

foreach() is an action. Unlike other actions, foreach do not return any value. It simply operates on all the elements in the RDD. foreach() can be used in situations, where we do not want to return any result, but want to initiate a computation. A good example is ; inserting elements in RDD into database. Let us look at an example for foreach()

scala> val testData=Array(1,2,3)
testData: Array[Int] = Array(1, 2, 3)

scala> val inputrdd = sc.parallelize(testData)
inputrdd: org.apache.spark.rdd.RDD[Int] = ParallelCollectionRDD[2] at parallelize at :23

scala>

scala> inputrdd.foreach{ x => {
     |                         println(x)
     |                       }}
3
1
2

scala>

scala> //Checkout this ; out addition has no

scala> //impact on the actual elements in RDD

scala> inputrdd.collect()
res6: Array[Int] = Array(1, 2, 3)

Reference

Learning spark : Page 41