pyspark.streaming.StreamingContext.transform

StreamingContext.transform(dstreams: List[DStream[Any]], transformFunc: Callable[[…], pyspark.rdd.RDD[T]]) → pyspark.streaming.dstream.DStream[T][source]

Create a new DStream in which each RDD is generated by applying a function on RDDs of the DStreams. The order of the JavaRDDs in the transform function parameter will be the same as the order of corresponding DStreams in the list.