pyspark.streaming.DStream.cogroup

DStream.cogroup(other, numPartitions=None)

Return a new DStream by applying 'cogroup' between the RDDs of this DStream and those of the other DStream.

Hash partitioning is used to generate the RDDs with numPartitions partitions.
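
To illustrate what 'cogroup' produces for a single batch interval, here is a minimal pure-Python sketch. It is a model of the per-key grouping only, not Spark's implementation: the helper cogroup_batch and the sample data are hypothetical, and it ignores partitioning entirely.

```python
from collections import defaultdict


def cogroup_batch(left, right):
    """Model cogroup on one batch of (key, value) pairs (hypothetical helper).

    Returns a dict mapping each key found in either input to a pair
    (values from left, values from right). A key missing on one side
    gets an empty list for that side.
    """
    grouped = defaultdict(lambda: ([], []))
    for key, value in left:
        grouped[key][0].append(value)
    for key, value in right:
        grouped[key][1].append(value)
    return dict(grouped)


# Sample data standing in for one batch from each DStream (assumed values).
batch_a = [("a", 1), ("b", 2), ("a", 3)]
batch_b = [("a", "x"), ("c", "y")]

result = cogroup_batch(batch_a, batch_b)
for key in sorted(result):
    print(key, result[key])
```

On real DStreams of key-value pairs the corresponding call would look like stream_a.cogroup(stream_b, numPartitions=2), where stream_a and stream_b are assumed names. In Spark the grouped values come back as iterables rather than plain lists, so convert them with list() before comparing or printing.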