pyspark.streaming.DStream.cogroup#

DStream.cogroup(other, numPartitions=None)[source]#

Return a new DStream by applying ‘cogroup’ between RDDs of this DStream and other DStream.

Hash partitioning is used to generate the RDDs with numPartitions partitions.