pyspark.RDD.treeAggregate

RDD.treeAggregate(zeroValue, seqOp, combOp, depth=2)[source]

Aggregates the elements of this RDD in a multi-level tree pattern.

depthint, optional

suggested depth of the tree (default: 2)

Examples

>>> add = lambda x, y: x + y
>>> rdd = sc.parallelize([-5, -4, -3, -2, -1, 1, 2, 3, 4], 10)
>>> rdd.treeAggregate(0, add, add)
-5
>>> rdd.treeAggregate(0, add, add, 1)
-5
>>> rdd.treeAggregate(0, add, add, 2)
-5
>>> rdd.treeAggregate(0, add, add, 5)
-5
>>> rdd.treeAggregate(0, add, add, 10)
-5