Repartition¶

class Repartition.Repartition(disable=False, n=None)[source]¶

Bases: mmlspark.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer

Partitions the dataset into n partitions. Default value for n is 10.

Parameters:	disable (bool) – Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false) n (int) – Number of partitions

getDisable()[source]¶

Returns:	Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false)
Return type:	bool

Returns:	Number of partitions
Return type:	int

classmethod read()[source]¶: Returns an MLReader instance for this class.

setDisable(value)[source]¶

Parameters:	disable (bool) – Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false)

setN(value)[source]¶

Parameters:	n (int) – Number of partitions

setParams(disable=False, n=None)[source]¶

Set the (keyword only) parameters

Parameters:	disable (bool) – Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false) n (int) – Number of partitions