Repartition

class Repartition.Repartition(disable=False, n=None)[source]

Bases: mmlspark.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer

Partitions the dataset into n partitions. Default value for n is 10.

Parameters:
  • disable (bool) – Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false)
  • n (int) – Number of partitions
getDisable()[source]
Returns:Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false)
Return type:bool
static getJavaPackage()[source]

Returns package name String.

getN()[source]
Returns:Number of partitions
Return type:int
classmethod read()[source]

Returns an MLReader instance for this class.

setDisable(value)[source]
Parameters:disable (bool) – Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false)
setN(value)[source]
Parameters:n (int) – Number of partitions
setParams(disable=False, n=None)[source]

Set the (keyword only) parameters

Parameters:
  • disable (bool) – Whether to disable repartitioning (so that one can turn it off for evaluation) (default: false)
  • n (int) – Number of partitions