LightGBMRegressor¶
-
class
LightGBMRegressor.LightGBMRegressor(alpha=0.9, application='regression', baggingFraction=1.0, baggingFreq=0, baggingSeed=3, defaultListenPort=12400, featureFraction=1.0, featuresCol='features', labelCol='label', learningRate=0.1, maxBin=255, maxDepth=-1, minSumHessianInLeaf=0.001, numIterations=100, numLeaves=31, parallelism='data_parallel', predictionCol='prediction')[source]¶ Bases:
mmlspark.Utils.ComplexParamsMixin,pyspark.ml.util.JavaMLReadable,pyspark.ml.util.JavaMLWritable,pyspark.ml.wrapper.JavaEstimatorParameters: - alpha (double) – parameter for Huber loss and Quantile regression (default: 0.9)
- application (str) – Regression application, regression_l2, regression_l1, huber, fair, poisson, quantile, mape, gamma or tweedie (default: regression)
- baggingFraction (double) – Bagging fraction (default: 1.0)
- baggingFreq (int) – Bagging frequence (default: 0)
- baggingSeed (int) – Bagging seed (default: 3)
- defaultListenPort (int) – The default listen port on executors, used for testing (default: 12400)
- featureFraction (double) – Feature fraction (default: 1.0)
- featuresCol (str) – features column name (default: features)
- labelCol (str) – label column name (default: label)
- learningRate (double) – Learning rate or shrinkage rate (default: 0.1)
- maxBin (int) – Max bin (default: 255)
- maxDepth (int) – Max depth (default: -1)
- minSumHessianInLeaf (double) – minimal sum hessian in one leaf (default: 0.001)
- numIterations (int) – Number of iterations, LightGBM constructs num_class * num_iterations trees (default: 100)
- numLeaves (int) – Number of leaves (default: 31)
- parallelism (str) – Tree learner parallelism, can be set to data_parallel or voting_parallel (default: data_parallel)
- predictionCol (str) – prediction column name (default: prediction)
-
getAlpha()[source]¶ Returns: parameter for Huber loss and Quantile regression (default: 0.9) Return type: double
-
getApplication()[source]¶ Returns: Regression application, regression_l2, regression_l1, huber, fair, poisson, quantile, mape, gamma or tweedie (default: regression) Return type: str
-
getDefaultListenPort()[source]¶ Returns: The default listen port on executors, used for testing (default: 12400) Return type: int
-
getLearningRate()[source]¶ Returns: Learning rate or shrinkage rate (default: 0.1) Return type: double
-
getMinSumHessianInLeaf()[source]¶ Returns: minimal sum hessian in one leaf (default: 0.001) Return type: double
-
getNumIterations()[source]¶ Returns: Number of iterations, LightGBM constructs num_class * num_iterations trees (default: 100) Return type: int
-
getParallelism()[source]¶ Returns: Tree learner parallelism, can be set to data_parallel or voting_parallel (default: data_parallel) Return type: str
-
setAlpha(value)[source]¶ Parameters: alpha (double) – parameter for Huber loss and Quantile regression (default: 0.9)
-
setApplication(value)[source]¶ Parameters: application (str) – Regression application, regression_l2, regression_l1, huber, fair, poisson, quantile, mape, gamma or tweedie (default: regression)
-
setBaggingFraction(value)[source]¶ Parameters: baggingFraction (double) – Bagging fraction (default: 1.0)
-
setDefaultListenPort(value)[source]¶ Parameters: defaultListenPort (int) – The default listen port on executors, used for testing (default: 12400)
-
setFeatureFraction(value)[source]¶ Parameters: featureFraction (double) – Feature fraction (default: 1.0)
-
setFeaturesCol(value)[source]¶ Parameters: featuresCol (str) – features column name (default: features)
-
setLearningRate(value)[source]¶ Parameters: learningRate (double) – Learning rate or shrinkage rate (default: 0.1)
-
setMinSumHessianInLeaf(value)[source]¶ Parameters: minSumHessianInLeaf (double) – minimal sum hessian in one leaf (default: 0.001)
-
setNumIterations(value)[source]¶ Parameters: numIterations (int) – Number of iterations, LightGBM constructs num_class * num_iterations trees (default: 100)
-
setParallelism(value)[source]¶ Parameters: parallelism (str) – Tree learner parallelism, can be set to data_parallel or voting_parallel (default: data_parallel)
-
setParams(alpha=0.9, application='regression', baggingFraction=1.0, baggingFreq=0, baggingSeed=3, defaultListenPort=12400, featureFraction=1.0, featuresCol='features', labelCol='label', learningRate=0.1, maxBin=255, maxDepth=-1, minSumHessianInLeaf=0.001, numIterations=100, numLeaves=31, parallelism='data_parallel', predictionCol='prediction')[source]¶ Set the (keyword only) parameters
Parameters: - alpha (double) – parameter for Huber loss and Quantile regression (default: 0.9)
- application (str) – Regression application, regression_l2, regression_l1, huber, fair, poisson, quantile, mape, gamma or tweedie (default: regression)
- baggingFraction (double) – Bagging fraction (default: 1.0)
- baggingFreq (int) – Bagging frequence (default: 0)
- baggingSeed (int) – Bagging seed (default: 3)
- defaultListenPort (int) – The default listen port on executors, used for testing (default: 12400)
- featureFraction (double) – Feature fraction (default: 1.0)
- featuresCol (str) – features column name (default: features)
- labelCol (str) – label column name (default: label)
- learningRate (double) – Learning rate or shrinkage rate (default: 0.1)
- maxBin (int) – Max bin (default: 255)
- maxDepth (int) – Max depth (default: -1)
- minSumHessianInLeaf (double) – minimal sum hessian in one leaf (default: 0.001)
- numIterations (int) – Number of iterations, LightGBM constructs num_class * num_iterations trees (default: 100)
- numLeaves (int) – Number of leaves (default: 31)
- parallelism (str) – Tree learner parallelism, can be set to data_parallel or voting_parallel (default: data_parallel)
- predictionCol (str) – prediction column name (default: prediction)
-
class
LightGBMRegressor.M(java_model=None)[source]¶ Bases:
mmlspark.Utils.ComplexParamsMixin,pyspark.ml.wrapper.JavaModel,pyspark.ml.util.JavaMLWritable,pyspark.ml.util.JavaMLReadableModel fitted by
LightGBMRegressor.This class is left empty on purpose. All necessary methods are exposed through inheritance.