SimpleHTTPTransformer
-
class SimpleHTTPTransformer.SimpleHTTPTransformer(concurrency=1, concurrentTimeout=100.0, errorCol=None, flattenOutputBatches=None, handler=None, inputCol=None, inputParser=None, miniBatcher=None, outputCol=None, outputParser=None)
Bases: mmlspark.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
Parameters:
- concurrency (int) – max number of concurrent calls (default: 1)
- concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
- errorCol (str) – column to hold http errors (default: [self.uid]_errors)
- flattenOutputBatches (bool) – whether to flatten the output batches
- handler (object) – Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
- inputCol (str) – The name of the input column
- inputParser (object) – format to parse the column to (default: JSONInputParser_68c8218df4fc)
- miniBatcher (object) – Minibatcher to use
- outputCol (str) – The name of the output column
- outputParser (object) – format to parse the column to
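
The following is a minimal sketch, using only the Python standard library, of the behavior the `concurrency`, `concurrentTimeout`, and `errorCol` descriptions suggest: a bounded number of calls in flight, a cap on how long each future is awaited, and errors recorded next to the output rather than raised. It is not the mmlspark implementation. The names `call_all` and `fake_handler` are hypothetical, and applying the timeout per future is an assumption.

```python
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as FutureTimeout


def call_all(rows, handler, concurrency=1, concurrent_timeout=100.0):
    """Apply `handler` to each row with at most `concurrency` calls in flight.

    Returns one (output, error) pair per row, in input order.
    """
    results = []
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        futures = [pool.submit(handler, row) for row in rows]
        for future in futures:
            try:
                # Assumption: the timeout applies to each future separately.
                output = future.result(timeout=concurrent_timeout)
                results.append((output, None))
            except FutureTimeout:
                results.append((None, "timed out"))
            except Exception as exc:
                # Keep the failure beside the row, like a separate error column.
                results.append((None, repr(exc)))
    # Note: leaving the `with` block still waits for running workers to finish.
    return results


def fake_handler(row):
    """Hypothetical stand-in for an HTTP call."""
    if row < 0:
        raise ValueError("negative input")
    return row * 2


pairs = call_all([1, -1, 3], fake_handler, concurrency=2)
```

Each element of `pairs` holds either an output or an error message, which mirrors the split between `outputCol` and `errorCol`.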
-
getConcurrentTimeout()
Returns: max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
Return type: double
-
getErrorCol()
Returns: column to hold http errors (default: [self.uid]_errors)
Return type: str
-
getHandler()
Returns: Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
Return type: object
-
getInputParser()
Returns: format to parse the column to (default: JSONInputParser_68c8218df4fc)
Return type: object
-
setConcurrency(value)
Parameters: concurrency (int) – max number of concurrent calls (default: 1)
-
setConcurrentTimeout(value)
Parameters: concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
-
setErrorCol(value)
Parameters: errorCol (str) – column to hold http errors (default: [self.uid]_errors)
-
setFlattenOutputBatches(value)
Parameters: flattenOutputBatches (bool) – whether to flatten the output batches
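
As a rough illustration of what "flattening output batches" can mean when a mini-batcher is in use, the sketch below groups rows into fixed-size batches, processes each batch in a single call, and then flattens the batched results back to one value per row. This is a plain-Python analogy under assumptions, not the library's code. `mini_batch`, `flatten`, and `fake_batch_handler` are hypothetical names, and fixed-size batching is just one possible strategy.

```python
def mini_batch(rows, batch_size):
    """Split `rows` into consecutive lists of at most `batch_size` items."""
    return [rows[i:i + batch_size] for i in range(0, len(rows), batch_size)]


def flatten(batches):
    """Concatenate a list of batches back into a single flat list."""
    return [item for batch in batches for item in batch]


def fake_batch_handler(batch):
    """Hypothetical stand-in for one request that carries a whole batch."""
    return [value.upper() for value in batch]


rows = ["a", "b", "c", "d", "e"]
batches = mini_batch(rows, 2)
batched_output = [fake_batch_handler(batch) for batch in batches]
flat_output = flatten(batched_output)
```

Without the final `flatten` step the result keeps one entry per batch. With it, the result lines up with the original rows again.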
-
setHandler(value)
Parameters: handler (object) – Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
-
setInputParser(value)
Parameters: inputParser (object) – format to parse the column to (default: JSONInputParser_68c8218df4fc)
-
setParams(concurrency=1, concurrentTimeout=100.0, errorCol=None, flattenOutputBatches=None, handler=None, inputCol=None, inputParser=None, miniBatcher=None, outputCol=None, outputParser=None)
Set the (keyword-only) parameters.
Parameters:
- concurrency (int) – max number of concurrent calls (default: 1)
- concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
- errorCol (str) – column to hold http errors (default: [self.uid]_errors)
- flattenOutputBatches (bool) – whether to flatten the output batches
- handler (object) – Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
- inputCol (str) – The name of the input column
- inputParser (object) – format to parse the column to (default: JSONInputParser_68c8218df4fc)
- miniBatcher (object) – Minibatcher to use
- outputCol (str) – The name of the output column
- outputParser (object) – format to parse the column to
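
To illustrate what "keyword only" means for a method such as `setParams`, here is a small self-contained sketch. `ParamHolder` is a hypothetical stand-in, not the mmlspark class. It enforces keyword-only arguments with Python's bare `*` syntax, which may differ from the mechanism the library itself uses.

```python
class ParamHolder:
    """Hypothetical stand-in that stores two of the parameters listed above."""

    def __init__(self):
        # Defaults copied from the documented signature.
        self.params = {"concurrency": 1, "concurrentTimeout": 100.0}

    def setParams(self, *, concurrency=None, concurrentTimeout=None):
        """Update only the parameters that were passed explicitly by name."""
        updates = {
            "concurrency": concurrency,
            "concurrentTimeout": concurrentTimeout,
        }
        for name, value in updates.items():
            if value is not None:
                self.params[name] = value
        return self  # returning self allows call chaining


holder = ParamHolder().setParams(concurrency=4)

try:
    holder.setParams(8)  # positional arguments are rejected
    positional_rejected = False
except TypeError:
    positional_rejected = True
```

Passing values by name keeps a call readable when a method accepts many optional parameters, which is the case for the ten listed above.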