RecognizeText
-
class RecognizeText.RecognizeText(concurrency=1, concurrentTimeout=100.0, errorCol=None, imageUrl=None, mode=None, outputCol=None, subscriptionKey=None, url=None)
Bases: mmlspark.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
Parameters:
- concurrency (int) – max number of concurrent calls (default: 1)
- concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
- errorCol (str) – column to hold http errors (default: [self.uid]_error)
- imageUrl (object) – the url of the image to use
- mode (object) – If this parameter is set to ‘Printed’, printed text recognition is performed. If ‘Handwritten’ is specified, handwriting recognition is performed
- outputCol (str) – The name of the output column (default: [self.uid]_output)
- subscriptionKey (object) – the API key to use
- url (str) – Url of the service
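The constructor arguments above can be assembled and checked before any Spark session is involved. The following is a minimal sketch, not part of the library: `build_recognize_text_kwargs` and `make_recognizer` are hypothetical helper names, the import path `mmlspark.RecognizeText` is assumed from the class path shown above, and the check on `mode` is a local convenience rather than documented library behaviour.

```python
ALLOWED_MODES = ("Printed", "Handwritten")  # the two values named in the parameter list


def build_recognize_text_kwargs(subscription_key, url, mode="Printed",
                                output_col="recognized", error_col="errors",
                                concurrency=1, concurrent_timeout=100.0):
    """Return keyword arguments named after the documented constructor parameters."""
    if mode not in ALLOWED_MODES:
        raise ValueError("mode must be one of %s, got %r" % (ALLOWED_MODES, mode))
    if concurrency < 1:
        raise ValueError("concurrency must be at least 1")
    return {
        "subscriptionKey": subscription_key,
        "url": url,
        "mode": mode,
        "outputCol": output_col,
        "errorCol": error_col,
        "concurrency": int(concurrency),
        "concurrentTimeout": float(concurrent_timeout),
    }


def make_recognizer(**kwargs):
    """Build the transformer; needs mmlspark and a running Spark session."""
    # Assumed import path, imported lazily so the helper above works without Spark.
    from mmlspark.RecognizeText import RecognizeText
    return RecognizeText(**kwargs)


# Placeholder credentials; substitute a real key and service URL.
kwargs = build_recognize_text_kwargs("<your-api-key>", "<service-url>", mode="Handwritten")
print(sorted(kwargs))
```

With mmlspark installed, `make_recognizer(**kwargs)` would pass these values straight to the constructor. The column names `recognized` and `errors` are arbitrary choices for illustration.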
-
getConcurrentTimeout()
Returns: max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
Return type: double
-
getErrorCol()
Returns: column to hold http errors (default: [self.uid]_error)
Return type: str
-
getMode()
Returns: If this parameter is set to ‘Printed’, printed text recognition is performed. If ‘Handwritten’ is specified, handwriting recognition is performed
Return type: object
-
getOutputCol()
Returns: The name of the output column (default: [self.uid]_output)
Return type: str
-
setConcurrency(value)
Parameters: concurrency (int) – max number of concurrent calls (default: 1)
-
setConcurrentTimeout(value)
Parameters: concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
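To make the interaction of `concurrency` and `concurrentTimeout` concrete, here is a rough analogy in plain Python. It is not how the transformer is implemented; it only illustrates the general pattern of running up to N calls at once and waiting a bounded number of seconds on each future. `fake_call` and `run_with_timeout` are hypothetical names.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_call(delay):
    """Stand-in for one HTTP request; sleeps, then returns its delay."""
    time.sleep(delay)
    return delay


def run_with_timeout(delays, concurrency=1, concurrent_timeout=100.0):
    """Run calls with at most `concurrency` workers, waiting up to
    `concurrent_timeout` seconds for each result."""
    results = []
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        futures = [pool.submit(fake_call, d) for d in delays]
        for future in futures:
            # Raises concurrent.futures.TimeoutError if the wait is exceeded.
            results.append(future.result(timeout=concurrent_timeout))
    return results


print(run_with_timeout([0.01, 0.02, 0.03], concurrency=3, concurrent_timeout=5.0))
```

In this analogy a larger `concurrency` shortens total wall-clock time, while `concurrent_timeout` caps how long any single result is awaited.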
-
setErrorCol(value)
Parameters: errorCol (str) – column to hold http errors (default: [self.uid]_error)
-
setMode(value)
Parameters: mode (object) – If this parameter is set to ‘Printed’, printed text recognition is performed. If ‘Handwritten’ is specified, handwriting recognition is performed
-
setModeCol(value)
Parameters: mode (object) – If this parameter is set to ‘Printed’, printed text recognition is performed. If ‘Handwritten’ is specified, handwriting recognition is performed
-
setOutputCol(value)
Parameters: outputCol (str) – The name of the output column (default: [self.uid]_output)
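The setters listed above are named `set` plus the capitalized parameter name (`errorCol` becomes `setErrorCol`). Assuming that pattern, a small helper can apply a dictionary of settings to any object exposing such setters. This is a sketch: `setter_name`, `apply_setters`, and `StubRecognizer` are hypothetical, and the stub stands in for the real transformer so the snippet runs without Spark.

```python
def setter_name(param):
    """Map a parameter name such as 'errorCol' to 'setErrorCol'."""
    return "set" + param[0].upper() + param[1:]


def apply_setters(obj, settings):
    """Call the matching setter on `obj` for each entry in `settings`."""
    for param, value in settings.items():
        setter = getattr(obj, setter_name(param), None)
        if setter is None:
            raise AttributeError("no setter for parameter %r" % param)
        setter(value)
    return obj


class StubRecognizer(object):
    """Hypothetical stand-in exposing a few of the documented setters."""

    def __init__(self):
        self.values = {}

    def setMode(self, value):
        self.values["mode"] = value

    def setOutputCol(self, value):
        self.values["outputCol"] = value

    def setConcurrency(self, value):
        self.values["concurrency"] = value


stub = apply_setters(StubRecognizer(),
                     {"mode": "Printed", "outputCol": "text", "concurrency": 2})
print(stub.values)
```

Failing loudly on an unknown parameter name is a design choice of this sketch; it surfaces typos such as `outputcol` early instead of silently ignoring them.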
-
setParams(concurrency=1, concurrentTimeout=100.0, errorCol=None, imageUrl=None, mode=None, outputCol=None, subscriptionKey=None, url=None)
Set the (keyword-only) parameters.
Parameters:
- concurrency (int) – max number of concurrent calls (default: 1)
- concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
- errorCol (str) – column to hold http errors (default: [self.uid]_error)
- imageUrl (object) – the url of the image to use
- mode (object) – If this parameter is set to ‘Printed’, printed text recognition is performed. If ‘Handwritten’ is specified, handwriting recognition is performed
- outputCol (str) – The name of the output column (default: [self.uid]_output)
- subscriptionKey (object) – the API key to use
- url (str) – Url of the service