OCRText

The OCRText component enables extraction of Text from an Image.

Screenshot

Double Click on the OCRText component title bar to launch the OCR Settings Window.

Screenshot

1.Engine: The Intellibot Studio supports the use of the following engines.

  • Tessract 4.0(LSTM)

  • Windows

  • ABBYY

  • Abbyy Cloud

  • Google Cloud

  • Microsoft Cloud

  • Amazon Cloud

2.Language: Displays all languages supported by the OCR engine selected. Default language is “English”.

3.Search: Allows users to search for properties for the selected OCR Engine.

4.Options: Enables various options based on the OCR Engine Selected.

Ports

The OCRText component exposes Control In, Control Out and Data In and Data Out ports by default.

Port Description
Control In Must be connected to the Control Out port of one or more components.
Control Out Can be connected to the Control In port of any other component or connector
Data In (Bitmap) Can be connected to the Data Out port of any other component or connector.
Data Out (String) Returns the extracted Text (String).

Properties

To edit the properties of the OCRText component, in the Properties window, users can change the required property.

Users can edit the following properties:

Property Description
Search Search for the respective property.
Delay After Execution Specifies the wait time (in seconds) after the action is executed.
Delay Before Execution Specifies the wait time (in seconds) before the action is executed.
Name Allows users to name the OCRText Component.
Image Source Image Source will display two options:
Port: Can be connected to different component to read the Image from.
FilePath: Allows users to specify the FilePath for the saved Image.

Example

Let us consider the example of text from an Image.

Screenshot

To extract the text:

We will first use the ReadFromFile Component to provide the OCR text with an image from which it can read as below-

  1. In the Toolbox, expand the Utilities category.

  2. Expand Image Utilities.

  3. Drag the ReadFromFile component and drop it on the design surface.

  4. Double click on FilePath and mention File path for the saved Image.

  5. The OCRText Component will be used to convert the image to text by-

    • In the Toolbox, expand Actions.

    • Drag and Drop the OCRText component to the Design surface.

  6. The Show component will be used to view the extracted text.

  7. To add the Show component we would:

    • Expand General, Drag and Drop the Show component to the Design surface.
  8. Connect:

    • The Control port between Start and ReadfromFile component.

    • The Controlout port of the ReadFromFile component to the Control in port of the OCRText component.

    • The Dataout port of the ReadFromFile Component to the Data in port of the OCRText component

    • The Control out port of the OCRText component to the Control in port of the Show component.

    • The Data out port of the OCRText component to the Data in port for the Showcomponent.

    • The Control out port of the MessageBox component to the End Component.

  9. Click Run.

  10. Once the text is successfully extracted, it is displayed in the MessageBox-

    Screenshot