Retrieve Document Text
This activity retrieves existing text from a Laserfiche or electronic document and stores it in a token as plain text. Learn about the token this activity produces.
Note: When you monitor the activity, the text preview is limited to 500 characters. The token still stores the entire retrieved text during processing.
Tip: Using this activity in conjunction with the Pattern Matching activity can help you extract useful information from pages of text.
To set up the activity:
- Drag the activity from the toolbox pane and drop it in the Designer pane.
- In the properties pane, type a name and description of the activity.
- In the Connection Profile property box, select the repository you want the activity to use.
- In the Pages to Retrieve property box, select which pages to retrieve from the entry you select in the Retrieve Text From property box. To use a token, click the Token button (right arrow).
- All pages: Store the text from all pages of the document in a token.
- Pages: Specify a range of pages from which you want to retrieve text.
Example: Specify 1, 3, 5-7 to include pages 1, 3, 5, 6, and 7. You can also specify a page range that skips pages. For instance, 2@2 includes all even-numbered pages, whereas 2@3 includes every third page starting with page 2 (pages 2, 5, 8, and so on).
- In the Token Options property box, select the checkbox to store each page of text in a separate value in the multi-value "Text" token, instead of in a single-value token.
Note: The multi-value token's index starts at 1, not at the page number of the retrieved text. For example, if your page range is 2, 3, 5, then page 2 is indexed as 1, page 3 as 2, and page 5 as 3.
- In the Retrieve Text From property box, select the entry you want this activity to retrieve text from. Learn more about selecting an entry.
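The page-range syntax and the 1-based token index described in the steps above can be illustrated with a short Python sketch. This is not the product's own parser: the function names expand_pages and token_index_to_page are hypothetical, and the handling of whitespace, duplicate pages, out-of-range pages, and result ordering is an assumption made for illustration only.

```python
def expand_pages(spec: str, page_count: int) -> list[int]:
    """Expand a page specification such as "1, 3, 5-7" or "2@3".

    Hypothetical helper: it mirrors the syntax described in this topic,
    not the activity's actual implementation.
    """
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "@" in part:
            # start@step: every step-th page, beginning with page `start`.
            start, step = (int(x) for x in part.split("@"))
            if start < 1 or step < 1:
                raise ValueError(f"invalid skip range: {part!r}")
            pages.update(range(start, page_count + 1, step))
        elif "-" in part:
            # first-last: an inclusive range of consecutive pages.
            first, last = (int(x) for x in part.split("-"))
            pages.update(range(first, last + 1))
        else:
            pages.add(int(part))
    # Assumption: pages outside the document are ignored and the result
    # is returned in ascending order without duplicates.
    return sorted(p for p in pages if 1 <= p <= page_count)


def token_index_to_page(pages: list[int]) -> dict[int, int]:
    """Map each 1-based multi-value token index to its source page number."""
    return {index: page for index, page in enumerate(pages, start=1)}


if __name__ == "__main__":
    print(expand_pages("1, 3, 5-7", page_count=10))
    print(expand_pages("2@3", page_count=10))
    print(token_index_to_page(expand_pages("2, 3, 5", page_count=10)))
```

The second function shows why a value's position in the multi-value token cannot be read as a page number: the mapping depends on which pages were requested, so a workflow that needs the original page number has to keep track of the page list it passed in.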
You may want to disable activities if you only want to test specific sections of your workflow. To disable activities:
- Select an activity in the Designer pane to highlight it.
- In the Description property box, clear Enable.