Import Agent Options
You can configure options that will apply to all profiles, including settings for OCR, configuration for import of various types of documents, and file monitoring options.
To configure options in Import Agent, open the Import Agent Configuration Manager. Select the Profile menu, and then select Options.
General
Count
Sets the starting point for the Session Count token. See Import Agent Tokens for more information.
Distributed Computing Cluster
Laserfiche Cloud users should leave the Distributed Computing Cluster section blank.
OCR
OCR settings allow you to customize the way in which OCR is performed on imported documents.
- Language: Select a language to help optimize the character recognition.
- Decolumnize text: Select this option to convert multiple columns of generated text into a single column. Clearing the checkbox will preserve column formatting in the OCRed text, even if that separates words and sentences.
- Perform image enhancement: Images will be temporarily enhanced prior to OCR processing to optimize the processing. Click Configure open the Image Clean-up Options dialog box and configure the desired temporary image enhancements.
- Deskew image: Straighten crooked images.
- Despeckle image: Remove undesired noise from an image.
- Specify the maximum size of noise to be removed (in pixels): Size is specified as both width and height. For example, setting this option to 2 will remove all noise that is equal to or smaller than a 2 pixel x 2 pixel square.
- Rotate image: Rotates images to an orientation that is appropriate for OCR processing. After the OCR process is performed, the image will return to its original orientation.
- Automatically: The direction in which text flows on the page will be detected. The image will be rotated so the text flows horizontally (left to right).
- By this amount: The amount by which the image will be rotated. An image can be rotated by 90, 180, or 270 degrees.
- Line removal: Remove lines from an image.
- Horizontal: Removes horizontal lines from the image. Characters that are damaged due to the line removal will be repaired.
- Vertical: Removes vertical lines from the image. Characters that are damaged due to the line removal will be repaired.
- Optimization priority: Choose one of the following:
- Speed: Reduces the amount of time it takes to OCR. Generated text may be less accurate. Best for documents with clear text.
- Balance: Strikes a balance between speed and accuracy. Best for documents with average text.
- Accuracy: Increases OCR quality. Processing time will also be increased. Best for documents with less clear text.
File Types
Images
By default, Import Agent will convert images in BMP, GIF, JPG, JPEG, PCX, PNG, TIF, and TIFF formats to Laserfiche imaged documents. You can manually select which of these file formats to convert in this option. If you choose not to convert a particular image file type, the image file will open in its original format in its native application.
To modify the import conversion list, add or remove extensions from the list.
Text
Import Agent lets you choose whether certain types of text files will be imported as Laserfiche documents or electronic documents. To modify the import conversion list, add or remove extensions from the list.
Laserfiche files
Laserfiche supports import via two Laserfiche-specific formats that allow you to import multiple documents at once: list files and briefcase files.
The Recognize files with LST extensions as list files and Recognize files with XML and LSTX extensions as list files options allow you to import list files, which allow for more precise handling of import in certain cases. For information on the structure of XML import list files, see the schema definition file and sample XML files included in the installation directory (e.g., C:\Program Files\Laserfiche\Import Agent\List File Examples).
The Recognize files with LFB extensions as Laserfiche briefcases option allows you to import Laserfiche briefcases using Import Agent. See Exporting a Collection of Documents and Folders as a Briefcase for more information on briefcases.
Advanced
Import Agent receives notifications from your operating system that help it determine when files are available for import. In some situations, Import Agent is unable to import a file after receiving a notification (e.g., the file was in use when the notification was sent). To prevent files from being overlooked, Import Agent periodically checks monitored directories. You can specify how often a check should be performed, as well as how long Import Agent should wait before importing files that have been recently modified. These options are useful in situations where content is being added toa file as it is being created.
Example: Your scanner has been configured to create multi-page TIFF images. In this scenario, you should increase the time interval that Import Agent waits so your scanner has enough time to create all of the document's pages.
To configure monitoring options
- In the Options menu, open the Advanced tab.
- Under File monitoring interval, specify the number of seconds that must elapse between each time that Import Agent will check for new files.
- Under New files, specify the number of seconds that must elapse after a file's been modified before it can be imported into Laserfiche.
- Under Import Thread Count, specify the number of threads Import Agent will run on. This allows you to run Import Agent on machines with lower processing speeds.
- If you are running Import Agent on a powerful machine, you can increase thread count and enable parallel processing in the next step.
- If Import Agent is taking too many resources and affecting the performance of other products, reduce the thread count and clear the parallel processing option in the next step.
- Select Enable parallel processing for each profile, to allow for profiles to run in parallel.
- Click OK.
Note: These settings apply to all profiles.