<< Click to Display Table of Contents >> Navigation: OCR Module > OCR Library Types > PXO_Options |
PXO_Options is the OCR options input structure:
MEMBER
|
TYPE
|
MEANING |
Lang
|
|
OCR language identifier. Please ensure that the language pack is installed in the correct location. See here for additional OCR language pack downloads.
|
RegionMode
|
|
A region mode specifier. This is useful for increasing the OCR accuracy and speed when the input type is known ahead of time, for example in cases of a single line/paragraph.
|
Whitelist
|
BSTR
|
A list of permitted characters. All other symbols will be ignored. This can be used to increase the level of accuracy in cases where the input type is known.
|
Blacklist
|
BSTR
|
A list of characters that are not permitted. All other symbols will be considered suitable for OCR identification. This can be used to remove undesired symbols when the input type is known.
|
DataPath |
BSTR |
The path to the language pack directory that contains the subfolder ocrdats/. Please note that it is necessary to point at the parent directory that contains the folder, for example if languages are installed in:
C:\OCR_Application\Languages\ocrdats\
then this member must be assigned the following string:
“C:\OCR_Application\Languages\”.
Pointing instead at the ocrdats folder will result in an error.
|
|
DWORD |
The flags that image processing uses. See OCR_ImageProcessingFlags. |
raster_dpi
|
int |
This is a DPI setting for the rasterizing/resampling of pages for OCR. The OCR algorithm uses the entire page as input to avoid the complications that result from fragmented images/multiple images on a single page.
We recommend a value of 300 DPI to ensure a high level of accuracy. If lower values are used then the speed of recognition will increase but there is an increased likelihood of errors.
Please note that the resolution of embedded images limits the effect of this setting, and there is no safeguard in place to ensure that images are not upsampled unnecessarily.
|
accMode
|
int |
This member is reserved for future use and should be set to zero. |
Please note that all components are case-sensitive.