2023.1:Image Processing (Activity): Difference between revisions
Dgreenwood (talk | contribs)
Revision as of 13:42, 19 April 2024
STUB: This article is a stub. It contains minimal information on the topic and should be expanded.
Image Processing is an Activity that enhances Batch Page images and optimizes them for better OCR text recognition and data extraction results.
An IP Profile lists a series of steps that perform image processing functions called "IP Commands". There are several IP Commands in Grooper, including ones that remove borders from an image, adjust the skew angle of an image, change the color format of an image, and more.
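The idea of a profile as an ordered list of commands can be sketched in a few lines of Python. This is only an illustration of the concept, not Grooper's implementation or API: the image is modeled as a small grid of grayscale values, and `crop_border`, `to_black_and_white`, and `apply_profile` are hypothetical names invented for this example.

```python
from typing import Callable, List

# Assumption: a grayscale image is a grid of pixel values, 0 (black) to 255 (white).
Image = List[List[int]]
Command = Callable[[Image], Image]


def crop_border(image: Image) -> Image:
    """Hypothetical command: drop a one-pixel border from every side."""
    return [row[1:-1] for row in image[1:-1]]


def to_black_and_white(image: Image) -> Image:
    """Hypothetical command: threshold each pixel to pure black or pure white."""
    return [[255 if px >= 128 else 0 for px in row] for row in image]


def apply_profile(image: Image, profile: List[Command]) -> Image:
    """Run each command in order, feeding each result into the next step."""
    for command in profile:
        image = command(image)
    return image


# A "profile" here is just the ordered list of commands.
profile = [crop_border, to_black_and_white]

page = [
    [0, 0, 0, 0],
    [0, 200, 90, 0],
    [0, 30, 250, 0],
    [0, 0, 0, 0],
]
result = apply_profile(page, profile)
```

The order of the list matters in this sketch: each command receives the output of the previous one, so reordering the steps can change the final image.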
Permanent vs. Temporary Image Processing
The Image Processing activity permanently alters a document's image by applying an IP Profile. However, it is also possible to clean up a document image temporarily to benefit OCR results and then revert to the original image. This is done during the Recognize activity.
For example, you may have a document where table lines are getting in the way of accurate OCR. However, if you remove these lines during the Image Processing activity, they are removed permanently. That makes the documents harder to check in Review, and the image archived later no longer looks like the original document.
Instead, you can use an OCR Profile that references an IP Profile containing a Line Removal command during Recognize. The image is temporarily changed according to the IP Profile, OCR runs on the altered image, and the image then reverts to its original form.
- Furthermore, any image-based "layout data" targeted by the IP Profile (such as the table line locations in this example) will still be saved to the page for later use.
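The difference between the two approaches can be illustrated with a toy model. The sketch below is an assumption-laden illustration, not Grooper code: a page image is modeled as rows of text in which a row of dashes stands for a table line, and `Page`, `remove_lines`, `fake_ocr`, `recognize`, and `image_processing` are all hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Assumption: each string is one row of the page; a row of "-" is a table line.
Image = List[str]


@dataclass
class Page:
    image: Image
    text: str = ""
    layout_data: Dict[str, List[int]] = field(default_factory=dict)


def remove_lines(image: Image) -> Tuple[Image, List[int]]:
    """Hypothetical line removal: drop dash-only rows and report where they were."""
    line_rows = [i for i, row in enumerate(image) if row and set(row) == {"-"}]
    cleaned = [row for i, row in enumerate(image) if i not in line_rows]
    return cleaned, line_rows


def fake_ocr(image: Image) -> str:
    """Stand-in for OCR: join the remaining rows as text."""
    return "\n".join(image)


def recognize(page: Page) -> None:
    """Temporary cleanup: OCR a cleaned copy and leave the stored image alone."""
    cleaned, line_rows = remove_lines(page.image)
    page.text = fake_ocr(cleaned)
    page.layout_data["table_lines"] = line_rows  # line locations are kept


def image_processing(page: Page) -> None:
    """Permanent cleanup: overwrite the stored image with the cleaned one."""
    page.image, _ = remove_lines(page.image)


rows = ["-----", "A | B", "-----", "1 | 2", "-----"]

temp_page = Page(image=list(rows))
recognize(temp_page)

perm_page = Page(image=list(rows))
image_processing(perm_page)
```

After `recognize`, `temp_page.image` still contains the table lines while `temp_page.text` was read from the cleaned copy. After `image_processing`, the lines are gone from `perm_page.image` for good.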