2023:Ordered OMR (Value Extractor): Difference between revisions
No edit summary |
|||
| Line 65: | Line 65: | ||
|valign=top style="width:60%"| | |valign=top style="width:60%"| | ||
With a ''Horizontal'' '''''Flow Direction''''' selected, Grooper would first look at box for YES | With a ''Horizontal'' '''''Flow Direction''''' selected, Grooper would first look at box for 1 and YES or 1Y and determine whether or not it is checked. Then it would look at 1N and determine whether or not that one was checked. Then it would return to the first column at 2Y and then move to 2N and so on. It jumps back and forth between the two columns, first looking horizontally for information before moving on to the next row. If there were a third column, Grooper would look at the first row and extract the values for the first, second, and third columns before moving to the next row. | ||
So in this example if we were to symbolize "YES" with a Y and a "NO" with an N, Grooper would return the following values in this order: 1N 2Y 3N 4N 5N 6Y 7Y 8N 9N 10Y 11N | So in this example if we were to symbolize "YES" with a Y and a "NO" with an N, Grooper would return the following values in this order: 1N 2Y 3N 4N 5N 6Y 7Y 8N 9N 10Y 11N | ||
| Line 87: | Line 87: | ||
|valign=top style="width:60%"| | |valign=top style="width:60%"| | ||
With a ''Vertical'' '''''Flow Direction''''' selected, Grooper would first look at box for YES | With a ''Vertical'' '''''Flow Direction''''' selected, Grooper would first look at box for 1 and YES or 1Y and determine whether or not it is checked. Then it would look at 2Y and determine whether or not that one was checked, and so on down the line. At the end of the first column, the '''''Ordered OMR''''' extractor would start again at the top of the second column at 1N and go down that column determining whether or not the boxes are checked. | ||
So in this example if we were to symbolize "YES" with a Y and a "NO" with an N, Grooper would return the following values in this order: 2Y 6Y 7Y 10Y 1N 3N 4N 5N 8N 9N 11N | So in this example if we were to symbolize "YES" with a Y and a "NO" with an N, Grooper would return the following values in this order: 2Y 6Y 7Y 10Y 1N 3N 4N 5N 8N 9N 11N | ||
| Line 98: | Line 98: | ||
</tabs> | </tabs> | ||
==How To== | ==How To== | ||
Revision as of 11:53, 14 February 2023
| WIP |
This article is a work-in-progress or created as a placeholder for testing purposes. This article is subject to change and/or expansion. It may be incomplete, inaccurate, or stop abruptly. This tag will be removed upon draft completion. |
Ordered OMR is an extractor type similar to a Labeled OMR in that it is used to return OMR check box information. Rather than relying on a label for the extraction, the Ordered OMR returns information from the boxes based on the order of the check boxes.
About
Check boxes on a form can be extremely useful. They give us quick information at a glance. However, there is not an expression we can put into a text extractor, such as a Pattern Match or List Match, to find checked and unchecked boxes. Instead we must use one of the OMR extractors.
OMR stands for "Optical Mark Recognition". OMR first detects the check boxes on a document and then determines whether not that box is checked or unchecked. The most common ways a box can be checked are with a checkmark, black box, or an "X".
There are three types of OMR recognition in Grooper: Labeled OMR, Ordered OMR, and Zonal OMR.
- NOTE: For any OMR detection, documents in Grooper first need to be recognized and go through the Box Detection or Box Removal step from either and OCR Profile or an IP Profile. Please see the OCR and IP Profile wiki articles for more information.
Ordered OMR determines which boxes are checked and unchecked and then returns values based on the order of the boxes. Before extraction, the boxes have to be given an Output Value to assign a specific value to each box. So, what does this mean?
How Does It Work?
Understanding Ordered OMR
|
Horizontal Flow
|
Vertical Flow
|
How To
So how do we set this up in Grooper? An Ordered OMR can be selected anywhere an extractor is used.
Configuring on a Value Reader | |
|
Configuring on a Data Type | |
|
Configuring on Other Object TypesThe Ordered OMR extractor can be used on a multitude of object types. Any object that has an extractor property can be configured with an Ordered OMR. The configuration process on other objects is identical to both the Value Reader and Data Type objects. Simply select Ordered OMR as your extractor type.
|
Once you have Ordered OMR selected as the extractor type, there are several properties that need to be configured.
Mode: This property sets the check box format. There are three options for Mode: CheckOne, CheckMulti, and Boolean. The CheckMulti option is going to be most commonly used for an Ordered OMR extractor. For the example below, we will be using CheckMulti. For more information on the other two options, please visit the Labeled OMR - 2021 Wiki Page.
Location: This property tells Grooper what area of the document to look for OMR boxes. There are four options for the Location: Fixed Region, Relative Region, Shape Region, and Text Region. For the example below, we will be using Fixed Region. For more information on the other three options, please visit the Labeled OMR - 2021 Wiki Page.
Output Values: This property allows you to assign each check box a value to be extracted. The order of your Output Values depends on whether you decide to use a Horizontal or Vertical Flow Direction.
Flow Direction: This property tells Grooper in what order to extract the box values. There are two options for the Flow Direction: Horizontal or Vertical. If you do not select the proper Flow Direction that matches with your Output Values, Grooper will not extract the correct information. Please reference the "About" section of this article to determine which Flow Direction fits your needs.
Mode | |
|
|
|
|
|
Location | |
|
|
|
|
|
|
|
|
|
Output Values | |
|
Flow Direction | |
|
|
|















