2023:Highlight Zone (Value Extractor): Difference between revisions

From Grooper Wiki
No edit summary
No edit summary
Line 1: Line 1:
<blockquote>
<blockquote>
The '''''Highlight Zone''''' extractor uses zonal information to highlight a portion on a document.
The '''''Highlight Zone''''' extractor uses zonal information to highlight a region of a document.
</blockquote>
</blockquote>


Please note, this "extractor" doesn't actually extract any text from a document.  It is most often implemented as a visual aid only during Data Review. It uses many of the same functions as the ''Read Zone'' extractor, so please review the [[Read Zone - 2023]] article prior to following the tutorial in this article.  
Please note, this "extractor" doesn't actually extract any text from a document.  It is most often implemented as a visual aid for users reviewing documents during '''Review''', highlighting troublesome fields on a document users will need to manually enter.


== About ==
== About ==
''Highlight Zone'' is a zonal extractor specifically designed to aid in Data Review by highlighting specific text on a document that needs to be verified. It's very similar to the Read Zone extractor in that you use one of the four Location options (Fixed Region, Relative Region, Shape Region or Text Region) to draw an extraction zone on a geographic region of the page.
''Highlight Zone'' is a zonal extractor designed to aid '''Review''' users performing data review by highlighting specific region on a document that needs to be verified. It's very similar to the '''''Read Zone''''' extractor in that you use one of the four '''''Location''''' options (''Fixed Region'', ''Relative Region'', ''Shape Region'' or ''Text Region'') to draw an extraction zone on a geographic region of the page.


However, rather than returning the OCR or native text data within the zone, nothing is actually extracted. Instead, it simply places a zone in a particular location on a document. In terms of a "data instance", it returns an instance location but no text value whatsoever. Most commonly, this extractor will be used to aid document reviewers, highlighting a troublesome field on the document for manual review.
However, rather than returning the OCR or native text data within the zone, nothing is actually extracted. Instead, it simply places a zone in a particular location on a document. In terms of a "data instance", it returns an instance location but no text value whatsoever. Most commonly, this extractor will be used to aid document reviewers, highlighting a troublesome field on the document for manual review.
Line 14: Line 14:
|⚠
|⚠
|
|
This extractor shares some similarities to the ''Read Zone'' extractor. It is recommended you become familiar with the different aspects of the ''Read Zone'' extractor before configuring the ''Detect Signature'' extractor. For more information, please see the [[Read Zone - 2023]] article.
'''BE AWARE:''' '''''Highlight Zone's''''' setup is similar to the '''''Read Zone''''' extractor in that it uses the same '''''Location''''' properties the '''''Read Zone''''' extractor uses to draw the an extraction zone.
* This article presumes you are familiar with the '''''Read Zone''''' extractor and its setup.
* If you are not familiar with '''''Read Zone''''', you may find it helpful to review the [[Read Zone]] article prior to following the tutorial in this article.
|}
|}


== How To ==
== How To ==

Revision as of 17:00, 29 December 2023

The Highlight Zone extractor uses zonal information to highlight a region of a document.

Please note, this "extractor" doesn't actually extract any text from a document. It is most often implemented as a visual aid for users reviewing documents during Review, highlighting troublesome fields on a document users will need to manually enter.

About

Highlight Zone is a zonal extractor designed to aid Review users performing data review by highlighting specific region on a document that needs to be verified. It's very similar to the Read Zone extractor in that you use one of the four Location options (Fixed Region, Relative Region, Shape Region or Text Region) to draw an extraction zone on a geographic region of the page.

However, rather than returning the OCR or native text data within the zone, nothing is actually extracted. Instead, it simply places a zone in a particular location on a document. In terms of a "data instance", it returns an instance location but no text value whatsoever. Most commonly, this extractor will be used to aid document reviewers, highlighting a troublesome field on the document for manual review.

BE AWARE: Highlight Zone's setup is similar to the Read Zone extractor in that it uses the same Location properties the Read Zone extractor uses to draw the an extraction zone.

  • This article presumes you are familiar with the Read Zone extractor and its setup.
  • If you are not familiar with Read Zone, you may find it helpful to review the Read Zone article prior to following the tutorial in this article.


How To


In this example, we are going to collect the Signature Date from the Application for Cow Ownership document.

Often, Grooper has a difficult time extracting hand written information. OCR generally just does not work well on handwritten information. We are going to assume this might be the case for these handwritten dates, so we will need a Reviewer to come back through and manually enter in the information from the document. We can use Highlight Zone to make this easier for the Reviewer.