Highlight Zone (Extractor Type)

From Grooper Wiki
(Redirected from Highlight Zone)

This article was migrated from an older version and has not been updated for the current version of Grooper.

This tag will be removed upon article review and update.

This article is about the current version of Grooper.

Note that some content may still need to be updated.

2025 2023

Highlight Zone is an Extractor Type that sets a highlight region on a document without performing any actual data extraction. This "extractor" is used to mark areas of interest or importance for Review users or for uncommon scenarios where a data instance location is needed with no actual value.

Please note, this "extractor" doesn't actually extract any text from a document. It is most often implemented as a visual aid for users reviewing documents during Review, highlighting troublesome fields on a document users will need to manually enter.

You may download the ZIP(s) below and upload it into your own Grooper environment (version 2023). The first contains a Project with resources used in examples throughout this article. The second contains one or more Batches of sample documents.

About

Highlight Zone is a zonal extractor designed to aid Review users performing data review by highlighting specific region on a document that needs to be verified. It's very similar to the Read Zone extractor in that you use one of the four Location options (Fixed Region, Relative Region, Shape Region or Text Region) to draw an extraction zone on a geographic region of the page.

However, rather than returning the OCR or native text data within the zone, nothing is actually extracted. Instead, it simply places a zone in a particular location on a document. In terms of a "data instance", it returns an instance location but no text value whatsoever. Most commonly, this extractor will be used to aid document reviewers, highlighting a troublesome field on the document for manual review.

BE AWARE: Highlight Zone's setup is similar to the Read Zone extractor in that it uses the same Location properties the Read Zone extractor uses to draw the an extraction zone.

  • This article presumes you are familiar with the Read Zone extractor and its setup.
  • If you are not familiar with Read Zone, you may find it helpful to review the Read Zone article prior to following the tutorial in this article.


How To


In this example, we are going to collect the Signature Date from the Application for Cow Ownership document.

Often, Grooper has a difficult time extracting hand written information. OCR generally just does not work well on handwritten information. We are going to assume this might be the case for these handwritten dates, so we will need a Reviewer to come back through and manually enter in the information from the document. We can use Highlight Zone to make this easier for the Reviewer.