2023.1:Split (Collation Provider)

From Grooper Wiki
Revision as of 14:19, 19 April 2024 by Rpatton (talk | contribs) (wip // via Wikitext Extension for VSCode)

This article is about an older version of Grooper.

Information may be out of date and UI elements may have changed.

20252023.1

WIP

This article is a work-in-progress or created as a placeholder for testing purposes. This article is subject to change and/or expansion. It may be incomplete, inaccurate, or stop abruptly.

This tag will be removed upon draft completion.

Split is one of many Collation Providers you can use in Grooper to combine or organize extracted data based on the data's layout relationship. It is used to divide up a page into smaller sections, allowing you to extract from those sections rather than the whole page.

You may download the ZIP(s) below and upload it into your own Grooper environment (version 2023.1). The first contains one or more Batches of sample documents. The second contains one or more Projects with resources used in examples throughout this article.

About

The Split Collation Provider is a tool used to divide up a document into smaller sections. This allows you to extract text from a smaller section rather than the whole page.

The Provider splits the page based on what the Data Type is extracting and the configured Split Position property. There are four different positions to consider:

  • Begin
  • End
  • Between
  • Around

How To

Begin

End

Between

Around