2.90:Collation Provider (Property)
Collation Providers allow Data Type extractor results to be combined, organized, or utilized in specific ways.
Results can be combined, made into arrays, returned as a key-value pair's value, and more.
The following Collation Providers are available in Grooper:
- Individual
- Combine
- Key-Value Pair
- Key-Value List
- Array
- Ordered Array
- Split
- Pattern-Based
- Multi-Column
About
Data Type extractors in Grooper use regular expression to match a document's text data in order to return a particular piece of information. Extractors serve a variety of purposes. They can be used to populate fields in a Data Model, to separate and classify documents, to break up a document into sections, and more. For the most part, any time part of document's text data is needed or useful to do something, you need an extractor to find and return it.
Often, this requires something more complex than returning a single result. The relationships between multiple extraction results are often important. The fact results are physically related to each other on the page, or text exists between one or more results, or results are in one order versus another can be used accomplish various goals in Grooper.
For example, the Individual, Array, and Ordered Array Collation Providers all collate results differently.
Individual
The Individual Collation Provider returns all extraction results individually. This is the default Collation Provider for Data Type extractors.
|
Array
The Array Collation Provider organizes and returns results much differently.
- First, it will only return results if multiple extraction results are lined up in a particular order on the page, according to the "layout" set for this provider. For example, an Array collated extractor using a Horizontal Layout will only return results if they are aligned horizontally, one result after another from left to right.
- Second, instead of each result being returned individually, all results meeting the layout requirements are returned as a single value.
Essentially, an Array collated result is a collection of results who share a layout relationship, that are all lined up together (either horizontally, vertically, or in the left/right and top/bottom text flow of the document).
|

