Multi Separator (Separation Provider)
STUB: This article is a stub. It contains minimal information on the topic and should be expanded. Would you like to see this article expanded? Let us know at groopereducation@bisok.com.
Multi Separator is a Separation Provider that separates loose pages into documents using multiple Separation Providers. It lets users build an ordered list of any of the other Separation Providers. If the first provider on the list fails to separate a page (or, as is more often the case, a series of pages), the next one is applied. If that fails, the next is applied, and so on.
The first Separation Provider should be considered the "main provider". The subsequent Separation Providers will only attempt to create a document folder if the main provider (or the providers before it) leaves the page as a loose page.
Several Separation Providers continue placing pages into a folder after a condition is met. Pattern-Based Separation is the most obvious example. If the provider encounters a page with text that matches a regex pattern, it creates a document folder and continues placing subsequent pages into that folder until the pattern is matched again. Once pages are in a folder, none of the subsequent Separation Providers will attempt to execute for those already foldered pages (even if the pages would have qualified for separation had they been left loose).
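The fallback behavior described above can be modeled in a few lines of Python. This is only an illustrative sketch, not Grooper's implementation: the names `pattern_provider`, `fixed_page_count` and `multi_separate` are hypothetical, pages are reduced to plain strings, and a provider is assumed to be a function that receives a run of consecutive loose pages and returns the groups of pages it wants to folder.

```python
import re
from typing import Callable, List, Optional

# Hypothetical model: a provider receives the text of a run of consecutive
# loose pages and returns groups of positions (within that run) to folder.
# Positions it does not return stay loose.
Provider = Callable[[List[str]], List[List[int]]]

def pattern_provider(pattern: str) -> Provider:
    """Start a folder at each matching page and keep filling it until the next match."""
    regex = re.compile(pattern)
    def separate(run: List[str]) -> List[List[int]]:
        folders: List[List[int]] = []
        for pos, text in enumerate(run):
            if regex.search(text):
                folders.append([pos])      # a match opens a new folder
            elif folders:
                folders[-1].append(pos)    # later pages join the open folder
            # pages before the first match are left loose
        return folders
    return separate

def fixed_page_count(n: int) -> Provider:
    """Hypothetical fallback: folder every n consecutive loose pages."""
    def separate(run: List[str]) -> List[List[int]]:
        return [list(range(i, min(i + n, len(run)))) for i in range(0, len(run), n)]
    return separate

def multi_separate(pages: List[str], providers: List[Provider]) -> List[Optional[int]]:
    """Apply providers in order; each one only sees pages that are still loose."""
    folder_of: List[Optional[int]] = [None] * len(pages)  # None means loose
    next_id = 0
    for provider in providers:
        i = 0
        while i < len(pages):
            if folder_of[i] is not None:   # already foldered: skip
                i += 1
                continue
            j = i
            while j < len(pages) and folder_of[j] is None:
                j += 1                     # pages[i:j] is one run of loose pages
            for group in provider(pages[i:j]):
                for pos in group:
                    folder_of[i + pos] = next_id
                next_id += 1
            i = j
    return folder_of

pages = ["cover sheet", "notes", "Invoice 1", "page 2", "Invoice 2", "page 2"]
result = multi_separate(pages, [pattern_provider(r"Invoice"), fixed_page_count(2)])
print(result)  # [2, 2, 0, 0, 1, 1]
```

In this example the pattern provider (the "main provider") folders the two invoices and leaves the first two pages loose, so only those two pages reach the fallback provider. The invoice pages are never offered to the second provider, even though it would have been able to folder them.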
Be advised: as a practical matter, this provider has limited functionality. There are very few real-world circumstances where it is used. Most users who need to separate using multiple providers end up using multiple Separate steps instead.
Glossary
Multi Separator: Multi Separator is a Separation Provider that separates loose pages into documents using multiple Separation Providers. It lets users build an ordered list of any of the other Separation Providers. If the first provider on the list fails to separate a page (or, as is more often the case, a series of pages), the next one is applied. If that fails, the next is applied, and so on.
Pattern-Based Separation: Pattern-Based Separation is a Separation Provider that creates a new document folder every time a value returned by a defined pattern is encountered on a page.
Pattern-Based: Pattern-Based is a Collation Provider option for Data Type extractors. Pattern-Based uses regular expressions to sequence returned results into a final result set.
Separate: Separate is an Activity that sorts Batch Pages into individual Batch Folders. This distinguishes "loose pages" from the documents formed by those pages. Once loose pages are separated into Batch Folder documents, they can be further processed by Classify, Extract, Export and other Activities that need to run on the folder (i.e. document) level.
Separation Provider: Separation Providers divide a sequence of Batch Pages into logical documents. They define the rules and criteria for grouping pages and determining document boundaries. Separation is a foundational step in document workflows, transforming a continuous stream of scanned or imported pages into discrete documents ready for classification, extraction, validation, and export.
- The Separate activity's Provider setting specifies which Separation Provider is applied to the selected Batch Scope.
- One Batch Folder is created for each span of pages identified as a document by the Separation Provider.
Separation: Separation is the process of taking an unorganized Batch of loose Batch Pages and organizing them into documents represented by Batch Folders in Grooper. This is done so Grooper can later assign a Document Type to each document folder in a process known as "classification".