2.90:Event-Based Separation (Separation Provider)
Dgreenwood (talk | contribs) No edit summary |
Revision as of 15:26, 15 October 2020
Event-Based Separation is a Separation Provider that separates documents using one or more "Separation Events". Each Separation Event triggers the creation of a new folder.
The events are as follows:
- Blank Page - A blank page will trigger a new folder.
- Barcode - A scanned barcode will trigger a new folder.
- Content Type - This Separation Event uses Lexical or Visual training examples to trigger folder creation. Whenever a page confidently matches a trained example document's first page, a new folder is created.
- Page Count - This is for fixed page separation. A new folder is created each time a set number of pages is reached.
- Shape - A new folder is created every time a "shape feature" is detected. Shape features are detected using a Shape Detection IP Command from an IP Profile.
About
When the event is triggered, one of two things can happen:
- A new folder is created and subsequent pages are appended until a new event is triggered (resulting in a new document)
- The page triggering the event may be deleted
  - Sometimes a cover sheet is placed before a document. Once the document is separated, that page does not necessarily carry any useful data. If it is truly "junk", deleting the triggering page automates its removal.
More than one event may be used per configuration of this provider.
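The behavior described in this section can be sketched in a few lines of Python. This is only an illustration, not Grooper's implementation: the `separate` function, the `delete_trigger` flag, and the idea of modeling each event as a predicate over a page are all hypothetical. It also assumes that pages appearing before the first triggered event go into an initial folder and that empty folders are dropped, neither of which is stated above.

```python
from typing import Callable, List, Sequence

# Hypothetical model: a page is a string label, an event is a predicate on a page.
Event = Callable[[str], bool]


def separate(
    pages: Sequence[str],
    events: Sequence[Event],
    delete_trigger: bool = False,
) -> List[List[str]]:
    """Group pages into folders, opening a new folder whenever any event fires."""
    folders: List[List[str]] = []
    for page in pages:
        if any(event(page) for event in events):
            folders.append([])      # an event fired: start a new folder
            if delete_trigger:
                continue            # drop the triggering page (e.g. a cover sheet)
        elif not folders:
            folders.append([])      # assumption: leading pages get their own folder
        folders[-1].append(page)
    # Assumption: folders left empty by deleted trigger pages are discarded.
    return [folder for folder in folders if folder]


pages = ["COVER", "a1", "a2", "COVER", "b1"]
is_cover = lambda page: page == "COVER"
kept = separate(pages, [is_cover])
dropped = separate(pages, [is_cover], delete_trigger=True)
```

Here `kept` retains each cover sheet as the first page of its folder, while `dropped` contains only the document pages.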
Separation Events
Blank Page
Blank pages are used as the trigger event for folder creation. The Blank Page Detection property of this event detects whether an image is blank. The blank page is used as the separation point for where a document starts. All pages after the blank page will be included in a new Batch Folder until a new blank page is detected.
Note: This event can also be used to automate deleting blank pages by setting the Delete Page property to True.
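How a page is judged blank is not described here. As a rough illustration only, one common approach is to count dark pixels and call the page blank when they make up a very small fraction of the image. The function name, the grayscale list-of-rows page model, and both thresholds below are assumptions, not Grooper's Blank Page Detection settings.

```python
from typing import Sequence


def is_blank(
    pixels: Sequence[Sequence[int]],
    dark_below: int = 128,
    max_dark_fraction: float = 0.01,
) -> bool:
    """Return True when at most max_dark_fraction of the pixels are dark.

    pixels is a grid of grayscale values (0 = black, 255 = white).
    """
    total = sum(len(row) for row in pixels)
    if total == 0:
        return True  # assumption: an empty image counts as blank
    dark = sum(1 for row in pixels for value in row if value < dark_below)
    return dark / total <= max_dark_fraction


white_page = [[255] * 10 for _ in range(10)]
# Same page with one fully black row standing in for a line of text.
text_page = [[0] * 10] + [[255] * 10 for _ in range(9)]
```

A tolerance above zero keeps scanner specks from making an otherwise empty page count as content.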
Barcode
Virtually scanned barcodes are used to trigger folder creation. Grooper will read barcodes on a page during separation. If a barcode is found (according to the property settings configured on the event), a new folder will be created.
Use the property panel to set the barcode symbology or symbologies used on your documents. You may also set a specific barcode value for the separator to watch for. If no value is set, the presence of any barcode of an assigned symbology will trigger the event.
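The matching rule above (symbology first, then an optional value) can be expressed as a small predicate. This is a sketch under assumptions: the `Barcode` class, the `barcode_triggers` function, and the symbology strings are hypothetical names for illustration and do not correspond to Grooper property names.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set


@dataclass(frozen=True)
class Barcode:
    symbology: str  # e.g. "Code39" (illustrative label only)
    value: str


def barcode_triggers(
    found: Iterable[Barcode],
    symbologies: Set[str],
    expected_value: Optional[str] = None,
) -> bool:
    """Return True if any barcode read from the page should trigger the event."""
    for barcode in found:
        if barcode.symbology not in symbologies:
            continue  # wrong symbology: ignored entirely
        if expected_value is None or barcode.value == expected_value:
            return True  # no value configured, or the configured value matched
    return False


page_barcodes = [Barcode("QR", "https://example.com"), Barcode("Code39", "SEP")]
```

With only `{"Code39"}` assigned and no value set, this page triggers the event. With the value set to something other than `"SEP"`, it does not.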
Page Count
The Page Count event is used for fixed page separation. For example, if you expect a new document to exist in a Batch every three pages, you can set the Page Count of this event to "3". A new Batch Folder will be created every three pages, with three Batch Pages placed in each created folder.
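Fixed page separation amounts to splitting the page sequence into equal-sized chunks. The sketch below illustrates that idea. The function name is hypothetical, and placing leftover pages in a shorter final folder is an assumption, since the behavior for a Batch whose length is not a multiple of the Page Count is not described here.

```python
from typing import List, Sequence


def separate_by_page_count(pages: Sequence[str], page_count: int) -> List[List[str]]:
    """Split pages into consecutive folders of page_count pages each."""
    if page_count < 1:
        raise ValueError("page_count must be at least 1")
    # Assumption: any leftover pages form a shorter final folder.
    return [
        list(pages[start:start + page_count])
        for start in range(0, len(pages), page_count)
    ]


batch = ["p1", "p2", "p3", "p4", "p5", "p6", "p7"]
folders = separate_by_page_count(batch, 3)
```

For the seven-page batch above with a Page Count of 3, this yields two full folders and one folder holding the single remaining page.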
Shape
The Shape event allows images on a page, such as a stamp or a logo, to be used to separate documents. "Shape features" are used as the trigger event for Event-Based Separation. First, shape features must be saved to the page using a Shape Detection IP Command from an IP Profile. This allows Grooper to "see" whether or not a shape is on a page. Once the feature is encountered on the page, the event is triggered, allowing a new document folder to be created.
Content Type
This Separation Event uses trained examples of documents to establish the separation points between them. If the first page of every document in a Batch can be matched to the first page of a trained example of a Document Type in a Content Model, separation can start when a page matches the first page of a Document Type and stop when another page matches the first page of a Document Type. Furthermore, the created folder can be classified as the Document Type that the page matched.
Any page classified as Page 1 of a Document Type in a Content Model will trigger the event. A new Batch Folder will be created and the page will be placed inside. Subsequent pages will be included in the folder until a new Page 1 of a Document Type is found.
Training data from both Lexical and Visual Classification Methods can be used. The Content Type event works particularly well when using Visual classification. This event can allow Visual classification and separation of documents within a single Separate step in Batch Process and even separation and classification in real time during scanning.
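The "Page 1" trigger and the folder classification described in this section can be sketched as follows. This is an illustration only: the `PageMatch` and `Folder` classes, the `min_confidence` threshold, and its default value are hypothetical, and the classifier that produces each match (Lexical or Visual) is assumed to have already run. Pages before the first confident Page 1 match are assumed to go into an unclassified folder.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Sequence


@dataclass
class PageMatch:
    name: str
    document_type: Optional[str]  # best-matching Document Type, if any
    page_number: Optional[int]    # which trained page it resembles
    confidence: float             # 0.0 to 1.0 (assumed scale)


@dataclass
class Folder:
    document_type: Optional[str]
    pages: List[str] = field(default_factory=list)


def separate_by_content_type(
    matches: Sequence[PageMatch], min_confidence: float = 0.8
) -> List[Folder]:
    """Open a new, pre-classified folder at every confident Page 1 match."""
    folders: List[Folder] = []
    for match in matches:
        is_first_page = (
            match.document_type is not None
            and match.page_number == 1
            and match.confidence >= min_confidence
        )
        if is_first_page:
            folders.append(Folder(match.document_type))
        elif not folders:
            folders.append(Folder(None))  # assumption: leading pages stay unclassified
        folders[-1].pages.append(match.name)
    return folders


batch = [
    PageMatch("p1", "Invoice", 1, 0.95),
    PageMatch("p2", None, None, 0.0),
    PageMatch("p3", "Invoice", 1, 0.40),  # below threshold: no new folder
    PageMatch("p4", "Receipt", 1, 0.90),
]
result = separate_by_content_type(batch)
```

Because each folder is created with its Document Type already assigned, separation and classification happen in the same pass.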