2023.1:Classification (Concept): Difference between revisions
Configadmin (talk | contribs) No edit summary |
m Added note that this activity is generally run before Extraction |
||
Line 1: | Line 1: | ||
Classification, in Grooper, is the process of assigning a [[ | Classification, in Grooper, is the process of assigning a [[Content Type]] to a [[Batch Folder]]. Before classification, a [[Batch Folder]] can be seen as a "blank" document full of various [[Batch Page|pages]], but it doesn't know what kind of document it is yet. Documents are classified by: | ||
* The Separate activity by assigning a [[Content Type]] to each new folder created | * The Separate activity by assigning a [[Content Type]] to each new folder created | ||
Line 5: | Line 5: | ||
* Manually assigning a [[Document Type]] by using the "Apply Document Type" command on a [[Batch Folder]]. | * Manually assigning a [[Document Type]] by using the "Apply Document Type" command on a [[Batch Folder]]. | ||
During the Classify activity, Grooper will use information from the [[Batch Page|pages]] in the Batch Folder (generally text) and configurations from a [[Content Model]] to give it a [[Document Type]] from your [[Content Model]]. | During the Classify activity, Grooper will use information from the [[Batch Page|pages]] in the Batch Folder (generally text) and configurations from a [[Content Model]] to give it a [[Document Type]] from your [[Content Model]]. This activity is generally performed before [[Extraction]], because until a document is classified, Grooper will not understand which Data Elements to look for and the instructions to use to identify those elements within the document. |
Revision as of 09:50, 1 June 2020
Classification, in Grooper, is the process of assigning a Content Type to a Batch Folder. Before classification, a Batch Folder can be seen as a "blank" document full of various pages, but it doesn't know what kind of document it is yet. Documents are classified by:
- The Separate activity by assigning a Content Type to each new folder created
- The Classify activity using logic set on a Content Model or
- Manually assigning a Document Type by using the "Apply Document Type" command on a Batch Folder.
During the Classify activity, Grooper will use information from the pages in the Batch Folder (generally text) and configurations from a Content Model to give it a Document Type from your Content Model. This activity is generally performed before Extraction, because until a document is classified, Grooper will not understand which Data Elements to look for and the instructions to use to identify those elements within the document.