2023.1:Classification (Concept): Difference between revisions

From Grooper Wiki
No edit summary
Mharrison (talk | contribs)
m Added note that this activity is generally run before Extraction
Line 1: Line 1:
Classification, in Grooper, is the process of assigning a [[Document Type]] to a [[Batch Folder]].  Before classification, a [[Batch Folder]] can be seen as a "blank" document full of various [[Batch Page|pages]], but it doesn't know what kind of document it is yet.  Documents are classified by:
Classification, in Grooper, is the process of assigning a [[Content Type]] to a [[Batch Folder]].  Before classification, a [[Batch Folder]] can be seen as a "blank" document full of various [[Batch Page|pages]], but it doesn't know what kind of document it is yet.  Documents are classified by:


* The Separate activity by assigning a [[Content Type]] to each new folder created
* The Separate activity by assigning a [[Content Type]] to each new folder created
Line 5: Line 5:
* Manually assigning a [[Document Type]] by using the "Apply Document Type" command on a [[Batch Folder]].   
* Manually assigning a [[Document Type]] by using the "Apply Document Type" command on a [[Batch Folder]].   


During the Classify activity, Grooper will use information from the [[Batch Page|pages]] in the Batch Folder (generally text) and configurations from a [[Content Model]] to give it a [[Document Type]] from your [[Content Model]].
During the Classify activity, Grooper will use information from the [[Batch Page|pages]] in the Batch Folder (generally text) and configurations from a [[Content Model]] to give it a [[Document Type]] from your [[Content Model]]. This activity is generally performed before [[Extraction]], because until a document is classified, Grooper will not understand which Data Elements to look for and the instructions to use to identify those elements within the document.

Revision as of 09:50, 1 June 2020

Classification, in Grooper, is the process of assigning a Content Type to a Batch Folder. Before classification, a Batch Folder can be seen as a "blank" document full of various pages, but it doesn't know what kind of document it is yet. Documents are classified by:

During the Classify activity, Grooper will use information from the pages in the Batch Folder (generally text) and configurations from a Content Model to give it a Document Type from your Content Model. This activity is generally performed before Extraction, because until a document is classified, Grooper will not understand which Data Elements to look for and the instructions to use to identify those elements within the document.