2023.1:Classification (Concept)

From Grooper Wiki
Revision as of 09:50, 1 June 2020 by Mharrison (talk | contribs) (Added note that this activity is generally run before Extraction)

Classification, in Grooper, is the process of assigning a Content Type to a Batch Folder. Before classification, a Batch Folder is effectively a "blank" document: it contains pages, but Grooper does not yet know what kind of document those pages make up. Documents are classified as follows:

During the Classify activity, Grooper uses information from the pages in the Batch Folder (generally text), together with the configuration of a Content Model, to assign the Batch Folder a Document Type from that Content Model. This activity is generally performed before Extraction: until a document is classified, Grooper does not know which Data Elements to look for or which instructions to use to identify those elements within the document.
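The relationship between classification and extraction can be illustrated with a small sketch. This is not Grooper's API or its actual classification method: the `DocumentType`, `BatchFolder`, `classify`, and `elements_to_extract` names are hypothetical, and simple keyword counting stands in for whatever the Content Model is really configured to do. It only shows the idea that page text plus Content Model configuration yields a Document Type, and that the Document Type then determines which Data Elements to extract.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DocumentType:
    """Hypothetical stand-in for a Document Type in a Content Model."""
    name: str
    keywords: List[str]  # text cues used to recognize this type (assumption)
    data_elements: List[str] = field(default_factory=list)  # fields to extract later


@dataclass
class BatchFolder:
    """Hypothetical stand-in for a Batch Folder: pages of text, no type yet."""
    pages: List[str]
    document_type: Optional[DocumentType] = None


def classify(folder: BatchFolder, content_model: List[DocumentType]) -> BatchFolder:
    """Assign the best-matching Document Type, or leave the folder unclassified."""
    text = " ".join(folder.pages).lower()
    best, best_score = None, 0
    for doc_type in content_model:
        # Score each type by how often its keywords occur in the page text.
        score = sum(text.count(k.lower()) for k in doc_type.keywords)
        if score > best_score:
            best, best_score = doc_type, score
    folder.document_type = best
    return folder


def elements_to_extract(folder: BatchFolder) -> List[str]:
    """Extraction needs a Document Type to know which Data Elements to look for."""
    if folder.document_type is None:
        raise ValueError("classify the folder before extraction")
    return folder.document_type.data_elements


# Toy Content Model with two made-up Document Types.
content_model = [
    DocumentType("Invoice", ["invoice", "amount due"], ["Invoice Number", "Total"]),
    DocumentType("Purchase Order", ["purchase order", "ship to"], ["PO Number"]),
]

folder = BatchFolder(pages=["INVOICE #1001", "Amount due: $250.00"])
classify(folder, content_model)
print(folder.document_type.name)    # Invoice
print(elements_to_extract(folder))  # ['Invoice Number', 'Total']
```

In this sketch, calling `elements_to_extract` on an unclassified folder raises an error, which mirrors why the Classify activity generally comes before Extraction.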