Activity

From Grooper Wiki
Revision as of 09:15, 1 May 2025 by Dgreenwood (talk | contribs)

STUB

This article is a stub. It contains minimal information on the topic and should be expanded.


This article is about the current version of Grooper.

Note that some content may still need to be updated.

2025 2023.120232021

Grooper Activities define specific document processing operations done to a inventory_2 Batch, folder Batch Folder, or contract Batch Page. In a settings Batch Process, each edit_document Batch Process Step executes a single Activity (determined by the step's "Activity" property).

  • Batch Process Steps are frequently referred by the name of their configured Activity followed by the word "step". For example: "Classify step".

For example, OCR data is obtained from pages via the Recognize activity. Processed documents are exported to a storage platform via the Export activity.

Activities fall into one of two categories: "Code Activities" and "Attended Activities".

  • Code Activities are automated. They are performed by Activity Processing services and do not require human interaction.
    • There are over thirty different Code Activities in Grooper.
    • For example: Classify is a Code Activity where documents are classified according to a Content Model.
  • Attended Activities are not automated. The are performed by a human operator and require human interaction.
    • There is a single Attended Activity: the Review activity.
    • Review steps are added to a Batch Process to allow a user to verify Grooper's automated results.
    • Review provides different user-interfaces, called "Viewers", to review different things. For example, the "Data Viewer" allows user to validate and correct data collected by the Extract activity.

Attended Activities

Review is the only Attended Activity in Grooper. Depending on what the user needs to review, one or more "Review Views" will be added to the Review step. These give users a specialized user interfaces allowing them to review the Batch and its content. The following Review Viewers are currently available:

  • Scan Viewer - Gives users an interface to scan documents into a Batch using an optical scanner.
  • Thumbnail Viewer - Gives users an interface to review individual pages in a Batch. Typically this is used to review the results of an IP Profile applied by an Image Processing step.
  • Classification Viewer - Gives users an interface to review and edit classification results made by a Classify step.
  • Separation Viewer - Gives users an interface to review and edit separation and classification decisions made by the ESP Auto Separation provider during a Separate step.
  • Data Viewer - Gives users an interface to review and edit index data collected during the Extract step.
  • Folder Viewer - Gives users a basic interface to navigate through folders and pages in a Batch using a tree viewer.

Code Activities

There are many more Code Activities in Grooper. They fall into the following categories:

Cleanup and Recognition

These Activities are used to condition documents for further processing.

  • The most commonly used Activity in this category is the all-important "Recognize" activity, which obtains machine readable text from image-based and native-text pages.

Document Processing

These activities process documents in a variety of different ways.

  • This includes some of the most commonly used Grooper Activities, such as Separate, Classify, Extract and Export.

Microform Processing

These activities specifically apply to processing microfiche. For more information, visit our Microfiche Processing article.

Transform

These activities transform document content from one form to another.

  • The most commonly used Activity in this category is "Split Pages" which creates pages nodes from a PDF file attached to a Batch Folder.

Utilities

These are miscellaneous activities that don't fit in well into the other categories.

  • The most commonly used Activity in this category is "Execute" which gives users the capability to automate various object commands normally available by right clicking nodes in Grooper.