2021:Behaviors (Property): Difference between revisions

From Grooper Wiki
No edit summary
No edit summary
Line 39: Line 39:
|}
|}
====Labeling====
====Labeling====
{|cellpadding=10 cellspacing=5
|valign=top style="width:50%"|
The ''Labeling Behavior'' is a '''Content Type''' '''''Behavior''''' designed to collect and utilize a document's field labels in a variety of ways.  This includes functionality for classification and data extraction.
The ''Labeling Behavior'' is a '''Content Type''' '''''Behavior''''' designed to collect and utilize a document's field labels in a variety of ways.  This includes functionality for classification and data extraction.
The ''Labeling Behavior'' functionality allows Grooper users to quickly onboard new '''Document Types''' for structured and semi-structured forms, utilizing labels as a thumbprint for classification and data extraction purposes.  Once the ''Labeling Behavior'' is enabled, labels are identified and collected using the "Labels" tab of '''Document Types'''.  These "Label Sets" can then be used for the following purposes:
* Document classification - Using the ''Labelset-Based'' '''''Classification Method'''''
* Field based data extraction - Using the ''Labeled Value'' '''''Extractor Type'''''
* Tabular data extraction - Using a '''Data Table''' object's ''Tabular Layout'' '''''Extract Method'''''
* Sectional data extraction - Using a '''Data Section''' object's ''Transaction Detection'' '''''Extract Method'''''


Understanding the full functionality of the ''Labeling Behavior'' is worthy of it’s own article, which can be found [[Labeling Behavior - 2021|here]].
Understanding the full functionality of the ''Labeling Behavior'' is worthy of it’s own article, which can be found [[Labeling Behavior - 2021|here]].
|
[[File:Labeling-behavior-label-interface.png]]
|}
====PDF Generate====
====PDF Generate====
{|cellpadding=10 cellspacing=5
{|cellpadding=10 cellspacing=5

Revision as of 11:36, 25 March 2021

2021

This article is in development for the upcoming version of Grooper, Grooper 2021. Behaviors are a new set of data control functionalities in 2021. This information is incomplete and/or may change by the time of release.
Graphic depicting the notion of Weighted Rules.

Behaviors are properties set on Content Types and Export activities that control the flow of documents into and out of Grooper .

About

Behaviors are new to Grooper 2021 and are born of the idea that consolidating the control of the flow of data to the objects most relevant to its collection and delivery (Content Types{Content Model, Content Category, Document Type} and the Export activity) creates a more streamlined and effective Grooper experience. If the Content Type controls all the relevant information about how documents are organized and what is to be collected from them, it is logical that controlling how those documents come into Grooper, and how they get out, should also be defined by the same object. It is also a much cleaner experience to no longer have multiple steps in a Batch Process to cover the types of Export activities (Legacy Export, Mapped Export, Database Export), but instead a single Export activity that leverages the Behaviors of the Content Type, Behaviors defined on the Export activity, or both.

If you take CMIS Connections, for example, Grooper previously created an unnecessary amount of objects to get to the CMIS Content Type. The mappings for importing and exporting were also set here, which made for more places which a person need to go to configure the flow of data into and out of Grooper. It also created a bottle neck of control because if you wanted to have disparate Content Types use a similar CMIS connection, but the import/export mappings inherently needed to be different, you would have had to create a separate CMIS Content Type. Now, one CMIS Repository object can exist, and the import/export mechanisms are controlled via Behaviors (again, defined on Content Types or Export activities.)

Types of Behaviors

There are four types of Behaviors that can be added to a Content Type or Export activity.

Import

Import Behaviors control the flow of documents into Grooper by establishing communication, an Import Definition (of which multiple can exist), between a CMIS Repository object and a Grooper Content Type.

CMIS Content Type objects no longer exist in Grooper 2021, therefore Import mappings are no longer set there. They are instead set on an Import Behavior of a Content Type. This elminates the need for multiple CMIS Conent Type objects to exist for the individual needs of discrete Content Types, and instead consolidates the control to the object most pertinent to the import functionality.

Export

Export Behaviors control the flow of documents out of Grooper by defining export connectivity to external systems such as file systems (directly through a File Export definition, or a CMIS Repository object), content management systems (via CMIS Repository objects), Database Tables, mail servers (directly through an IMAP Export definition, or a CMIS Repository object), or FTP/SFTP servers (directly through FTP/SFTP Export definitions, or a CMIS Repository object).

CMIS Content Type objects no longer exist in Grooper 2021, therefore Export mappings are no longer set there. They are instead set on the Export Behavior of a Content Type , or a Batch Process Step set to Export on its Activity Type property. This elminates the need for multiple CMIS Conent Type objects to exist for the individual needs of discrete Content Types, and instead consolidates the control to the object most pertinent to the export functionality.

Labeling

The Labeling Behavior is a Content Type Behavior designed to collect and utilize a document's field labels in a variety of ways. This includes functionality for classification and data extraction.

The Labeling Behavior functionality allows Grooper users to quickly onboard new Document Types for structured and semi-structured forms, utilizing labels as a thumbprint for classification and data extraction purposes. Once the Labeling Behavior is enabled, labels are identified and collected using the "Labels" tab of Document Types. These "Label Sets" can then be used for the following purposes:

  • Document classification - Using the Labelset-Based Classification Method
  • Field based data extraction - Using the Labeled Value Extractor Type
  • Tabular data extraction - Using a Data Table object's Tabular Layout Extract Method
  • Sectional data extraction - Using a Data Section object's Transaction Detection Extract Method

Understanding the full functionality of the Labeling Behavior is worthy of it’s own article, which can be found here.

PDF Generate

The PDF Generate Behavior is a Content Type Behavior designed to create an exportable PDF file with additional native PDF elements, using the classification and extraction content of a Batch Folder. This includes capabilities to export extracted data as PDF metadata, inserting bookmarks, and creating PDF annotations, such as highlighting, checkbox and signature widgets.

Understanding the full functionality of the PDF Generate Behavior is worthy of it’s own article, which can be found here.

How To

Foundations

SharePoint

Document Understanding

CMIS Repository

Database

Data Model & Extraction

Import Behavior Setup

Behaviors

Import Definitions

Read Mappings

Write Mappings

Export Behaviors Setup

Content Model Export Behavior

Batch Process Export Behavior

Execution and Results

Import Query

Reviewing Classification

Viewing Results