What's New in Grooper 2021: Difference between revisions

From Grooper Wiki
No edit summary
No edit summary
Line 1: Line 1:
Below you will find helpful links to all the articles about the new/changed functionality in this version of Grooper.
__NOTOC__


== Welcome to Grooper 2021! ==


*[[Value Reader]]
=== Introducing... Behaviors! ===
** Including new/changed Extractor Types!
 
'''''[[Behaviors]]''''' are a new set of features designed to centralize the '''Content Model''' as the main hub controlling various aspects of document processing.  ''''Behaviors'''' are born of the idea that consolidating the the flow of data to the objects most relevant to its collection and delivery makes for a more streamlined and effective Grooper experience.
 
This allows a '''Content Model''' (and its component '''Content Types''') to wrest control from various other disparate '''Activities''', centralizing command of how documents and their data are modeled and what happens to that data once collected.  The result is more focused control about how data is imported, collected, and exported by a '''Content Model'''.  In other words, how it "behaves".
 
The following Behavior Types are introduced in 2021:
 
* '''''Import Behavior'''''
* '''''Export Behavior'''''
* '''''Labeling Behavior'''''
* '''''PDF Data Maping'''''
* '''''Text Rendering'''''
 
=== Introducing... Label Sets! ====


*[[Behaviors]]
** [[PDF Generate Behavior]]
** [[Labeling Behavior]]
** [[Labeling Behavior]]
*** Including Label Set based classification and data extraction techniques
 
==== Introducing... PDF Data Mapping! ====
 
* [[PDF Generate Behavior]]
 
=== Changes to Document Export and Database Export ===
 
==== Goodbye Document Export and Database Export.  Hello Export! ====
 
In 2021, we heavily reworked Grooper's document and data export functionality, to improve the process and allow for new functionality.  As part of this process, we unified '''Document Export''' and '''Database Export''' into a single '''Activity''':  '''Export'''
 
'''Export''' is now the single '''Activity''' driving all export operations in Grooper.  Whether exporting PDFs to a content management system, exporting data to a database, or any content to any external storage platform, '''Export''' is your way to go.
 
==== Goodbye CMIS Content Types.  Hello Import and Export Behaviors! ====
 
One big change to how things were done before 2021 is how data is mapped according to its '''Data Model''' structure to or from an external storage platform upon document import or export.  Previously, these mappings were configured using the CMIS Content Type objects, created as children of a CMIS Connection.
 
In 2021, the '''CMIS Connection''' object purely serves the function of integrating Grooper with an external storage platform.  Import and export mappings are defined using '''''Import''' or '''''Export Behaviors'''''.  This removes some unnecessary object bloat around the '''CMIS Connection''' object and lets the '''Content Model''' and '''Document Types''' drive their associated '''Data Model''' mappings.
 
Import and Export Behaviors are configurable via:
* The '''Export''' '''Activity'''
* '''Content Models''' or '''Document Types'''
 
=== Introducing... Data Rules! ===


*[[Data Rule]]
*[[Data Rule]]
* Apply Rules and Convert Data activity types
=== Introducing... API! ===
=== Data Extraction Improvements ===
*[[Value Reader]]
** Including new/changed Extractor Types!


*[[Vertical Wrap]] for easier stacked label matching.
*[[Vertical Wrap]] for easier stacked label matching.
Line 16: Line 59:
*[[Constrained Wrap]] for easier pattern matching for data constrained in a box (think table cells).
*[[Constrained Wrap]] for easier pattern matching for data constrained in a box (think table cells).


*Changes to [[Content Action#Changes to Content Action in Version 2021|Content Action]]
* New and improved Table Extraction methods
** [[Tabular Layout]]
** [[Delimited Extract]]
** Fixed Width
** Fluid Layout
 
* New Data Section Methods
** [[Transaction Detection]]
** [[Nested Table]]
 
=== Install and Setup Changes ===


*Changes to [[Grooper Config - 2021|Grooper Config]]
*Changes to [[Grooper Config - 2021|Grooper Config]]
Line 22: Line 75:
*Changes to [[Install and Setup|Install and Setup - 2021]]
*Changes to [[Install and Setup|Install and Setup - 2021]]
** Including changes to Grooper Repository creation and connection.
** Including changes to Grooper Repository creation and connection.
* Licensing changes
=== Miscellaneous ===


ARTICLES COMING SOON
*Changes to [[Content Action#Changes to Content Action in Version 2021|Content Action]]
* New Table Extraction Methods/Changes
* New Data Section Methods
* Document ingestion API integration
* Licensing changes
* Document Viewer improvements
* Document Viewer improvements
* Text file improvements
* Text file improvements
* Apply Rules and Convert Data activity types

Revision as of 10:23, 30 August 2021


Welcome to Grooper 2021!

Introducing... Behaviors!

Behaviors are a new set of features designed to centralize the Content Model as the main hub controlling various aspects of document processing. 'Behaviors' are born of the idea that consolidating the the flow of data to the objects most relevant to its collection and delivery makes for a more streamlined and effective Grooper experience.

This allows a Content Model (and its component Content Types) to wrest control from various other disparate Activities, centralizing command of how documents and their data are modeled and what happens to that data once collected. The result is more focused control about how data is imported, collected, and exported by a Content Model. In other words, how it "behaves".

The following Behavior Types are introduced in 2021:

  • Import Behavior
  • Export Behavior
  • Labeling Behavior
  • PDF Data Maping
  • Text Rendering

Introducing... Label Sets! =

Introducing... PDF Data Mapping!

Changes to Document Export and Database Export

Goodbye Document Export and Database Export. Hello Export!

In 2021, we heavily reworked Grooper's document and data export functionality, to improve the process and allow for new functionality. As part of this process, we unified Document Export and Database Export into a single Activity: Export

Export is now the single Activity driving all export operations in Grooper. Whether exporting PDFs to a content management system, exporting data to a database, or any content to any external storage platform, Export is your way to go.

Goodbye CMIS Content Types. Hello Import and Export Behaviors!

One big change to how things were done before 2021 is how data is mapped according to its Data Model structure to or from an external storage platform upon document import or export. Previously, these mappings were configured using the CMIS Content Type objects, created as children of a CMIS Connection.

In 2021, the CMIS Connection object purely serves the function of integrating Grooper with an external storage platform. Import and export mappings are defined using Import or Export Behaviors. This removes some unnecessary object bloat around the CMIS Connection object and lets the Content Model and Document Types drive their associated Data Model mappings.

Import and Export Behaviors are configurable via:

  • The Export Activity
  • Content Models or Document Types

Introducing... Data Rules!

  • Data Rule
  • Apply Rules and Convert Data activity types

Introducing... API!

Data Extraction Improvements

  • Constrained Wrap for easier pattern matching for data constrained in a box (think table cells).

Install and Setup Changes


Miscellaneous

  • Changes to Content Action
  • Document Viewer improvements
  • Text file improvements