Main Page

Getting Started

Grooper is a software application that helps organizations innovate workflows by integrating difficult data.

Grooper empowers rapid innovation for organizations processing and integrating large quantities of difficult data. Created by a team of courageous developers frustrated by limitations in existing solutions, Grooper is an intelligent document and digital data integration platform. Grooper combines patented and sophisticated image processing, capture technology, machine learning, and natural language processing. Grooper – intelligent document processing; limitless, template-free data integration.

Getting Started
Install and Setup
2.80 Reference Documentation


Featured Articles Did you know?
This is an example of the Separation Review Attended Client interface.

Separation and Separation Review

Grooper uses various approaches and algorithms to determine the classification of a page or folder. The settings on a Content Model and Document Type add to the complexity of separating pages into documents. Grooper Version 2.9 builds on the Separation settings found on Document Types, including the ability to adjust the Training Scope and configure a Secondary Page Extractor.

Adjusting the Training Scope improves the accuracy and performance of ESP Auto Separation by focusing on what is important when it comes time to separate and classify Unstructured paginated documents.

For example, the Normal mode creates a single FormType and divides trained examples into "First", "Middle" and "Last" pages. From one document to the next, the most meaningful features are often found on the first and last pages, with more variance on the pages in between. The previous approach created an individual FormType for each trained example, each with its own "Page X of X" PageType objects. Normal mode instead unifies all trained examples into a single FormType, making the training and classification of these documents simpler and more efficient.

The FirstLast mode assumes meaningful features for classification are found only on the first and last pages, with the middle pages containing no information needed to make a separation or classification decision. With this mode enabled, only trained examples of the first and last pages and their associated features are saved. This can improve processing time by removing the features of the middle pages from consideration.

The FirstOnly mode narrows this scope even further by storing features only from the first page of trained documents.
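The three Training Scope modes described above can be pictured with a small Python sketch. This is only an illustration of which page positions would have their features kept under each mode. It is not Grooper's implementation or API: the names `TrainingScope`, `page_roles` and `stored_pages` are hypothetical, and treating a single-page document as a "First" page is an assumption made for the example.

```python
from enum import Enum


class TrainingScope(Enum):
    """Hypothetical stand-in for the Training Scope setting (not Grooper's API)."""
    NORMAL = "Normal"
    FIRST_LAST = "FirstLast"
    FIRST_ONLY = "FirstOnly"


# Which page roles have their features kept under each mode, per the text above.
KEPT_ROLES = {
    TrainingScope.NORMAL: {"First", "Middle", "Last"},
    TrainingScope.FIRST_LAST: {"First", "Last"},
    TrainingScope.FIRST_ONLY: {"First"},
}


def page_roles(page_count):
    """Label each page of a trained example as First, Middle, or Last."""
    if page_count <= 0:
        return []
    roles = ["Middle"] * page_count
    roles[-1] = "Last"
    roles[0] = "First"  # assumption: a single-page document counts as First
    return roles


def stored_pages(page_count, scope):
    """Return (page_number, role) pairs whose features would be kept."""
    kept = KEPT_ROLES[scope]
    return [
        (number, role)
        for number, role in enumerate(page_roles(page_count), start=1)
        if role in kept
    ]
```

For a four-page trained example, `stored_pages(4, TrainingScope.FIRST_LAST)` keeps only pages 1 and 4, while `TrainingScope.NORMAL` keeps all four pages and labels pages 2 and 3 as "Middle".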

Furthermore, ESP Auto Separation reduces, but does not eliminate, much of the manual work needed to separate and classify documents. Separation Review is a new review module designed to make the remaining manual work quick and easy.

For more information on Separation and Separation Review, visit the full article here

The Separation Review module was added to improve review of complicated document sets separated and classified by ESP Auto Separation. Because this Separation Provider separates using page-based classification, it can be important to see how it made the decision to separate or not separate a document on a page-by-page basis. The Classify Review module presents the reviewer with pages already placed in document folders, which makes it cumbersome to review the page-by-page separation.

Enter Separation Review. This review module is modeled on our ESP Auto Separation Tester, which Grooper Architects use to test the separation and classification of documents using ESP Separation. This viewer gives you a much broader look at the individual pages, allowing an easier (and ultimately quicker and more efficient) view of the separation logic applied to the batch.

There are further quality-of-life improvements for the Separation Review module, making the process of reviewing documents separated and classified by ESP Auto Separation simpler, faster, and more satisfying.

New in 2.9 Featured Use Case

Welcome to Grooper 2.9!
Below you will find helpful links to all the articles about the new/changed functionality in this version of Grooper.

  • Compile Stats
  • Microsoft Office Integration
  • Document Viewer
  • Separation and Separation Review
  • Data Review
  • Confidence Multiplier
  • Data Element Overrides
  • Database Export
  • CMIS Lookup
  • Content Type Filter
  • Output Extractor Key
  • Box (CMIS Binding)
  • LINQ to Grooper Objects

They’re Saving Over 5,000 Hours Every Year in Data Discovery and Processing


American Airlines Credit Union has transformed its data workflows, quickly saving thousands of hours in electronic data discovery and gaining much greater efficiency and improved member services.

Discover how they:

  • Quickly found 40,000 specific files among one billion
  • Easily integrated with data silos and content management systems when no other solution would
  • Have cut their mortgage processing time in half (and they process mortgages for 47 branch offices!)
  • Learn from the document and electronic data discovery experts at BIS!

You can access the full case study by clicking this link.


Other Resources