Glossary: Difference between revisions
added clip frames // via Wikitext Extension for VSCode |
completed Activity section // via Wikitext Extension for VSCode |
||
| Line 2: | Line 2: | ||
<section begin="Activity" /> | <section begin="Activity" /> | ||
'''''[[Activity (Property)|Activity]]''''' is a property on [[image:GrooperIcon_BatchProcessStep.png]] '''[[Batch Process Step]]''' objects. '''''Activities''''' define specific document processing operations done to a [[image:GrooperIcon_Batch.png]] '''[[Batch]]''', [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder]]''', or [[image:GrooperIcon_BatchPage.png]] '''[[Batch Page]]'''. | '''''[[Activity (Property)|Activity]]''''' is a property on [[image:GrooperIcon_BatchProcessStep.png]] '''[[Batch Process Step]]''' objects. '''''Activities''''' define specific document processing operations done to a [[image:GrooperIcon_Batch.png]] '''[[Batch]]''', [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder]]''', or [[image:GrooperIcon_BatchPage.png]] '''[[Batch Page]]'''. | ||
'''Batch Process Steps''' configured with specific '''''Activities''''' are frequently referred to by the name of the '''''Activity''''' followed by the word "step". For example: '''Classify Step'''. | |||
<section end="Activity" /> | <section end="Activity" /> | ||
| Line 11: | Line 13: | ||
=== Clip Frames === | === Clip Frames === | ||
<section begin="Clip Frames" /> | <section begin="Clip Frames" /> | ||
The '''''[[Clip Frames (Activity)|Clip Frames]]''''' '''''[[Activity (Property)|Activity]]'' | The '''''[[Clip Frames (Activity)|Clip Frames]]''''' '''''[[Activity (Property)|Activity]]''''' extracts defined areas from [https://en.wikipedia.org/wiki/Microform microfiche] card images, creating new image frames or layers for focused analysis or processing. | ||
<section end="Clip Frames" /> | <section end="Clip Frames" /> | ||
=== Detect Frames === | === Detect Frames === | ||
<section begin="Detect Frames" /> | <section begin="Detect Frames" /> | ||
The '''''[[Detect Frames (Activity)|Detect Frames]]''''' '''''[[Activity (Property)|Activity]]''''' locates and identifies frame lines on [https://en.wikipedia.org/wiki/Microform microfiche] card images, enabling the isolation of areas within the frames for further data extraction or processing. | |||
<section end="Detect Frames" /> | <section end="Detect Frames" /> | ||
=== Execute === | === Execute === | ||
<section begin="Execute" /> | <section begin="Execute" /> | ||
The '''''[[Execute (Activity)|Execute]]''''' '''''[[Activity (Property)|Activity]]''''' runs a specified child command, allowing for the modular and controlled execution of tasks within a larger automated workflow. | |||
<section end="Execute" /> | <section end="Execute" /> | ||
=== Export === | === Export === | ||
<section begin="Export" /> | <section begin="Export" /> | ||
The '''''[[Export (Activity)|Export]]''''' '''''[[Activity (Property)|Activity]]''''' facilitates the transfer of documents and extracted information to external systems or formats, completing the data processing workflow. | |||
<section end="Export" /> | <section end="Export" /> | ||
=== Extract === | === Extract === | ||
<section begin="Extract" /> | <section begin="Extract" /> | ||
The '''''[[Extract (Activity)|Extract]]''''' '''''[[Activity (Property)|Activity]]''''' retrieves relevant information, defined by '''[[Data Element (Concept)|Data Elements]]''', from [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder|Batch Folders]]''', transforming unstructured or semi-structured content into structured, usable data. | |||
<section end="Extract" /> | <section end="Extract" /> | ||
=== Image Processing === | === Image Processing === | ||
<section begin="Image Processing" /> | <section begin="Image Processing" /> | ||
The '''''[[Image Processing (Activity)|Image Processing]]''''' '''''[[Activity (Property)|Activity]]''''' enhances and optimizes [[image:GrooperIcon_BatchPage.png]] '''[[Batch Page|Batch Pages]]''' for better recognition and data extraction results. | |||
<section end="Image Processing" /> | <section end="Image Processing" /> | ||
=== Initialize Card === | === Initialize Card === | ||
<section begin="Initialize Card" /> | <section begin="Initialize Card" /> | ||
The '''''[[Initialize Card (Activity)|Initialize Card]]''''' '''''[[Activity (Property)|Activity]]''''' prepares and configures [https://en.wikipedia.org/wiki/Microform microfiche] card images for further processing. | |||
<section end="Initialize Card" /> | <section end="Initialize Card" /> | ||
=== Recognize === | === Recognize === | ||
<section begin="Recognize" /> | <section begin="Recognize" /> | ||
The '''''[[Recognize (Activity)|Recognize]]''''' '''''[[Activity (Property)|Activity]]''''' interprets [[image:GrooperIcon_BatchPage.png]] '''[[Batch Page|Batch Pages]]''' and [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder|Batch Folders]]''', converting them into machine-readable text and capturing layout data for comprehensive analysis and data extraction. This will attach a text and/or layoutData file to the respective object. | |||
<section end="Recognize" /> | <section end="Recognize" /> | ||
=== Render === | === Render === | ||
<section begin="Render" /> | <section begin="Render" /> | ||
The '''''[[Render (Activity)|Render]]''''' '''''[[Activity (Property)|Activity]]''''' normalizes electronic document content from file formats '''Grooper''' cannot read innately to a [https://en.wikipedia.org/wiki/PDF PDF format]. This allows '''Grooper''' to extract the text via the '''''[[Recognize (Activity)|Recognize]]''''' '''''[[Activity (Property)|Activity]]'''''. | |||
<section end="Render" /> | <section end="Render" /> | ||
=== Review === | === Review === | ||
<section begin="Review" /> | <section begin="Review" /> | ||
The '''''[[Review (Activity)|Review]]''''' '''''[[Activity (Property)|Activity]]''''' facilitates human evaluation and validation of processed [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder|Batch Folders]]''' and extracted data for accuracy and completeness. | |||
<section end="Review" /> | <section end="Review" /> | ||
=== Send Mail === | === Send Mail === | ||
<section begin="Send Mail" /> | <section begin="Send Mail" /> | ||
The '''''[[Send Mail (Activity)|Send Mail]]''''' '''''[[Activity (Property)|Activity]]''''' automates the dispatch of emails with or without attachments, based on workflow events and conditions. | |||
<section end="Send Mail" /> | <section end="Send Mail" /> | ||
=== Separate === | === Separate === | ||
<section begin="Separate" /> | <section begin="Separate" /> | ||
The '''''[[Separate (Activity)|Separate]]''''' '''''[[Activity (Property)|Activity]]''''' sorts [[image:GrooperIcon_BatchPage.png]] '''[[Batch Page|Batch Pages]]''' into individual [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder|Batch Folders]]''', distinguishing them for independent processing and organization. | |||
<section end="Separate" /> | <section end="Separate" /> | ||
=== Split Pages === | === Split Pages === | ||
<section begin="Split Pages" /> | <section begin="Split Pages" /> | ||
Multi-page documents (typically [https://en.wikipedia.org/wiki/PDF PDFs] and [https://en.wikipedia.org/wiki/TIFF TIFFs]) come into '''Grooper''' represented as single [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder|Batch Folders]]'''. The '''''[[Split Pages (Activity)|Split Pages]]''''' '''''[[Activity (Property)|Activity]]''''' exposes [[image:GrooperIcon_BatchPage.png]] '''[[Batch Page|Batch Pages]]''' as child objects of the [[image:GrooperIcon_BatchFolder.png]] '''[[Batch Folder|Batch Folders]]''' for individualized processing and handling. | |||
<section end="Split Pages" /> | <section end="Split Pages" /> | ||
=== XML Transform === | === XML Transform === | ||
<section begin="XML Transform" /> | <section begin="XML Transform" /> | ||
The '''''[[XML Transform (Activity)|XML Transform]]''''' '''''[[Activity (Property)|Activity]]''''' applies [https://en.wikipedia.org/wiki/XSLT XSLT] stylesheets to [https://en.wikipedia.org/wiki/XML XML] data to modify or reformat the output structure for various purposes. | |||
<section end="XML Transform" /> | <section end="XML Transform" /> | ||
Revision as of 13:27, 19 April 2024
Activity
Activity is a property on Batch Process Step objects. Activities define specific document processing operations done to a Batch, Batch Folder, or Batch Page.
Batch Process Steps configured with specific Activities are frequently referred to by the name of the Activity followed by the word "step". For example: Classify Step.
Classify
Classify is an Activity that "classifies" Batch Folders in a Batch by assigning them a Content Type using patterns, lexical understanding, or rules as defined by a Content Model.
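As a rough illustration of the rules-based idea only, the sketch below assigns a type name to a document's text by testing it against patterns. It is not Grooper's implementation or API: the `RULES` table, the type names, the patterns, and the `classify` function are all hypothetical and exist only for this example.

```python
import re

# Hypothetical rules: Content Type name -> pattern that identifies it.
# Illustrative only; these are not taken from any real Content Model.
RULES = {
    "Invoice": re.compile(r"\binvoice\s+(number|no\.?)", re.IGNORECASE),
    "Purchase Order": re.compile(r"\bpurchase\s+order\b", re.IGNORECASE),
}


def classify(text, rules=RULES, default="Unclassified"):
    """Return the first type name whose pattern matches the text."""
    for content_type, pattern in rules.items():
        if pattern.search(text):
            return content_type
    # No rule matched, so fall back to the default label.
    return default
```

Rules are tested in insertion order, so the first matching rule wins. A real classifier would typically also weigh confidence rather than stop at the first hit.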
Clip Frames
The Clip Frames Activity extracts defined areas from microfiche card images, creating new image frames or layers for focused analysis or processing.
Detect Frames
The Detect Frames Activity locates and identifies frame lines on microfiche card images, enabling the isolation of areas within the frames for further data extraction or processing.
Execute
The Execute Activity runs a specified child command, allowing for the modular and controlled execution of tasks within a larger automated workflow.
Export
The Export Activity facilitates the transfer of documents and extracted information to external systems or formats, completing the data processing workflow.
Extract
The Extract Activity retrieves relevant information, defined by Data Elements, from Batch Folders, transforming unstructured or semi-structured content into structured, usable data.
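The general idea of turning unstructured text into structured data can be sketched as follows. This is a minimal conceptual example, not how Grooper defines or runs Data Elements: the `DATA_ELEMENTS` table, its field names and patterns, and the `extract` function are assumptions made for illustration.

```python
import re

# Hypothetical field definitions: field name -> pattern with one capture group.
# Illustrative only; these are not real Grooper Data Elements.
DATA_ELEMENTS = {
    "invoice_number": re.compile(r"Invoice\s+No\.?\s*:?\s*(\w+)", re.IGNORECASE),
    "total": re.compile(r"Total\s*:?\s*\$?([\d,]+\.\d{2})", re.IGNORECASE),
}


def extract(text, elements=DATA_ELEMENTS):
    """Return a dict of field name -> captured value (None when not found)."""
    result = {}
    for name, pattern in elements.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result
```

Fields that are not found map to `None`, so the output always has the same keys and can be checked for completeness downstream.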
Image Processing
The Image Processing Activity enhances and optimizes Batch Pages for better recognition and data extraction results.
Initialize Card
The Initialize Card Activity prepares and configures microfiche card images for further processing.
Recognize
The Recognize Activity interprets Batch Pages and Batch Folders, converting them into machine-readable text and capturing layout data for comprehensive analysis and data extraction. This attaches a text file, a layoutData file, or both to the respective object.
Render
The Render Activity normalizes electronic document content from file formats Grooper cannot read natively to PDF. This allows Grooper to extract the text via the Recognize Activity.
Review
The Review Activity facilitates human evaluation and validation of processed Batch Folders and extracted data for accuracy and completeness.
Send Mail
The Send Mail Activity automates the dispatch of emails with or without attachments, based on workflow events and conditions.
Separate
The Separate Activity sorts Batch Pages into individual Batch Folders, distinguishing them for independent processing and organization.
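Conceptually, separation groups a flat run of pages into folders, starting a new folder wherever a page is recognized as the first page of a document. The sketch below shows that grouping under stated assumptions: the `separate` function and the `is_first_page` callback are hypothetical and do not represent Grooper's Separation Providers.

```python
def separate(pages, is_first_page):
    """Group a flat list of pages into folders (lists of pages).

    A new folder starts whenever is_first_page(page) is true.
    Pages that appear before any detected first page go into an initial folder.
    """
    folders = []
    for page in pages:
        if is_first_page(page) or not folders:
            folders.append([])
        folders[-1].append(page)
    return folders
```

For example, calling `separate(pages, lambda p: p.startswith("Invoice"))` on a list of page texts starts a new folder at every page whose text begins with "Invoice".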
Split Pages
Multi-page documents (typically PDFs and TIFFs) come into Grooper represented as single Batch Folders. The Split Pages Activity exposes Batch Pages as child objects of the Batch Folders for individualized processing and handling.
XML Transform
The XML Transform Activity applies XSLT stylesheets to XML data to modify or reformat the output structure for various purposes.
Behavior
Export Behavior
Labeling Behavior
PDF Data Mapping
CMIS Connection Type
AppXtender
Box
Exchange
FTP
IMAP
NTFS
OneDrive
SFTP
Classification Method
Labelset-Based
Lexical
Rules-Based
Visual
Collation Provider
Collation Provider
AND
Array
Combine
Key-Value List
Key-Value Pair
Ordered Array
Pattern-Based
Split
Concept
Activity Processing
Asset Management
Backup and Restore Grooper Repository
CMIS+
CMIS
CMIS Query
CSS Data Viewer Styling
Classification
Code Expressions
Combined Methods
Content Type
Data Context
Data Element
Data Extractor
Data Instance
Desktop Scanning in Grooper
Download or Upload Grooper Objects
EDI Integration
Expressions
Expressions Cookbook
Field Mapping
Five Phases of Grooper
Flow Collation
Fuzzy RegEx
GPT Integration
Grooper Azure AD Connector
Grooper Infrastructure
Grooper Repository
Grooper Service
Image Processing
Import Mode and Document Linking
Import or Export Grooper Objects
LINQ to Grooper Objects
Layered OCR
Layout Data
License Activation
Microfiche Processing
Microsoft Office Integration
OCR
OCR Synthesis
Object Nomenclature
Overrides
PDF Page Types
Regular Expression
Repository
Separation
TF-IDF
Table Extraction
Test Batch
Thread
Training-Based Approaches to Document Classification
Training Batch
UNC Path
URL Endpoints for Review
Waterfall Classification
XML Schema Integration
Export Type
CMIS Export
Data Export
Extractor Type
Detect Signature
Find Barcode
Highlight Zone
Labeled OMR
Labeled Value
List Match
Ordered OMR
Pattern Match
Read Barcode
Read Zone
Word Match
Zonal OMR
IP Command
Barcode Detection
Binarize
Extract Page
Line Removal
Scratch Removal
Shape Detection
Shape Removal
Import Provider
CMIS Import
Import Descendants
Import Query Results
Lookup
CMIS Lookup
Database Lookup
Web Service Lookup
Object
Batch
Batch Folder
Batch Page
Batch Process
CMIS Connection
CMIS Repository
Content Category
Content Model
Data Connection
Data Field
Data Model
Data Rule
Data Section
Data Table
Data Type
Document Type
Field Class
File Store
Form Type
IP Profile
Lexicon
Machine
OCR Profile
Object Library
Page Type
Processing Queue
Project
Review Queue
Scanner Profile
Separation Profile
Value Reader
Property
Confidence Multiplier and Output Confidence
Constrained Wrap
Content Type Filter
OCR Engine
Output Extractor Key
Paragraph Marking
Permission Sets
Scope
Secondary Types
Tab Marking
Vertical Wrap
Section Extract Method
Nested Table
Transaction Detection
Separation Provider
Separation Provider
Change in Value Separation
Control Sheet Separation
EPI Separation
ESP Auto Separation
Event-Based Separation
Multi Separator
Pattern-Based Separation
Undo Separation
Service
API Services
Activity Processing
Grooper Licensing
Table Extract Method
Delimited Extract
Fluid Layout
Grid Layout
Row Match
Tabular Layout
UI Element
Document Viewer
Node Tree
Summary Tabs