Nested Table (Section Extract Method): Difference between revisions

Revision as of 11:06, 10 May 2024

Nested Table is a insert_page_break Data Section Extract Method. This method divides a document into sections by extracting table data within those sections. This gives Grooper users a method for extracting hierarchical tables as well as dividing up a document into sections where each of those sections have the same table (or at least tabular data which can be extracted by a single table Data Table object).

The Nested Table method is reliant on the Labeling Behavior functionality (AKA "Label Sets").

For more information on the Nested Table method and its configuration, visit the section of the Labeling Behavior article linked below.

The Nested Table Method

For more information on Label Sets in general, you can find the full Labeling Behavior article linked below.

Labeling Behavior

Glossary

Data Section: A insert_page_break Data Section is a container for Data Elements in a data_table Data Model. variables They can contain Data Fields, table Data Tables, and even Data Sections as child nodes and add hierarchy to a Data Model. They serve two main purposes:

They can simply act as organizational buckets for Data Elements in larger Data Models.
By configuring its "Extract Method", a Data Section can subdivide larger and more complex documents into smaller parts to assist in extraction.
- "Single Instance" sections define a division (or "record") that appears only once on a document.
- "Multi-Instance" sections define collection of repeating divisions (or "records").

Data Table: A table Data Table is a Data Element specialized in extracting tabular data from documents (i.e. data formatted in rows and columns).

The Data Table itself defines the "Table Extract Method". This is configured to determine the logic used to locate and return the table's rows.
The table's columns are defined by adding view_column Data Column nodes to the Data Table (as its children).

Extract: export_notes Extract is an Activity that retrieves information from folder Batch Folder documents, as defined by Data Elements in a data_table Data Model. This is how Grooper locates unstructured data on your documents and collects it in a structured, usable format.

Labeling Behavior: A Labeling Behavior extends "label set" functionality to description Document Types. This allows you to collect field labels and other labels present on a document and use them in a variety of ways. This includes functionality for classification, field extraction, table extraction, and section extraction.

Nested Table: Nested Table is a insert_page_break Data Section Extract Method. This method divides a document into sections by extracting table data within those sections. This gives Grooper users a method for extracting hierarchical tables as well as dividing up a document into sections where each of those sections have the same table (or at least tabular data which can be extracted by a single table Data Table object).

Section Extract Method: The Extract Method property of a insert_page_break Data Section defines a "Section Extract Method" which specifies how section instances will be identified and extracted.

@@ Line 9: / Line 9: @@
 * [[Labeling Behavior]]
-[[Category:Articles]]
+== Glossary ==
+<u><big>'''Data Section'''</big></u>: {{#lst:Glossary|Data Section}}
+<u><big>'''Data Table'''</big></u>: {{#lst:Glossary|Data Table}}
+<u><big>'''Extract'''</big></u>: {{#lst:Glossary|Extract}}
+<u><big>'''Labeling Behavior'''</big></u>: {{#lst:Glossary|Labeling Behavior}}
+<u><big>'''Nested Table'''</big></u>: {{#lst:Glossary|Nested Table}}
+<u><big>'''Section Extract Method'''</big></u>: {{#lst:Glossary|Section Extract Method}}