2023:List Match (Value Extractor): Difference between revisions
No edit summary |
No edit summary |
||
| Line 32: | Line 32: | ||
{|cellpadding=10 cellspacing=5 | {|cellpadding=10 cellspacing=5 | ||
|valign=top style="width:40%"| | |valign=top style="width:40%"| | ||
====Configuring on a Value Reader==== | |||
|- | |||
|valign=top style="width:40%"| | |||
# In your '''Node Tree''', create or select a '''Value Reader'''. | # In your '''Node Tree''', create or select a '''Value Reader'''. | ||
#* Visit the [[Value Reader]] Wiki Page for instructions on how to create a '''Value Reader'''. | #* Visit the [[Value Reader]] Wiki Page for instructions on how to create a '''Value Reader'''. | ||
| Line 46: | Line 52: | ||
{|cellpadding=10 cellspacing=5 | {|cellpadding=10 cellspacing=5 | ||
|valign=top style="width:40%"| | |valign=top style="width:40%"| | ||
====Configuring on a Data Type==== | |||
|- | |||
|valign=top style="width:40%"| | |||
# In your '''Node Tree''', create or select a '''Data Type'''. | # In your '''Node Tree''', create or select a '''Data Type'''. | ||
#* Visit the [[Data Type]] Wiki Page for instructions on how to create a '''Data Type'''. | #* Visit the [[Data Type]] Wiki Page for instructions on how to create a '''Data Type'''. | ||
| Line 68: | Line 80: | ||
{|cellpadding=10 cellspacing=5 | {|cellpadding=10 cellspacing=5 | ||
|valign=top style="width:40%"| | |valign=top style="width:40%"| | ||
====Configuring Local Entries==== | |||
|- | |- | ||
|valign=top style="width:40%"| | |valign=top style="width:40%"| | ||
# For '''Value Readers''', select the object you wish to configure and click "Tester" tab. | # For '''Value Readers''', select the object you wish to configure and click "Tester" tab. | ||
#* When configuring a '''Data Type''', first click the ellipsis button at the end of the '''''Local Extractor''''' property with ''List Match'' selected to bring up the editing window. | #* When configuring a '''Data Type''', first click the ellipsis button at the end of the '''''Local Extractor''''' property with ''List Match'' selected to bring up the editing window. | ||
| Line 96: | Line 111: | ||
{|cellpadding=10 cellspacing=5 | {|cellpadding=10 cellspacing=5 | ||
|valign=top style="width:40%"| | |valign=top style="width:40%"| | ||
====Referencing Lexicons==== | |||
|- | |- | ||
Revision as of 09:37, 27 January 2023
A List Match is an extractor type that can be used when configuring several data extraction tools such as a Value Reader or Data Type. It is designed to return values matching one or more items in a defined list. By default, the List Match extractor does not use or require regular expressions (regex).
About
The List Match is one of the the simplest extractors used in Grooper. It is designed to return values matching one or more items in a defined list. This can be used to extract numbers, specific words, or full phrases contained within a document. A List Match extractor returns an exact match including any spaces, numbers, punctuation, or special characters.
To configure a List Match, you can input the desired extracted value as a Local Entry or reference a pre-configured Lexicon.
Unlike a Pattern Match, the List Match extractor does not use or require regular expressions by default, but regex can be enabled in the properties menu. Similar to a Pattern Match, Suffix and Prefix Patterns can be added to help anchor the expression and limit the amount of false positives extracted.
How To
A List Match is most commonly used when configuring objects such as Value Readers or Data Types. It is great for extracting text information such as:
- Specific company names
- Field labels
- Headers and Footers
- Full phrases
- Exact numbers
If the information you need to extract follows a specific pattern, such as a date or social security number, then it may be better to consider a different extractor like a Pattern Match.
Configuring by Object Type
Configuring on a Value Reader | |
|
Configuring on a Data Type | |
|
Local Entries vs Lexicons
A List Match can be configured using a Local Entry or a Lexicon. Local Entries are simple and easy to set up, especially if you only need to add a few entries. If you plan to extract a large number of items from a list or plan on building multiple extractors using the same list, it might be more efficient to set up a Lexicon to reference first.
Configuring Local Entries | |
|
|
|
Referencing Lexicons | |
|
|
|
|
|
|





