2023:List Match (Value Extractor): Difference between revisions

From Grooper Wiki
No edit summary
No edit summary
Line 25: Line 25:
==How To==
==How To==


<tabs>


<tab Name="Configuring With Local Entries">
# In your '''Node Tree''', right click and create the desired item <!--What is this called?--> such as a '''Data Type''' or '''Value Reader'''.
# Select the item created <!--Need to find out what these are called--> to bring up its configuration properties.
# In the ''Value Reader'' tab, click the drop down list next to ''Extractor'' and select '''''List Match'''''.
# Save and select the ''Tester'' tab and make sure the ''Expressions'' sub-tab is selected.
# Under '''''LOCAL ENTRIES''''', type the desired text to be extracted.
#* If needed, add a '''''Prefix''''' and '''''Suffix Pattern''''' to anchor your extraction.
# Save and test your extraction. 
</tab>


<tab Name="Referencing Lexicons">
# In your '''Node Tree''', right click and create the desired item <!--What is this called?--> such as a '''Data Type''' or '''Value Reader'''.
# Select the item created <!--Need to find out what these are called--> to bring up its configuration properties.
# In the ''Value Reader'' tab, click the drop down list next to ''Extractor'' and select '''''List Match'''''.
# Save and select the ''Tester'' tab and make sure the ''Properties'' sub-tab is selected.
# Click the arrow next to ''Vocabulary'' to access additional properties.
# Click the ellipsis button next to the ''Included Lexicons'' property. This should open a new window where you can add pre-configured Lexicons.
# In the new window, click through the '''''Projects''''' and '''''Folders''''' until you find the desired Lexicon. Click the check boxes next to the desired Lexicons.
# Click ''OK'' to apply the Lexicon.
# Save and test your extraction.
</tab>
<tab name="Properties of a List Match">
</tab>
</tabs>


==See Also==
==See Also==


* [[Value Reader]]
* [[Value Reader]]
<!--(Just putting this here because I wrote this for About and then felt like it maybe didn't belong in about, but I want to keep it anyway):
A '''List Match''' can reference '''Lexicons''' to aid in extraction. This can save the user time and energy when extracting the same information across multiple data extraction tools. Instead of typing out the full desired list for each value reader or data type, one '''Lexicon''' can be configured and referenced multiple times across multiple extractors.

Revision as of 11:21, 17 January 2023

WIP

This article is a work-in-progress or created as a placeholder for testing purposes. This article is subject to change and/or expansion. It may be incomplete, inaccurate, or stop abruptly.

This tag will be removed upon draft completion.

A List Match is an extractor type that can be used when configuring several data extraction tools such as a Value Reader or Data Type. It is designed to return values matching one or more items in a defined list. The List Match extractor does not use or require regular expressions (regex) by default.

About

The List Match is one of the the simplest extractors used in Grooper. It is designed to return values matching one or more items in a defined list. This can be used to extract numbers, specific words, or full phrases contained within a document. A List Match extractor returns an exact match including any spaces, numbers, punctuation, or special characters.

To configure a List Match, you can input the desired extracted value as a Local Entry or reference a pre-configured Lexicon.

Unlike a Pattern Match, the List Match extractor does not use or require regular expressions by default, but regex can be enabled in the properties menu. Similar to a Pattern Match, Suffix and Prefix Patterns can be added to help anchor the expression and limit the amount of false positives extracted.

How To

  1. In your Node Tree, right click and create the desired item such as a Data Type or Value Reader.
  2. Select the item created to bring up its configuration properties.
  3. In the Value Reader tab, click the drop down list next to Extractor and select List Match.
  4. Save and select the Tester tab and make sure the Expressions sub-tab is selected.
  5. Under LOCAL ENTRIES, type the desired text to be extracted.
    • If needed, add a Prefix and Suffix Pattern to anchor your extraction.
  6. Save and test your extraction.
  1. In your Node Tree, right click and create the desired item such as a Data Type or Value Reader.
  2. Select the item created to bring up its configuration properties.
  3. In the Value Reader tab, click the drop down list next to Extractor and select List Match.
  4. Save and select the Tester tab and make sure the Properties sub-tab is selected.
  5. Click the arrow next to Vocabulary to access additional properties.
  6. Click the ellipsis button next to the Included Lexicons property. This should open a new window where you can add pre-configured Lexicons.
  7. In the new window, click through the Projects and Folders until you find the desired Lexicon. Click the check boxes next to the desired Lexicons.
  8. Click OK to apply the Lexicon.
  9. Save and test your extraction.

See Also