Rules-Based Approach: Difference between revisions

From Grooper Wiki
No edit summary
Tag: Redirect target changed
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
This approach uses [[Data Extractor]]s to find key words, phrases, or other text-based information in order to identify and classify a document (assigning a '''[[Document Type]]''' to a document).  For example, a document with a centered header of "Purchase Report" might be classified as a "Purchase Report" '''Document Type''' with this approach.  One could build a [[Data Type]] extractor using regular expression to match the phrase "Purchase Report" centered at the top of a document to identify it.  Once set on the '''Document Type''', if the extractor returned a result on a document, it would be classified as a "Purchase Report" '''Document Type'''.
#REDIRECT [[Rules-Based (Classify Method)]]
 
These "rules" are set using the '''''Positive Extractor''''' and '''''Negative Extractor''''' properties of a '''Document Type''' object in a '''[[Content Model]]'''

Latest revision as of 10:39, 25 June 2025