Text Preprocessor

From Grooper Wiki
Revision as of 15:58, 26 August 2025 by Dgreenwood (talk | contribs) (Created page with "Grooper's "Text Preprocessor" adjusts how raw text is formatted before extraction. It manipulates control characters (such as CR/LF pairs) to allow regular expression patterns to match (or ignore) structural elements, such as line breaks, paragraph boundaries and tab markers. The Text Preprocessor executes the following: * Paragraph Marking * Tab Marking * Vertical Tab Marking * Ignore Control Characters")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Grooper's "Text Preprocessor" adjusts how raw text is formatted before extraction. It manipulates control characters (such as CR/LF pairs) to allow regular expression patterns to match (or ignore) structural elements, such as line breaks, paragraph boundaries and tab markers. The Text Preprocessor executes the following: