2023.1:Pattern-Based Separation (Separation Provider)
Revision as of 16:28, 8 January 2024
STUB: This article is a stub. It contains minimal information on the topic and should be expanded.
Pattern-Based Separation is a Separation Provider that creates a new document folder each time a defined regex pattern returns a value on a page.
A Data Extractor is used to find a value on each page. When the extractor returns a result on a page, that page is placed in a new folder, creating a new document. If the extractor returns no result on the following page, that page is added behind the previous page in the same folder. As soon as the extractor returns a result on a later page, that page starts another new folder and therefore another new document, even if the result is identical to the one that started the previous document.
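The separation rule described above can be sketched in a few lines of Python. This is only an illustration of the logic, not the product's implementation: the function name `separate_pages`, the representation of pages as plain text strings, and the sample pattern are all hypothetical. The article does not say what happens to pages that come before the first match, so the sketch assumes they are grouped into an initial folder.

```python
import re
from typing import List


def separate_pages(pages: List[str], pattern: str) -> List[List[int]]:
    """Group page indices into folders, starting a new folder on every match.

    Hypothetical helper for illustration only.
    """
    regex = re.compile(pattern)
    folders: List[List[int]] = []
    for index, text in enumerate(pages):
        # A match always starts a new folder, even if the matched value
        # is the same as the one that started the previous folder.
        # Assumption: pages before the first match open an initial folder.
        if regex.search(text) or not folders:
            folders.append([index])
        else:
            # No match: the page goes behind the previous page.
            folders[-1].append(index)
    return folders


pages = [
    "Invoice No: 1001 ...",
    "continued line items",
    "Invoice No: 1001 ...",
    "Invoice No: 1002 ...",
    "terms and conditions",
]
folders = separate_pages(pages, r"Invoice No: \d+")
print(folders)  # [[0, 1], [2], [3, 4]]
```

In this example the third page repeats the value found on the first page, yet it still begins a new folder, which matches the behavior described above.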