2023.1:EPI Separation (Separation Provider): Difference between revisions
Dgreenwood (talk | contribs) No edit summary |
Dgreenwood (talk | contribs) No edit summary |
||
Line 1: | Line 1: | ||
{{stubs}} | {{stubs}} | ||
<section begin=" | <section begin="glossary" /> | ||
<blockquote> | <blockquote> | ||
The '''''EPI Separation''''' provider uses embedded page information ("EPI") to separate loose pages into document folders. | The '''''EPI Separation''''' provider uses embedded page information ("EPI") to separate loose pages into document folders. | ||
</blockquote> | </blockquote> | ||
<section end=" | <section end="glossary" /> | ||
For this '''''Separation Proivder''''', a [[Data Extractor]] is used to find page numbers from the text on a page (i.e. "Page 1 of 10"). The extractor must also define two groups "PageNo" and "PageCount" in its regular expression pattern. The pattern "Page (?<PageNo>\d+) of (?<PageCount>\d+)" would group the "1" and "10" of our earlier example properly). If the value of PageNo is 1, a new folder is created. As long as each subsequent page's PageNo value follows in sequence, they are included in the folder. If the page is out of sequence (or the extractor fails to produce a result), it is left as a loose page. | For this '''''Separation Proivder''''', a [[Data Extractor]] is used to find page numbers from the text on a page (i.e. "Page 1 of 10"). The extractor must also define two groups "PageNo" and "PageCount" in its regular expression pattern. The pattern "Page (?<PageNo>\d+) of (?<PageCount>\d+)" would group the "1" and "10" of our earlier example properly). If the value of PageNo is 1, a new folder is created. As long as each subsequent page's PageNo value follows in sequence, they are included in the folder. If the page is out of sequence (or the extractor fails to produce a result), it is left as a loose page. | ||
[[Category:Articles]] | [[Category:Articles]] |
Revision as of 13:22, 9 January 2024
STUB |
This article is a stub. It contains minimal information on the topic and should be expanded. |
The EPI Separation provider uses embedded page information ("EPI") to separate loose pages into document folders.
For this Separation Proivder, a Data Extractor is used to find page numbers from the text on a page (i.e. "Page 1 of 10"). The extractor must also define two groups "PageNo" and "PageCount" in its regular expression pattern. The pattern "Page (?<PageNo>\d+) of (?<PageCount>\d+)" would group the "1" and "10" of our earlier example properly). If the value of PageNo is 1, a new folder is created. As long as each subsequent page's PageNo value follows in sequence, they are included in the folder. If the page is out of sequence (or the extractor fails to produce a result), it is left as a loose page.