2.90:Tab Marking (Property): Difference between revisions
Dgreenwood (talk | contribs) No edit summary |
Dgreenwood (talk | contribs) No edit summary |
||
| Line 12: | Line 12: | ||
Normally, a space is a space is a space. Whether a space between characters, a space between columns, or any other space between characters, those spaces are represented by a single space character in a document's text data. | Normally, a space is a space is a space. Whether a space between characters, a space between columns, or any other space between characters, those spaces are represented by a single space character in a document's text data. | ||
However, often, knowing there's a large amount of space one one or both sides of a label or value can be useful information for how to extract that data. The image | {|cellpadding=10 cellspacing=5 style="margin:20px" | ||
|style="width:50%" valign=top| | |||
However, often, knowing there's a large amount of space one one or both sides of a label or value can be useful information for how to extract that data. The image here has three columns each with pairs of numbers. | |||
| | |||
[[File:Tab Marking 01.png|center]] | [[File:Tab Marking 01.png|center]] | ||
|- | |||
|style="width:50%" valign=top| | |||
You can visually differentiate between the numbers in the second column from the others based on the spatial context around it. The numbers in this columns have a large amount of space on either side between them and the numbers in the other columns. | You can visually differentiate between the numbers in the second column from the others based on the spatial context around it. The numbers in this columns have a large amount of space on either side between them and the numbers in the other columns. | ||
| | |||
[[File:Tab Marking 02.png|center]] | |||
|- | |||
|style="width:50%" valign=top| | |||
However, with default extractor settings, there's no differentiation between the spaces between words and large spaces between the columns. We call words, phrases, numbers or other data separated by large amounts of space like this "segments". | |||
As is, it would be cumbersome to write a regex pattern to differentiate between the pairs of numbers (or other "segments" on the page). | |||
[[ | | | ||
[[file:tab Marking 03.png|center]] | |||
|- | |||
| | |||
With tab characters | |||
|} | |||
Revision as of 09:00, 13 August 2020

Tab Marking allows you to insert tab characters into a document's text data.
The Tab Marking property enables tab characters for regular expression pattern matching. These characters are inserted into a document's text data wherever there is a large gap of space between characters on a line.
About
Normally, a space is a space is a space. Whether a space between characters, a space between columns, or any other space between characters, those spaces are represented by a single space character in a document's text data.


