2.80:Shape Removal (IP Command)

Shape Removal detects and removes shapes from documents.

The Shape Removal command builds upon the existing Shape Detection command. Shape Detection finds shapes in a document (such as logos) from a set of sample images given by the user. Shape Removal goes one step further and removes the detected shape from the document. The removed pixels are either filled in with a solid color or inpainted to try and match it to pixels nearby.

Removing a shape from a document may be helpful if it is interfering with another Grooper activity, such as getting OCR text from the Recognize activity.

Version Differences

Shape Removal is a new feature in version 2.80. Prior to 2.80 shapes could be detected via the Shape Detection command. However, Shape Detection was generally used for Visual document classification. Shape Removal would not have been possible in previous versions.

Use Cases

The primary use for the Shape Removal command is to improve a document's readability. Often, images on a page can interfere with OCR results from the Recognize activity. If you can give Shape Removal sample images of what to look for, it can remove those images from a document set. Logos, for example, are often removed by Shape Removal. As well as removing logos to improve OCR results (which can be done without removing them from the final exported documents), Shape Removal could also be used to permanently de-brand the exported documents.

Original Image

Logo detected and removed

How To: Add Shape Removal to an IP Profile

Property Details

Detection Settings Details

(% class="box floatinginfobox" %) ((( )))

The properties located in "Detection Settings" are used to set sample images to detect on documents and configure how and where they are detected. Pressing the ellipsis button at the end of the property will bring up a new window with the properties listed below.

(% class="grooper-table-properties" %)

=Property|=Default Value|=Information

Template:Icon name="chevron-down"Template:/icon|(% style="width:18%" %)**General**|(% style="width:10%" %) |

Sample Images|0 sample images|Here, you will capture sample images of the shape you want to detect. Press the ellipsis button at the end of the property to bring up a new window to add samples. You will select documents from a test batch and lasso the image to be detected.

Shape Name| |Use this property to type a name used to identify the shape.

Proximity Measure|SAD|This property sets how similarity is determined between sample images and other images. There are three methods available: SAD (or sum of absolute differences), CrossCorr (or normalized cross-correlation), and SSD (or sum of squared distances). Each method uses different different equations to compare the pixels in the sample image to the pixels on the document. SAD is a very simple way to automate searching for sample images. It measures the absolute difference between each pixel in the sample image with the corresponding pixel in the block its being compared to. Potentially, SAD may be unreliable given changes in lighting, color, or image degredation, but is generally the go do method for shape detection.

Background Differencing|False|Setting this property to true can help when dealing with shapes with a lot of blank space in the sample image. Shapes containing mostly white space can be challenging. If 90% of the image's pixels are white, the Shape Detection operation will match other regions on a document that also contain 90% white pixels. This can produce a lot of false-positive matches with high confidence that erroneous regions match the sample. When background differencing is enabled, the blank areas of the sample confidence values are scaled according to the color balance of the sample image. If the sample contains 90% white pixels, matched regions on the document falling below 90% confidence are effectively removed as matches.

Minimum Confidence|80%|This is the minimum confidence for a successful match (from 0% to 100%).

Template:Icon name="chevron-down"Template:/icon|(% style="width:18%" %)**Orientation and Scale**|(% style="width:10%" %) |

Maximum Angle|0 degrees|This can account for instances when the image on a document is slightly rotated from the sample image's orientation (between 0 and 360 degrees). Altering this property will also allow you to adjust the "Angle Step" during detection. For example, if you set the Maximum Angle to 25 degrees and an Angle Step of 5 degrees, Shape Detection would look for a match that is rotated -25, -20, -15, -10, -5, 0, 5, 10, 15, 20, and 25 degrees from the original image instead of every single degree from -25 to 25. The Maximum Angle must be an even multiple of the Angle Step (as in 5 is an even multiple of 25).

Minimum Scale|100%|This can account for instances when the image on a document is scaled slightly smaller than the sample image (between 10% and 100%). Altering this property will also allow you to adjust the "Scale Step" during detection. For example, if you set the Minimum Scale to 50% and Scale Step to 10%, Shape Detection would look for a match that is 100%, 90%, 80%, 70%, 60%, and 50% the size of the sample image instead of 100%, 99%, 98%, and so on.

Maximum Scale|100%|This can account for instances when the image on a document is scaled slightly larger than the sample image (between 100% and 400%). Altering this property will also allow you to adjust the "Scale Step" during detection. For example, if you set the Maximum Scale to 150% and Scale Step to 10%, Shape Detection would look for a match that is 100%, 110%, 120%, 130%, 140%, and 150% the size of the sample image instead of 100%, 101%, 102%, and so on.

Template:Icon name="chevron-down"Template:/icon|(% style="width:18%" %)**Preprocessing**|(% style="width:10%" %) |

Processing Resolution|Dpi50|This sets the resolution at which the image is processed during Shape Detection. This does not change the output resolution of the document itself. It only effects the resolution when Grooper is looking for match to the sample image. A higher dpi will force a more specific 1:1 match to the sample image. A lower resolution will allow for a "looser" or "fuzzier" match, accounting for differences in the quality of the sample compared to the document set.

Binarization|Disabled|(((

Binarization converts color images to black and white by "thresholding" the image. Searching for a flat black and white shape on a (also binarized) black and white document may end up producing more accurate results. This does not binarize the document itself, it only does so temporarily for Shape Detection. After detection is performed, the image reverts to its original form.

Thresholding is the process of setting a threshold value on the pixel intensity of the original image. Pixel intensity is a pixel's "lightness" or "brightness". Essentially, once a midpoint between the most intense ("whitest") and least intense ("blackest") pixel on a page is established, lighter pixels are converted to white and darker are converted to black. Or put another way, pixels with an intensity value above the threshold are converted to white, and those below the threshold are converted to black. This midpoint (or "threshold") can be set manually or found automatically by a software application. The Thresholding Method can be set to one of four ways:

Simple - Thresholds an image to black and white using a fixed threshold value between 1 and 255.
Auto - Selects a threshold value automatically using Otsu's Method.
Adaptive - Thesholds pixels based on the intensity of pixels in the local neighborhood.
Dynamic - Performs adaptive thresholding, while preserving dark areas on the page.

Each method has its own set of configurable properties. For more information on binarization and these methods, visit the [[Binarize>>url:https://wiki.grooper.com/xwiki/bin/view/Grooper%20Space/Articles/Core/Batch%20Processing/Image%20Processing/Commands/Binarize/]] article. )))

Dilation Factor|0|"Dilation Factor" will bloat the edges of the sample image. This is another way of getting a "fuzzy" match from the sample. It will increase the range of pixels possible to produce a match along a shape's edge.

Region of Interest (inches)|(0,0) : (0,0)|If you know the physical location (give or take) the shape will be on a document, you can limit where Grooper looks for it using the "Region of Interest" property. Pressing the ellipsis button at the end of the property will bring up a new window that allows you to lasso the area you expect to find the shape with your mouse.

Binarization Details

(% class="box floatinginfobox" %) ((( )))

Binarization converts color images to black and white by "thresholding" the image. Once a sample shape is found on a document, the document is binarized in order to target the pixels to be removed.

Thresholding is the process of setting a threshold value on the pixel intensity of the original image. Pixel intensity is a pixel's "lightness" or "brightness". Essentially, once a midpoint between the most intense ("whitest") and least intense ("blackest") pixel on a page is established, lighter pixels are converted to white and darker are converted to black. Or put another way, pixels with an intensity value above the threshold are converted to white, and those below the threshold are converted to black. This midpoint (or "threshold") can be set manually or found automatically by a software application. The Thresholding Method can be set to one of four ways:

Simple - Thresholds an image to black and white using a fixed threshold value between 1 and 255.
Auto - Selects a threshold value automatically using Otsu's Method.
Adaptive - Thesholds pixels based on the intensity of pixels in the local neighborhood.
Dynamic - Performs adaptive thresholding, while preserving dark areas on the page.

Each method has its own set of configurable properties. For more information on binarization and these methods, visit the [[Binarize>>url:https://wiki.grooper.com/xwiki/bin/view/Grooper%20Space/Articles/Core/Batch%20Processing/Image%20Processing/Commands/Binarize/]] article.

Dilation Factor Details

(% class="box floatinginfobox" %) ((( )))

Dilation Factor here, in the main Shape Removal property panel, controls how dilated the Shape Mask is.  The Shape Mask is overlaid on a binarized document after one of the sample shapes was detected.  All pixels falling under the Shape Mask will be dropped out.  Dilating the mask widens the sample image, adding a pixel border around it, effectively expanding its edges.  Since all pixels underneath the Shape Mask will be removed, dilating it can account for small variations between the sample image and the image being removed.  The objective is to bloat the Shape Mask enough to intersect these small variations, but not too much to intersect other meaningful features on the page, such as text.  Only positive numbers are allowed here.  Meaning the Shape Mask can only be dilated, not eroded.

Dropout Method Details

(% class="box floatinginfobox" %) ((( )))

This property determines how pixels targeted for removal during the dropout operation will be "removed".  They are not removed in that they are deleted.  They are removed in that they are colored in to match the image's background.  This can be set to "Fill" or "Inpaint".

(% style="width:50%" %)

(((

"Inpaint" fills the dropout mask using color information from pixels around the removed pixels. This method is designed to match removed pixels to a colored or complex background. Student transcripts are a great example. They often are printed on paper with some kind of patterned background. There are two "Inpaint Method" options: Telea and NavierStokes.

"Telea" restores pixels by approximating the value of the removed pixels based on the average value of pixels around it. If 75% of the pixels around it are white and 25% of the pixels around it are black, the pixel would become white. The area of known pixels used to find this color average is called a "neighborhood".
"NavierStokes" uses equations from fluid dynamics to fill in pixels the same way a fluid would fill a void.

The "Inpaint Radius" property specifies how large the area around the dropped out pixels Grooper is "looking at" to get a picture of how to fill it in. It increases the size of the analyzed "neighborhood" of pixels. The "Mask Dilation Factor" will dilate the filled shape. Colored pixels will be added to the shape's borders, increasing the size of the removed area. )))|(% style="vertical-align:top; width:50%" %)

2.80:Shape Removal (IP Command)

Version Differences

Use Cases

How To: Add Shape Removal to an IP Profile

Before you begin

Add Shape Removal to your IP Profile

Open Shape Detection settings

Give sample images

Set how similarity is determined

Account for alternate image scale and angle

Preprocessing options

A brief aside about masks

Binarize the document

Remove the shape's pixels

Dropout Method: Fill

Dropout Method: Inpaint

Dilation Factor vs. Dilation Factor vs. Mask Dilation Factor

Property Details

Detection Settings Details

Binarization Details

Dilation Factor Details

Dropout Method Details