Initialize Card (Activity): Difference between revisions

From Grooper Wiki
m Randallkinard moved page Initialize Card to Initialize Card (Activity): new naming convention
{{Migrated}}
{{AutoVersion}}
{{2.80:{{PAGENAME}}}}
 
<blockquote>{{#lst:Glossary|Initialize Card}}</blockquote>
 
The '''Initialize Card''' activity is a specialized step in Grooper’s microfiche processing workflow. Its primary purpose is to organize and prepare the raw image tiles from a scanned microfiche card, setting the stage for accurate frame detection and extraction in subsequent activities.
 
== What is Initialize Card? ==
 
'''Initialize Card''' is the first processing step applied to a newly imported microfiche card. Microfiche cards are typically scanned as a collection of small, overlapping image tiles, each representing a portion of the card. These tiles must be sorted, grouped, and assembled into a logical structure before further processing can occur.
 
== What is it for? ==
 
The main goals of the Initialize Card activity are:
 
* '''Sorting and organizing tiles''': Automatically arranges the raw image tiles into subfolders by strip and tile position, based on their filenames.
* '''Preview assembly''': Creates a low-resolution preview image of the entire fiche card surface, which is used for visual reference and quality assurance in later steps.
* '''Validation''': Checks that the expected numbers of strips and tiles are present, flagging any discrepancies for review.
 
This organization is essential for accurate frame detection, extraction, and downstream data processing.
 
== How does it work? ==
 
The Initialize Card activity performs the following key operations:
 
# '''Tile sorting and grouping''': 
#* Uses a configurable regular expression (the "Ordering Pattern" property) to extract row and column information from each tile’s filename.
#* Groups tiles into subfolders by strip (row) and tile (column), creating a logical folder structure within the [[Batch Folder]].
#* Validates that the number of detected strips and tiles matches the expected configuration, raising errors if not.
# '''Preview image assembly''': 
#* Combines the sorted tiles to generate a low-resolution preview image of the fiche card.
#* Handles vertical overscan (extra image area at the top or bottom of strips) using the "Overscan Size" property, cropping as needed for a clean preview.
#* Stitches the strip images together, aligning them to minimize visible seams.
# '''Cleanup and finalization''': 
#* Removes the original "previews" folder after the preview image is created.
#* Saves the preview image as an attachment to the [[Batch Folder]] for use in review and diagnostics.
 
== General configuration steps ==
 
The Initialize Card activity is typically the first step in a microfiche [[Batch Process]]. To use it:
 
# Add the Initialize Card (Sort Tiles) activity to your [[Batch Process]].
# Configure the "Ordering Pattern" property to match your tile filename format. For example:
#: <code>(?&lt;Row&gt;[A-Z])\d\d-(?&lt;Column&gt;\d+)\.jpg$</code>
#*<li class="attn-bullet"> This pattern must include named groups "Row" and "Column" to extract the strip and tile identifiers.
# Set the expected "Strip Count" and "Tile Count" to match your fiche card layout.
# Adjust the "Overscan Size" if your tiles include extra image area at the top or bottom of strips.
# Run the [[Batch Process]] on your imported microfiche cards.
 
== Example workflow ==
 
After Initialize Card completes, the card’s images are organized and a preview is available. The next steps typically include:
 
* '''Detect Frames''': Locates the boundaries of individual document frames within each strip.
* '''Clip Frames''': Crops out each detected frame as a separate page for further processing.
 
A typical microfiche processing workflow might look like this:
 
# Fiche scan files are imported into Grooper.
# '''Initialize Card''' – Organize tiles and create a preview image.
# '''Detect Frames''' – Detect frame locations on each strip.
# '''Clip Frames''' – Extract each frame as a separate page.
# Additional steps such as image cleanup, OCR, classification, and data extraction.
 
== Best practices ==
 
* Ensure your tile filenames are consistent and match the configured "Ordering Pattern".
* Verify that each card has the expected number of strips and tiles.
* Review the generated preview image to confirm correct assembly and alignment.
* Use the error messages and flags generated by the activity to catch missing or misnamed tiles early.
 
== Related concepts ==
 
* [[Microfiche Processing]]
* [[Detect Frames]]
* [[Clip Frames]]

Revision as of 09:54, 13 August 2025

This article is about the current version of Grooper.

Note that some content may still need to be updated.


Initialize Card is a specialized Activity for processing microfiche in Grooper. It prepares and configures microfiche card images for further processing.

The Initialize Card activity is a specialized step in Grooper’s microfiche processing workflow. Its primary purpose is to organize and prepare the raw image tiles from a scanned microfiche card, setting the stage for accurate frame detection and extraction in subsequent activities.

What is Initialize Card?

Initialize Card is the first processing step applied to a newly imported microfiche card. Microfiche cards are typically scanned as a collection of small, overlapping image tiles, each representing a portion of the card. These tiles must be sorted, grouped, and assembled into a logical structure before further processing can occur.

What is it for?

The main goals of the Initialize Card activity are:

  • Sorting and organizing tiles: Automatically arranges the raw image tiles into subfolders by strip and tile position, based on their filenames.
  • Preview assembly: Creates a low-resolution preview image of the entire fiche card surface, which is used for visual reference and quality assurance in later steps.
  • Validation: Checks that the expected numbers of strips and tiles are present, flagging any discrepancies for review.

This organization is essential for accurate frame detection, extraction, and downstream data processing.

How does it work?

The Initialize Card activity performs the following key operations:

  1. Tile sorting and grouping:
    • Uses a configurable regular expression (the "Ordering Pattern" property) to extract row and column information from each tile’s filename.
    • Groups tiles into subfolders by strip (row) and tile (column), creating a logical folder structure within the Batch Folder.
    • Validates that the number of detected strips and tiles matches the expected configuration, raising errors if not.
  2. Preview image assembly:
    • Combines the sorted tiles to generate a low-resolution preview image of the fiche card.
    • Handles vertical overscan (extra image area at the top or bottom of strips) using the "Overscan Size" property, cropping as needed for a clean preview.
    • Stitches the strip images together, aligning them to minimize visible seams.
  3. Cleanup and finalization:
    • Removes the original "previews" folder after the preview image is created.
    • Saves the preview image as an attachment to the Batch Folder for use in review and diagnostics.
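
The preview assembly in step 2 can be illustrated with a small Python sketch. This is not Grooper's implementation: the function names are hypothetical, images are modelled as plain lists of pixel rows, and the overscan is assumed to be cropped equally from the top and bottom of each strip. The sketch also ignores the horizontal overlap between tiles and any seam alignment, which a real assembler would have to handle.

```python
from typing import List

Image = List[List[int]]  # an image as rows of grayscale pixel values


def join_tiles(tiles: List[Image]) -> Image:
    """Place the tiles of one strip side by side, left to right."""
    if not tiles:
        raise ValueError("a strip needs at least one tile")
    height = len(tiles[0])
    if any(len(tile) != height for tile in tiles):
        raise ValueError("tiles in a strip must share one height")
    # Concatenate the y-th pixel row of every tile into one long row.
    return [sum((tile[y] for tile in tiles), []) for y in range(height)]


def crop_overscan(strip: Image, overscan: int) -> Image:
    """Drop `overscan` pixel rows from the top and bottom of a strip.

    Assumption: the crop is symmetric. The real "Overscan Size" property
    may behave differently.
    """
    if overscan < 0 or 2 * overscan >= len(strip):
        raise ValueError("overscan must leave at least one pixel row")
    return strip[overscan:len(strip) - overscan]


def assemble_preview(strips: List[List[Image]], overscan: int) -> Image:
    """Stack the cropped strips top to bottom into one preview image."""
    preview: Image = []
    for tiles in strips:
        preview.extend(crop_overscan(join_tiles(tiles), overscan))
    return preview
```

With two strips of two 4-row tiles each and an overscan of 1, every strip contributes its two middle pixel rows, so the preview is 4 rows tall.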

General configuration steps

The Initialize Card activity is typically the first step in a microfiche Batch Process. To use it:

  1. Add the Initialize Card (Sort Tiles) activity to your Batch Process.
  2. Configure the "Ordering Pattern" property to match your tile filename format. For example:
    (?<Row>[A-Z])\d\d-(?<Column>\d+)\.jpg$
    • This pattern must include named groups "Row" and "Column" to extract the strip and tile identifiers.
  3. Set the expected "Strip Count" and "Tile Count" to match your fiche card layout.
  4. Adjust the "Overscan Size" if your tiles include extra image area at the top or bottom of strips.
  5. Run the Batch Process on your imported microfiche cards.
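
To see how the "Ordering Pattern", "Strip Count", and "Tile Count" settings interact, it can help to try the pattern against a few filenames outside Grooper. The Python sketch below is only an illustration: the function names and sample filenames are hypothetical, and it assumes that "Tile Count" means tiles per strip. Python spells named groups as (?P<Name>...), so the example pattern is rewritten in that form.

```python
import re
from collections import defaultdict
from typing import Dict, Iterable

# The example pattern, rewritten with Python's (?P<Name>...) group syntax.
ORDERING_PATTERN = re.compile(r"(?P<Row>[A-Z])\d\d-(?P<Column>\d+)\.jpg$")


def group_tiles(filenames: Iterable[str]) -> Dict[str, Dict[int, str]]:
    """Group filenames by strip (Row), then by tile position (Column)."""
    strips: Dict[str, Dict[int, str]] = defaultdict(dict)
    for name in filenames:
        match = ORDERING_PATTERN.search(name)
        if match is None:
            raise ValueError(f"filename does not match the pattern: {name}")
        strips[match["Row"]][int(match["Column"])] = name
    # Sort strips and tiles so the result mirrors the folder order.
    return {row: dict(sorted(cols.items())) for row, cols in sorted(strips.items())}


def validate_layout(strips: Dict[str, Dict[int, str]],
                    strip_count: int, tile_count: int) -> None:
    """Raise if the grouped tiles do not match the expected layout.

    Assumption: tile_count is the number of tiles in each strip.
    """
    if len(strips) != strip_count:
        raise ValueError(f"expected {strip_count} strips, found {len(strips)}")
    for row, tiles in strips.items():
        if len(tiles) != tile_count:
            raise ValueError(
                f"strip {row}: expected {tile_count} tiles, found {len(tiles)}")
```

For the hypothetical filenames A01-001.jpg, A01-002.jpg, B01-001.jpg, and B01-002.jpg, group_tiles returns two strips, "A" and "B", each holding tiles 1 and 2, and validate_layout(strips, 2, 2) passes without raising.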

Example workflow

After Initialize Card completes, the card’s images are organized and a preview is available. The next steps typically include:

  • Detect Frames: Locates the boundaries of individual document frames within each strip.
  • Clip Frames: Crops out each detected frame as a separate page for further processing.

A typical microfiche processing workflow might look like this:

  1. Fiche scan files are imported into Grooper.
  2. Initialize Card – Organize tiles and create a preview image.
  3. Detect Frames – Detect frame locations on each strip.
  4. Clip Frames – Extract each frame as a separate page.
  5. Additional steps such as image cleanup, OCR, classification, and data extraction.

Best practices

  • Ensure your tile filenames are consistent and match the configured "Ordering Pattern".
  • Verify that each card has the expected number of strips and tiles.
  • Review the generated preview image to confirm correct assembly and alignment.
  • Use the error messages and flags generated by the activity to catch missing or misnamed tiles early.
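
A pre-flight check along these lines can be run on a scan folder's file list before import. The Python sketch below is a hypothetical helper, not part of Grooper. It assumes the example filename pattern (rewritten in Python's (?P<Name>...) syntax), strips identified by single letters, and tile columns numbered from 1.

```python
import re
from typing import Iterable, List, Tuple

# Assumed filename format, taken from the example "Ordering Pattern".
PATTERN = re.compile(r"(?P<Row>[A-Z])\d\d-(?P<Column>\d+)\.jpg$")


def audit_tiles(filenames: Iterable[str], rows: str,
                tile_count: int) -> Tuple[List[str], List[Tuple[str, int]]]:
    """Return (misnamed filenames, missing (row, column) positions).

    `rows` lists the expected strip letters, for example "ABC".
    Assumption: columns in each strip run from 1 to tile_count.
    """
    misnamed: List[str] = []
    seen = set()
    for name in filenames:
        match = PATTERN.search(name)
        if match is None:
            misnamed.append(name)
        else:
            seen.add((match["Row"], int(match["Column"])))
    expected = {(row, col) for row in rows for col in range(1, tile_count + 1)}
    return misnamed, sorted(expected - seen)
```

For example, given A01-001.jpg, A01-002.jpg, B01-001.jpg, and the misnamed B1-002.jpg, with rows "AB" and a tile count of 2, the helper reports B1-002.jpg as misnamed and position ("B", 2) as missing.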

Related concepts