Attachment Type: Difference between revisions

{{AutoVersion}}
 
<blockquote>Files attached to a {{BatchFolderIcon}} [[Batch Folder]] are defined by their "[[Attachment Type]]". The Attachment Type (determined by the file's MIME type) controls which properties and commands are applicable in Grooper.</blockquote>
 
When a file is imported into Grooper, two things happen:
# A {{BatchFolderIcon}} [[Batch Folder]] is created.
# The file is attached to that Batch Folder.
 
Grooper offers different sets of commands depending on the file's type (PDF, ZIP, TXT, etc.). For example, Grooper can execute the "Unzip" command on ZIP files. Grooper assigns each attached file an "Attachment Type" based on its MIME type, and the Attachment Type determines which commands are available and can be executed.
:*<li class="fyi-bullet"> Importing files is the most common way a file is attached to a Batch Folder. However, the {{MergeIcon}} [[Merge]] activity can also generate an attachment file.
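
The relationship between MIME type, Attachment Type, and available commands can be pictured as a pair of lookup tables. The following is a minimal sketch, not Grooper's implementation: the table names and the <code>available_commands</code> function are hypothetical, and only the PDF and vCard MIME types and the command names are taken from this article.

```python
# Hypothetical lookup tables for illustration only.
# The "application/pdf" and "text/vcard" MIME types and all command names
# come from this article; "text/plain" is an assumed MIME type.
ATTACHMENT_TYPE_BY_MIME = {
    "application/pdf": "PDF Document",
    "text/vcard": "vCard",
    "text/plain": "Text Document",  # assumption, not stated in the article
}

COMMANDS_BY_ATTACHMENT_TYPE = {
    "PDF Document": ["Burst", "Compact", "Repair"],
    "Text Document": ["Insert Page Breaks", "Normalize", "Split"],
    "vCard": ["Expand Photo"],
}


def available_commands(mime_type):
    """Return the commands for a MIME type, or an empty list if unknown."""
    attachment_type = ATTACHMENT_TYPE_BY_MIME.get(mime_type)
    if attachment_type is None:
        return []
    return COMMANDS_BY_ATTACHMENT_TYPE.get(attachment_type, [])


print(available_commands("application/pdf"))
```

A MIME type with no matching Attachment Type in this sketch simply yields no commands.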
 
== Attachment Types and their commands ==
 
Note that these Attachment Types are listed by how commonly Grooper users encounter them, not alphabetically.
 
=== PDF Document ===
 
Represents a PDF document. Provides properties and commands which apply to PDF documents. Handles MIME type "application/pdf". If the file has no MIME type value, "application/pdf" will be inferred from the file extension ".pdf".
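
The extension fallback described above can be sketched as follows. This is an illustration under stated assumptions rather than Grooper's code: <code>resolve_mime_type</code> and <code>MIME_BY_EXTENSION</code> are hypothetical names, and the table holds only the two extension mappings this article mentions (".pdf" and ".vcf").

```python
import os

# Extension fallbacks named in this article. Any further entries would be
# assumptions about Grooper's behavior, so none are listed here.
MIME_BY_EXTENSION = {
    ".pdf": "application/pdf",
    ".vcf": "text/vcard",
}


def resolve_mime_type(filename, declared_mime=None):
    """Prefer the declared MIME type; otherwise infer it from the extension."""
    if declared_mime:
        return declared_mime
    extension = os.path.splitext(filename)[1].lower()
    return MIME_BY_EXTENSION.get(extension)  # None when nothing can be inferred


print(resolve_mime_type("scan.PDF"))
```

The declared MIME type always wins in this sketch; the extension is consulted only when no MIME type value is present.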
 
<big>PDF Commands</big>
* '''Burst''' - Splits a PDF document into smaller documents.  Generates child Batch Folders with PDF attachments containing pages from the original document according to a "PDF Expand Method" (Fixed Page Count, Tag Based, Bookmarks, or Page Piece).
* '''Compact''' - Reduces the size of a PDF file by removing duplicate fonts, images, and other artifacts. Performs a hash-based deduplication of the objects inside a PDF file.
* '''Repair''' - Repairs PDF files which contain minor errors.
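
To make two of these ideas concrete, the sketch below shows how a "Fixed Page Count" split could assign page ranges to child documents, and how hash-based deduplication can collapse identical objects into one stored copy. Both functions are hypothetical illustrations that work on plain numbers and byte strings; they do not read PDF files and are not Grooper's implementation. SHA-256 is an assumed choice of hash.

```python
import hashlib


def fixed_page_count_ranges(total_pages, pages_per_document):
    """Return (first_page, last_page) pairs, 1-based and inclusive."""
    if pages_per_document < 1:
        raise ValueError("pages_per_document must be at least 1")
    return [
        (start, min(start + pages_per_document - 1, total_pages))
        for start in range(1, total_pages + 1, pages_per_document)
    ]


def deduplicate_objects(objects):
    """Keep one copy of each distinct byte string.

    Returns (unique_objects, remap), where remap[i] is the index in
    unique_objects that replaces the original object i.
    """
    unique = []
    index_by_digest = {}
    remap = []
    for data in objects:
        digest = hashlib.sha256(data).hexdigest()  # assumed hash function
        if digest not in index_by_digest:
            index_by_digest[digest] = len(unique)
            unique.append(data)
        remap.append(index_by_digest[digest])
    return unique, remap


print(fixed_page_count_ranges(10, 4))
print(deduplicate_objects([b"font-a", b"image-b", b"font-a"]))
```

In the deduplication sketch, references to a removed duplicate are redirected through the remap list, which is why the saved space does not change what the document displays.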
 
=== TIFF Document ===
Represents a TIFF document.
 
<big>TIFF Commands</big>
* There are no Grooper commands for TIFF documents (but TIFF files can be processed by several Grooper Activities).
 
=== Text Document ===
 
Represents a plain text file (TXT).
 
<big>Text Document Commands</big>
* '''Insert Page Breaks''' - Inserts page breaks into a text document.
* '''Normalize''' - Normalizes the encoding and control characters in a text document.
* '''Split''' - Splits a text document into smaller documents, using an extractor to identify split positions within the text content.
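
As a rough illustration of Normalize and Split, the sketch below unifies line endings, strips stray control characters, and then starts a new piece at every position where a pattern matches. A regular expression stands in for the extractor here; the function names, the choice of which control characters to keep, and the sample pattern are all assumptions, not Grooper's behavior.

```python
import re


def normalize_text(text):
    """Unify line endings to "\\n" and drop control characters.

    Tab, newline, and form feed are kept (an assumption for this sketch).
    """
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    return re.sub(r"[\x00-\x08\x0b\x0e-\x1f\x7f]", "", text)


def split_text(text, pattern):
    """Split text so that each match of pattern begins a new piece."""
    starts = [m.start() for m in re.finditer(pattern, text, flags=re.MULTILINE)]
    if not starts or starts[0] != 0:
        starts.insert(0, 0)  # keep any text that precedes the first match
    bounds = starts + [len(text)]
    pieces = [text[a:b] for a, b in zip(bounds, bounds[1:])]
    return [piece for piece in pieces if piece]


sample = normalize_text("INVOICE 1\r\nline\r\nINVOICE 2\r\nline\r\n")
print(split_text(sample, r"^INVOICE"))
```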
 
=== Microsoft Office Documents ===
 
==== Excel Document ====
 
Represents an Excel document.
 
<big>Excel Commands</big>
* '''[[Convert to CSV]]''' - Converts an Excel spreadsheet to a Comma-Separated Values (CSV) file.
 
==== Word Document ====
 
Represents a Word Document.
 
<big>Word Commands</big>
* '''[[Word to PDF]]''' - Generates a PDF from a Word document.
 
==== PowerPoint Document ====
 
Represents a PowerPoint Document.
 
<big>PowerPoint Commands</big>
* There are currently no Grooper commands for PowerPoint documents.
 
=== HTML Document ===
 
Represents an HTML document (such as a webpage).
 
 
<big>HTML Commands</big>
* '''[[Condition HTML]]'''
 
=== EDI File ===
 
Represents an EDI X12 document.
 
<big>EDI Commands</big>
* '''Bundle''' - Replaces the selected EDI files with a new set of files containing N transactions each.
* '''Load Data''' - Loads data from an EDI document into a Data Model.
* '''Split Envelops''' - Splits an EDI 837 file, creating a child document for each interchange control envelope (i.e., each ISA envelope).
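
The envelope split and the N-per-file bundling can be sketched in a few lines. This is a simplified illustration, not Grooper's implementation: it assumes "~" as the segment terminator and "*" as the element separator (real X12 files declare their delimiters in the ISA segment), the function names are hypothetical, and the bundling step only groups transactions without rebuilding envelopes or control counts.

```python
def split_interchanges(edi_text, segment_terminator="~", element_separator="*"):
    """Return one string per interchange, each starting at an ISA segment."""
    segments = [s.strip() for s in edi_text.split(segment_terminator) if s.strip()]
    interchanges = []
    current = []
    for segment in segments:
        segment_id = segment.split(element_separator, 1)[0]
        if segment_id == "ISA" and current:
            interchanges.append(current)  # a new envelope closes the previous one
            current = []
        current.append(segment)
    if current:
        interchanges.append(current)
    return [
        segment_terminator.join(group) + segment_terminator
        for group in interchanges
    ]


def bundle(transactions, n):
    """Group transactions into lists of at most n items each."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return [transactions[i:i + n] for i in range(0, len(transactions), n)]


sample = "ISA*00*A~GS*HC~ST*837*0001~SE*2*0001~GE*1*1~IEA*1*1~ISA*00*B~IEA*0*2~"
print(len(split_interchanges(sample)))
print(bundle(["t1", "t2", "t3"], 2))
```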
 
=== PST File ===
 
Represents a Microsoft Outlook PST (Personal Storage Table) file.
 
<big>PST Commands</big>
* '''Burst''' - Creates a child document for each message in the PST file. Extracts mail messages only. Appointments, Contacts, Tasks, and other non-mail items present in the PST file will be ignored.
 
=== vCard ===
 
Represents an RFC 6350 vCard (VCF) file. Handles MIME type "text/vcard". If the file has no MIME type value, "text/vcard" will be inferred from the file extension ".vcf".
 
<big>vCard Commands</big>
* '''Expand Photo''' - Expands the included photo (if available) as a child of the folder object.
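
For readers curious about what extracting an embedded photo involves, the sketch below pulls the first PHOTO property out of a vCard when it is stored as a base64 "data:" URI, the embedded form used by RFC 6350. It is an illustration only: <code>extract_photo</code> is a hypothetical helper, photos referenced by URL or stored in older vCard 3.0 encodings are not handled, and nothing here reflects how Grooper implements the command.

```python
import base64
import re


def unfold(vcard_text):
    """Join folded lines: a line break followed by a space or tab continues a line."""
    return re.sub(r"\r?\n[ \t]", "", vcard_text)


def extract_photo(vcard_text):
    """Return (media_type, image_bytes) for the first embedded PHOTO, or None."""
    for line in unfold(vcard_text).splitlines():
        name, _, value = line.partition(":")
        if name.split(";")[0].upper() != "PHOTO":
            continue
        match = re.fullmatch(r"data:([^;,]+);base64,(.+)", value)
        if match:
            return match.group(1), base64.b64decode(match.group(2))
    return None  # no embedded photo available


payload = base64.b64encode(b"fake-image-bytes").decode()
card = (
    "BEGIN:VCARD\r\nVERSION:4.0\r\nFN:Test Person\r\n"
    "PHOTO:data:image/png;base64," + payload + "\r\nEND:VCARD\r\n"
)
print(extract_photo(card))
```

Returning None when no embedded photo is found mirrors the "if available" condition in the command description.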

Revision as of 11:30, 25 June 2025

This article is about the current version of Grooper.
