Importing Documents in Grooper
Revision as of 08:20, 12 May 2025
There are two ways to ingest documents into Grooper:
- You can scan physical documents.
  - Information about scanning can be found in the Scan Viewer article.
- You can import digital documents.
About
If you've got files, you're going to get them into Grooper by importing them. These could be PDFs in a Windows file system. They could be emails in an email inbox. They could be TIFF documents in an FTP share. They could be Word documents in a content management system.
If they're files in a digital storage platform, they're coming in by an import.
Import Jobs
To import documents, you must submit an "Import Job" to Grooper. Import Jobs are created in one of two ways:
- By a user - User-directed imports are "one-off" imports submitted by a user from the Imports Page in the Grooper UI. This kind of import is good for when you have a large number of files in some file system.
- By an Import Watcher service - Import Watcher services allow import operations to be automated. Using an Import Watcher, you can schedule imports at regular intervals. This could be on a continuous polling cycle (every x seconds, minutes, or hours) or on a set schedule (every day at 6:00, every Monday at noon, etc.).
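The difference between the two Import Watcher timing styles can be sketched in a few lines of Python. This is only an illustration of the scheduling idea, not Grooper's implementation: the function names `next_polling_run` and `next_daily_run` are hypothetical and do not correspond to any Grooper setting or API.

```python
from datetime import datetime, time, timedelta


def next_polling_run(last_run: datetime, interval: timedelta) -> datetime:
    """Polling cycle: the next run is a fixed interval after the last one."""
    return last_run + interval


def next_daily_run(now: datetime, at: time) -> datetime:
    """Set schedule: the next run is the next occurrence of a clock time."""
    candidate = datetime.combine(now.date(), at)
    if candidate <= now:
        # Today's slot has already passed, so the next run is tomorrow.
        candidate += timedelta(days=1)
    return candidate


if __name__ == "__main__":
    now = datetime(2025, 5, 12, 8, 20)
    print(next_polling_run(now, timedelta(seconds=30)))
    print(next_daily_run(now, time(6, 0)))
```

A polling cycle depends only on when the last import ran, while a set schedule depends on the wall clock, which is why the two are configured differently.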
Import Providers
To configure an import, you need to define at least where the files are coming from and what Batch Process to use. Both are done by configuring an Import Provider. Import Providers define how an import operation occurs in Grooper.
The "standard" Import Providers in Grooper are the two "CMIS Import" providers: Import Descendants and Import Query Results.
CMIS Import is standard because it uses a CMIS Connection to import from a CMIS Repository. These are the nodes in Grooper that represent connections to external storage platforms and locations within those platforms.
- This allows Grooper to standardize connection and import operations.
- With CMIS Connections and CMIS Repositories, we can connect to several different kinds of systems and use a single set of Import Providers (Import Descendants and Import Query Results) for all of them.
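The value of that standardization can be illustrated with a small Python sketch. Everything here is hypothetical and exists only to show the pattern: `Repository`, `InMemoryFileSystem`, `InMemoryMailbox`, and `import_descendants` are invented names, not Grooper classes or CMIS API calls. The point is that one import routine can serve any storage system that exposes the same interface.

```python
from typing import Iterable, Protocol


class Repository(Protocol):
    """Hypothetical common interface that every storage system exposes."""

    def list_descendants(self, folder: str) -> Iterable[str]: ...


class InMemoryFileSystem:
    """Stand-in for a file system connection (illustrative only)."""

    def __init__(self, files: Iterable[str]) -> None:
        self._files = list(files)

    def list_descendants(self, folder: str) -> Iterable[str]:
        prefix = folder.rstrip("/") + "/"
        return [f for f in self._files if f.startswith(prefix)]


class InMemoryMailbox:
    """Stand-in for an email inbox connection (illustrative only)."""

    def __init__(self, folders: dict[str, list[str]]) -> None:
        self._folders = folders

    def list_descendants(self, folder: str) -> Iterable[str]:
        return [f"{folder}/{subject}.eml" for subject in self._folders.get(folder, [])]


def import_descendants(repo: Repository, folder: str) -> list[str]:
    """A single import routine that works for any Repository."""
    return sorted(repo.list_descendants(folder))


if __name__ == "__main__":
    fs = InMemoryFileSystem(["/in/a.pdf", "/in/sub/b.tif", "/out/c.pdf"])
    mail = InMemoryMailbox({"Inbox": ["invoice", "claim"]})
    print(import_descendants(fs, "/in"))
    print(import_descendants(mail, "Inbox"))
```

Because `import_descendants` only relies on the shared interface, adding a new kind of storage system means writing a new connection class, not a new import routine.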