Training Batch (Concept): Difference between revisions
No edit summary |
No edit summary |
||
| Line 57: | Line 57: | ||
It is important to understand that the '''Training Set''' is not tied to the actual '''TF-IDF Weightings''' that is associated with the '''Content Type''' or '''Content Category'''. Purging the training from a '''Content Model''' does not delete any or all of the documents in the '''Training Set'''. Conversely, deleting a document from the '''Training Set''' does not remove or purge any'''TF-IDF Weightings'''from a '''Content Type''' or '''Content Category.''' | It is important to understand that the '''Training Set''' is not tied to the actual '''TF-IDF Weightings''' that is associated with the '''Content Type''' or '''Content Category'''. Purging the training from a '''Content Model''' does not delete any or all of the documents in the '''Training Set'''. Conversely, deleting a document from the '''Training Set''' does not remove or purge any'''TF-IDF Weightings'''from a '''Content Type''' or '''Content Category.''' | ||
<br/> | <br/> | ||
</ | <p/> | ||
==Version Differences== | ==Version Differences== | ||
Versions prior to '''Grooper 2.9''' do not automatically generate a '''Training Set''' batch in the local resources folder | Versions prior to '''Grooper 2.9''' do not automatically generate a '''Training Set''' batch in the local resources folder | ||
Revision as of 16:47, 16 April 2020
The Training Set batch is more convenient way to work with all of the samples a Content Model has been trained against
A Content Model and accompanying set of Batches can be found by following this link and downloading the provided file. It is not required to download to understand this article, but can be helpful because it can be used to follow along with the steps in this article. This file was exported from and meant for use in Grooper 2.9
About
During the development and training of Classification of a Grooper Content Model, it can be challenging to keep track of all of the samples you have trained TF-IDF against. In previous versions, each trained sample was stored under each content type in the Grooper Design Studio node tree. In 2.9, the trained samples are stored both under each content type and in the Training Set batch.
How To
|
Following is an example of how to perform TF-IDF classification that creates the Training Set batch. In the example content model, there are five different content types from three different batches. |
| ! | Some of the tabs in this tutorial are longer than the others. Please scroll to the bottom of each step's tab before going to the step. |
Prerequisites
Train Content Types
Review the Training Set batch
It is important to understand that the Training Set is not tied to the actual TF-IDF Weightings that is associated with the Content Type or Content Category. Purging the training from a Content Model does not delete any or all of the documents in the Training Set. Conversely, deleting a document from the Training Set does not remove or purge anyTF-IDF Weightingsfrom a Content Type or Content Category.
Version Differences
Versions prior to Grooper 2.9 do not automatically generate a Training Set batch in the local resources folder