Create a Distilled Collection
A distilled collection uses the metadata from e-mail threading, near duplicates (NDD), and text duplicates (TD) to remove redundant documents from a collection.
To create a distilled collection:
- In the Explore panel, browse to the collection from which you want to create a distilled collection.
- Right-click the collection name and choose Create > Distilled Collection.
  A distilled collection configuration window opens for the collection you chose.
- Choose an EMAIL THREADING SET.
- (Optional) Choose an NDD SET.
- Choose a TD SET.
- Click RUN.
  A progress bar shows the progress of the distilled collection run.
View Distilled Collection Results
Results are displayed once a distilled collection run completes.
The following document count information is available in the results:
Document Count Result | Description |
---|---|
Unique documents before TD/NDD | Number of documents found after e-mail threading. |
After applying TD | Number of documents found after removing Text Duplicates. |
After applying NDD | Number of documents found after removing Near Duplicates. |
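
The three counts in the table can be read as successive filters, each applied to what the previous one kept. The sketch below only illustrates that relationship; it is not the product's implementation. The `Doc` class, its flag names, and the `distill_counts` function are hypothetical, and the sketch assumes that the threading, TD, and NDD sets have already marked each document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Doc:
    doc_id: str
    kept_by_threading: bool   # hypothetical flag: document survives e-mail threading
    is_text_duplicate: bool   # hypothetical flag: marked as a text duplicate (TD)
    is_near_duplicate: bool   # hypothetical flag: marked as a near duplicate (NDD)


def distill_counts(docs, apply_ndd=True):
    """Return the three document counts, applying each filter in turn."""
    after_threading = [d for d in docs if d.kept_by_threading]
    after_td = [d for d in after_threading if not d.is_text_duplicate]
    # The NDD set is optional, so skipping it leaves the TD result unchanged.
    if apply_ndd:
        after_ndd = [d for d in after_td if not d.is_near_duplicate]
    else:
        after_ndd = after_td
    return {
        "before_td_ndd": len(after_threading),
        "after_td": len(after_td),
        "after_ndd": len(after_ndd),
    }


# Made-up sample data, for illustration only.
docs = [
    Doc("A", True, False, False),
    Doc("B", True, True, False),
    Doc("C", True, False, True),
    Doc("D", False, False, False),
    Doc("E", True, False, False),
]
counts = distill_counts(docs)
print(counts)
```

Under these assumptions each count can only stay the same or decrease from one row to the next, which is a quick sanity check when reading real results.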
Clicking a document count link opens a new document search window with just the documents from that specific document count result.
Right-clicking a document count link in the results lets you perform one of the following actions:
- Creates a new collection
- Copies document IDs to the clipboard
- Exports document IDs to a text file or to a CSV file that can be opened in Excel
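
If you want to post-process a list of document IDs outside the application, the following minimal sketch shows one plausible shape for a text file and a CSV file. The exact layout of the product's export is not described here, so the one-ID-per-line text format, the `document_id` header, the sample IDs, and the function names are all assumptions.

```python
import csv
import io


def write_ids_as_text(doc_ids, stream):
    """Write one document ID per line (assumed text layout)."""
    for doc_id in doc_ids:
        stream.write(f"{doc_id}\n")


def write_ids_as_csv(doc_ids, stream, header="document_id"):
    """Write a single-column CSV with a hypothetical header row."""
    writer = csv.writer(stream, lineterminator="\n")
    writer.writerow([header])
    for doc_id in doc_ids:
        writer.writerow([doc_id])


def read_ids_from_csv(stream):
    """Read the single-column CSV back into a list, skipping the header."""
    reader = csv.reader(stream)
    next(reader, None)
    return [row[0] for row in reader if row]


# Made-up IDs, for illustration only.
ids = ["DOC-0001", "DOC-0002", "DOC-0003"]

text_buf = io.StringIO()
write_ids_as_text(ids, text_buf)

csv_buf = io.StringIO()
write_ids_as_csv(ids, csv_buf)
csv_buf.seek(0)
round_trip = read_ids_from_csv(csv_buf)
```

In-memory buffers keep the sketch self-contained; replacing them with files opened via `open(path, "w", newline="")` would write the same content to disk.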
Click Logs to view details on how the distilled collection results were processed.