Create a Distilled Collection

IN THIS ARTICLE:

A distilled collection uses the metadata from e-mail threading, near duplicates, and text duplicates to remove data from a collection.

To create a distilled collection:

  1. Browse to the collection from which you want to create a distilled collection in the Explore panel.
  2. Right-click the collection name and choose Create > Distilled Collection.


    A distilled collection configuration window opens for the collection you choose.



  3. Choose an EMAIL THREADING SET.
  4. (Optional) Choose an NDD SET.
  5. Choose a TD SET.
  6. Click RUN.


    A progress bar shows the progress of the distilled collection run.


View Distilled Collection Results

Results are displayed once a distilled collection run completes.

The following document count information is available in the results:

Document Count Result Description
Unique documents before TD/NDD Number of documents found after e-mail threading.
After applying TD Number of documents found after removing Text Duplicates.
After apply NDD Number of documents found after removing Near Duplicates.

Clicking a document count link opens a new document search window with just the documents from that specific document count result.

Right-clicking a document count link in the results performs one of the following actions:

  • Creates a new collection
  • Copies document IDs to the clipboard
  • Exports document IDs to a text or Excel file (CSV file)

Click Logs to view details on how the distilled collection results were processed.