Datasets

Datasets form the core component for managing and organizing data on the platform. The features include:

  • An intuitive Gallery-style view for visualizing DICOM Series thumbnails and metadata (configurable).

  • Multiselect function for performing operations on multiple series simultaneously such as adding/removing to/from a dataset, executing workflows, or creating new datasets.

  • A configurable side-panel metadata dashboard for exploring metadata distributions.

  • Shortcut-based tagging functionality for quick and effective data annotation and categorization.

  • Full-text search for filtering items based on metadata.

  • A side panel series viewer using an adjusted OHIF Viewer-v3 to display DICOM next to the series metadata.

  • Download functionality for selected series, allowing users to export data in DICOM format.

In the following sections, we delve into these functionalities.

Dataset Management and Workflow Execution

Interaction actions for the Gallery View are located above it. The first row is dedicated to selecting and managing datasets. Once a dataset is selected, the Gallery View will automatically update. A dataset management dialog, accessible from the same row, provides an overview of the platform’s datasets and enables deletion of unnecessary datasets.

Note

Deleting a dataset does not erase its contained data from the platform.

The second row is dedicated to filtering and searching. We offer a Lucene-based full-text search.

Note

Useful commands:

  • Use * for wildcarding, e.g., LUNG1-* shows all series with metadata starting with LUNG1-.

  • Use - for excluding, e.g., -CHEST excludes all series with metadata containing CHEST.

  • For more information, check the OpenSearch Documentation.

You can add additional filters for specific DICOM tags, with an autocomplete feature for convenience.

Note

Individual filters are combined with AND, while the different values within a filter are combined with OR.

Filtering

The following row handles tagging, a convenient way to structure data. Tags are free-text, but an autocomplete feature allows reusing existing tags. To tag a series, activate the tag(s) and then click on the series. The switch next to the tags enables tagging with multiple tags at once.

Note

  • Activate tags using shortcuts. Press 1 to toggle the first tag, 2 for the second, and so on.

  • If a series already has the currently active tag, clicking the series again will remove it. This also applies in multiple tags mode.

  • Remove tags by clicking the X next to the tag. (Note: Removing a tag this way will not update the Metadata Dashboard dashboard if it’s visualized there)

Tagging items in the gallery view

Metadata Dashboard

Next to the Gallery View is the Metadata Dashboard (configurable in Settings). This dashboard displays the metadata of the currently selected items in the Gallery View.

Note

Clicking on a bar in a bar chart will set the selected value as a filter. Click ‘search’ to execute the query.

Interacting with the Metadata Dashboard

Validation Results

Series cards include small icons in the form of warnings or errors in the lower right corner of the thumbnail that indicate the validation results for the series. Validation results are generated in the ingestion process using one of two supported tools: dciodvfy and pydicom-validator . They check DICOM files for missing or unexpected attributes.

Validation results in the gallery view

Detail View

For a more detailed look at a series, double-click a series card or click the eye icon at the top-right of the thumbnail to open the detail view in the side panel. This view comprises an OHIF-v3 viewer and a searchable metadata table for the selected series.

Detail view with OHIF viewer and metadata table

Settings

Settings can be found by clicking on the gear icon in the header of the navigation bar and then selecting DATASET CONFIGURATION. A dialog will open.

The Dataset view is highly configurable, allowing you to tailor the display to your needs. You can choose between the Gallery View and Structured Gallery View, set the number of items displayed per row, and decide whether to show only thumbnails or include series metadata as well.

Additionally, you can adjust the number of items displayed per page and specify the sorting value and direction. For large datasets, sorting can become slow. In such cases, it is recommended to use Slicing Search: the dataset is divided into slices, and only these slices are sorted, improving performance.

For each field in the metadata, the following options are available:

  • Dashboard: Display aggregated metadata in the Metadata Dashboard

  • Patient View: Display values in the patient card (if the Structured Gallery View is enabled)

  • Study View: Display values in the series card (if the Structured Gallery View is enabled)

  • Series Card: Display values in the Series Card

  • Truncate: Limit values in the Series Card to a single line for visual alignment across series

Saving the settings will update the configuration and reload the page.

Opening the settings window and adjusting the configuration.

Note

For now, the configuration of Settings is only stored in the browser’s local storage. Implications:

  • Clearing the browser cache will restore the default settings

  • Different users logging in from the same computer will access the same settings

  • Logging in with the same user on different computers will load the default settings