Checking for duplicates in the registry

Use this task during the audit and harmonisation phase to identify duplicate entries in the Aristotle registry. Removing duplicates makes the registry more accurate and reliable, which supports regulatory compliance and data integrity, and saves time and resources during the audit. The task follows a two-step process:

  1. Extracting the metadata using the report builder

  2. Using Excel to check for duplicates

Metadata Report Builder

The metadata report builder is a feature of the Aristotle registry platform that lets administrators generate detailed reports on the metadata stored in the registry. These reports give a comprehensive overview of the metadata, including its quality and its relationships with other metadata elements in the registry.

  1. You can access the report builder in the Toolbox in the header dropdown.

  2. Scroll down to the Metadata Reports section and click on 'Go to metadata reports'.

  3. Select 'New Report' to generate a new report.

  4. Select a Stewardship Organisation. You can then select the type of metadata to check for duplicates in the registry.

  5. Select the model you are trying to find duplicates for. For example, if you are trying to find duplicates of Object Classes select 'Object Class' from the dropdown.

  6. In the 'Add columns' section, drag and drop the id and name fields of the metadata type from the left-hand 'Item fields' column.

  7. You can also add filters. These can be dragged from the left-hand 'Item fields' column.

  8. Once done, select 'generate report' to view the report. You can then download all the items of that metadata type in Excel or CSV format.

Checking for duplicates in Excel

  1. Select the column that contains the 'name' of the metadata.

  2. Under the Home tab in Excel, navigate to the 'conditional formatting' feature. From the list, select 'Highlight cells rules' and click on 'Duplicate values'.

  3. Click 'OK' to apply the highlighting to the selected column.

  4. The next step is to filter the rows that contain the highlighted duplicate values. To do so, navigate to the 'sort & filter' feature under the Home tab. Click on 'Filter' from the drop-down list.

  5. You will notice that the filter is now applied to the column.

  6. Open the filter drop-down on the column and choose 'Sort by Cell Color' to bring the highlighted duplicate values together.

  7. You will now have the list of duplicate values.
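If you download the report as a CSV instead, the same check can be scripted. The sketch below is a minimal illustration, not part of the Aristotle platform. It assumes the exported file has header columns called `id` and `name`, matching the fields added in the report builder; the real header text in your export may differ. The function name `find_duplicate_names` is hypothetical. Names are compared ignoring letter case and extra whitespace, which is an assumption about what should count as a duplicate.

```python
import csv
from collections import defaultdict


def find_duplicate_names(csv_path, id_column="id", name_column="name"):
    """Return {normalised name: [ids]} for every name that occurs more than once.

    The column names are assumptions about the exported report's header row.
    """
    ids_by_name = defaultdict(list)
    # utf-8-sig tolerates the byte-order mark some spreadsheet exports add.
    with open(csv_path, newline="", encoding="utf-8-sig") as handle:
        for row in csv.DictReader(handle):
            # Collapse runs of whitespace and ignore case so that
            # "Person" and "person " are treated as the same name.
            key = " ".join(row[name_column].split()).casefold()
            ids_by_name[key].append(row[id_column])
    # Keep only the names shared by two or more items.
    return {name: ids for name, ids in ids_by_name.items() if len(ids) > 1}
```

For example, calling `find_duplicate_names("object_classes.csv")` on a hypothetical export would return each repeated name together with the ids of the items that share it, which are the candidates to review for merging.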

You can merge this metadata using the 'Metadata merger' tool. Click here to learn how to use the Metadata Merger Tool.
