Analysis & Strategy
We collaborated with the UX and development teams to iteratively refine an overall taxonomy strategy and document the metadata needed for each content type. Working with stakeholders, we identified opportunities to reduce the effort required of content creators when entering metadata. The final framework included only the essential properties specified by the site requirements.
Once we had sign-off on the taxonomy and metadata strategy, we audited and compared existing vocabularies and identified gaps and conflicts. Through research into specific subject areas, we carefully crafted labels and levels of granularity, especially around sensitive human rights issues.
Since the taxonomy development coincided with a broader content audit project, we could leverage the audit to “test drive” our proposed content types to ensure they fit the content.
After finalizing the taxonomy and validating it with stakeholders, we cross-mapped the taxonomy with the existing vocabularies. These mappings supported both programmatic retagging of content to be migrated and the creation of links to other legacy systems. With the mappings in hand, web developers incorporated retagging into the content migration process through scripts and APIs.
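To illustrate how a cross-mapping can drive programmatic retagging, here is a minimal sketch in Python. It is not the project's actual migration code: the vocabulary names, the term labels, and the `retag` function are all hypothetical, and it assumes each legacy tag arrives as a (vocabulary, term) pair.

```python
# Hypothetical cross-map: (legacy vocabulary, legacy term) -> new taxonomy term.
# Every name and label here is invented for illustration.
CROSS_MAP = {
    ("legacy_topics", "Womens Rights"): "Women's Rights",
    ("legacy_topics", "Free Speech"): "Freedom of Expression",
    ("legacy_regions", "Mid East"): "Middle East and North Africa",
}


def retag(legacy_tags, cross_map=CROSS_MAP):
    """Translate legacy (vocabulary, term) pairs into new taxonomy terms.

    Returns a tuple (new_tags, unmapped): new_tags keeps first-seen order
    without duplicates, and unmapped lists the pairs that have no entry in
    the cross-map so they can be reviewed by hand.
    """
    new_tags = []
    unmapped = []
    for key in legacy_tags:
        target = cross_map.get(key)
        if target is None:
            unmapped.append(key)
        elif target not in new_tags:
            new_tags.append(target)
    return new_tags, unmapped
```

Returning the unmapped pairs separately, rather than silently dropping them, makes the gaps in a mapping visible for manual review.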
Migration and Retagging
Not all legacy content had been tagged consistently, so we needed to perform additional retagging and migration after completing our cross-mapping. Using a pattern-matching script that looked for clues in URLs, titles, and other metadata, we determined which tags to apply.
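A pattern-matching pass of this kind can be sketched as a list of rules, each pairing a metadata field with a regular expression and the tag to suggest when it matches. The rules, field names, and tags below are hypothetical examples, not the ones used in the project, and the sketch assumes each content item is a plain dictionary.

```python
import re

# Hypothetical rules: (field to inspect, pattern, tag to suggest).
RULES = [
    ("url", re.compile(r"/news/", re.IGNORECASE), "News"),
    ("url", re.compile(r"/reports?/", re.IGNORECASE), "Report"),
    ("title", re.compile(r"\bpress release\b", re.IGNORECASE), "Press Release"),
]


def suggest_tags(item, rules=RULES):
    """Return suggested tags for a content item, in rule order, without duplicates.

    Missing or empty fields are treated as empty strings, so an item with
    incomplete metadata simply matches fewer rules.
    """
    tags = []
    for field, pattern, tag in rules:
        value = item.get(field) or ""
        if pattern.search(value) and tag not in tags:
            tags.append(tag)
    return tags
```

Because the output is only a suggestion, a sketch like this fits a workflow in which people review the automated tags afterward.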
After developers migrated content into the Drupal CMS, the Dovecot tagging team reviewed and adjusted the automated tags as needed. At the same time, they conducted QA testing and cleaned up migrated content, for example by adding thumbnail images or filling in other missing metadata properties.