What is Content ETL?

ETL stands for Extract, Transform and Load. It is a concept from computing that is traditionally applied to structured data. It refers to the processes used to unify data from different structured databases (such as ECM, WCM and DM systems) into another database, often a data warehouse.

The concept of Content ETL was introduced more recently and applies only to unstructured data, as stored in file systems and in ECM, WCM and DM systems. Content ETL refers to the processes used to exchange data between multiple content repositories. The ETL process for unstructured data is much more difficult than for structured data, for a number of reasons.
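
To make the three steps concrete, here is a minimal sketch in Python. It assumes both the source and the target repository are plain folders on a file system; the function names, the metadata fields and the ".meta.json" sidecar convention are all hypothetical choices for illustration, not part of any particular ECM, WCM or DM product.

```python
import hashlib
import json
import mimetypes
import shutil
from pathlib import Path


def extract(source_dir):
    """Extract: yield every file found under the source repository."""
    for path in sorted(Path(source_dir).rglob("*")):
        if path.is_file():
            yield path


def transform(path, source_dir):
    """Transform: derive metadata for one document (hypothetical field set)."""
    mime_type, _ = mimetypes.guess_type(path.name)
    return {
        "title": path.stem.replace("_", " ").strip(),
        "source_path": path.relative_to(source_dir).as_posix(),
        "mime_type": mime_type or "application/octet-stream",
        "sha256": hashlib.sha256(path.read_bytes()).hexdigest(),
    }


def load(path, metadata, target_dir):
    """Load: copy the content and write its metadata as a JSON sidecar file."""
    target = Path(target_dir) / metadata["source_path"]
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy2(path, target)
    sidecar = target.with_name(target.name + ".meta.json")
    sidecar.write_text(json.dumps(metadata, indent=2), encoding="utf-8")
    return target


def run_content_etl(source_dir, target_dir):
    """Run extract, transform and load for every document; return the metadata."""
    records = []
    for path in extract(source_dir):
        metadata = transform(path, source_dir)
        load(path, metadata, target_dir)
        records.append(metadata)
    return records
```

Calling run_content_etl("old_repository", "new_repository") would copy every file to the target folder and place a metadata file next to it. A real content repository would normally be reached through its own API rather than through folders, and the transform step is where most of the difficulty with unstructured data tends to sit.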

What's the point?

Content ETL is used for various purposes, namely:

  • Compliance: Tracking an audit trail and Access Control Lists (ACLs), and ensuring that unstructured content complies with laws and regulations.
  • Integration, migration and conversion: With extensive metadata and classification, unstructured data from multiple source systems can be migrated or converted to a new target system more easily.
  • Data fit: Providing a better fit between the data and the functionality of the new ECM or DM system.