To automate a data export/extraction of a website based on MediaWiki, there are several possibilities, including the standard export opportunities MediaWiki offers, combined with our Xillio Content ETL platform.
To export the content a manual export is required from the source system; via the standard export options an XML file with all content is generated. Performing these exports is fairly simple. The images/files are taken directly from the front end using its own extraction robot.
Then the relevant content from the XML file is stored in the Unified Data Model (based on MongoDB). With the Content ETL platform and the MediaWiki scripts developed by Xillio, it is possible to analyze the data and eventually transform it so that the content complies with the data model of the target system.
Important features of the connector
Fill out the form and we will contact you as soon as possible.