Topological sorting and the ETL process

In this paper, we point out that a typical ETL process to populate a database can be thought of as a directed acyclic graph, and that existing graph algorithms can thus be used to resolve the order in which tables are processed during the load. While simple databases can easily be managed manually, such methods prove very useful for large and complex data warehouses. We extend an existing sorting algorithm to provide information on which tables can be loaded in parallel.
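The idea can be illustrated with a minimal sketch. The code below is not the algorithm from the paper; it is a layered variant of Kahn's topological sort, written under the assumption that each table lists the tables it depends on. The function name `load_stages` and the table names are hypothetical. Each returned stage contains tables whose dependencies are already loaded, so the tables within one stage could be loaded in parallel.

```python
def load_stages(dependencies):
    """Group tables into load stages.

    dependencies maps a table name to the set of tables that must be
    loaded before it. Tables in the same stage do not depend on each other.
    """
    # Collect every table, including those that only appear as a dependency.
    tables = set(dependencies)
    for parents in dependencies.values():
        tables |= set(parents)

    # Work on a copy so the caller's mapping is left untouched.
    remaining = {t: set(dependencies.get(t, ())) for t in tables}

    stages = []
    while remaining:
        # Tables with no unloaded dependencies are ready now.
        ready = sorted(t for t, parents in remaining.items() if not parents)
        if not ready:
            # Every remaining table waits on another one: the graph has a cycle.
            raise ValueError("cycle detected among: " + ", ".join(sorted(remaining)))
        stages.append(ready)
        for t in ready:
            del remaining[t]
        for parents in remaining.values():
            parents.difference_update(ready)
    return stages


# Hypothetical warehouse: two dimensions, one fact table, one aggregate.
example = {
    "dim_customer": set(),
    "dim_product": set(),
    "fact_sales": {"dim_customer", "dim_product"},
    "agg_sales": {"fact_sales"},
}
stages = load_stages(example)
for number, stage in enumerate(stages, start=1):
    print(number, stage)
```

In this example the two dimension tables land in the first stage, the fact table in the second, and the aggregate in the third. A dependency cycle raises a `ValueError`, since no valid load order exists for a graph that is not acyclic.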

Download the paper here.
