A Formal Category Theoretical Framework for Multi-model Data Transformations

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Abstract

Data integration and migration processes in polystores and multi-model database management systems highly benefit from data and schema transformations. Rigorous modeling of transformations is a complex problem. The data and schema transformation field is scattered with multiple different transformation frameworks, tools, and mappings. These are usually domain-specific and lack solid theoretical foundations. Our first goal is to define category theoretical foundations for relational, graph, and hierarchical data models and instances. Each data instance is represented as a category theoretical mapping called a functor. We formalize data and schema transformations as Kan lifts utilizing the functorial representation for the instances. A Kan lift is a category theoretical construction consisting of two mappings satisfying the certain universal property. In this work, the two mappings correspond to schema transformation and data transformation.
Original languageEnglish
Title of host publicationHeterogeneous Data Management, Polystores, and Analytics for Healthcare : VLDB Workshops, Poly 2021 and DMAH 2021
EditorsEl Kindi Rezig, Vijay Gadepally, Timothy Mattson, Michael Stonebraker, Tim Kraska, Fusheng Wang, Gang Luo, Jun Kong, Alevtina Dubovitskaya
PublisherSpringer, Cham
Publication date20 Aug 2021
Pages14-28
ISBN (Print)978-3-030-93662-4
ISBN (Electronic)978-3-030-93663-1
DOIs
Publication statusPublished - 20 Aug 2021
MoE publication typeA4 Article in conference proceedings
EventVLDB Workshop on Polystore Systems for Heterogeneous Data in Multiple Databases with Privacy and Security Assurances -
Duration: 20 Aug 202120 Aug 2021

Fields of Science

  • 113 Computer and information sciences
  • multi-model databases
  • data transformations
  • category theory

Cite this