Skip to main content

Query processing of schema design problems for data-driven renormalization

Resource type
Thesis type
(Thesis) M.Sc.
Date created
2017-10-16
Authors/Contributors
Author: Meng, Xiao
Abstract
In the past decades, more and more information has been stored or delivered in non-relational data models—either in NoSQL databases or via a Software as a Service (SaaS) application. Users often want to load these data sets into a BI application or a relational database for further analysis. The data-driven renormalization framework is often used to transform non-relational data into relational data. In this thesis, we explore how to help users to make design decisions in such a framework. We formally define two kinds of queries—the point query and the stable interval query—to help users making design decisions. We propose two index structures, which can represent a list of FDs concisely but also process the queries efficiently. We conduct experiments on two real datasets and show that our algorithms greatly outperform the baseline method when processing a large set of FDs.
Document
Identifier
etd10419
Copyright statement
Copyright is held by the author.
Permissions
This thesis may be printed or downloaded for non-commercial research and scholarly purposes.
Scholarly level
Supervisor or Senior Supervisor
Thesis advisor: Pei, Jian
Member of collection
Download file Size
etd10419_XMeng.pdf 1.58 MB

Views & downloads - as of June 2023

Views: 0
Downloads: 0