IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Towards a Normal Form and a Query Language for Extended Relations Defined by Regular Expressions

Towards a Normal Form and a Query Language for Extended Relations Defined by Regular Expressions
View Sample PDF
Author(s): András Benczúr (Eötvös Loránd University, Faculty of Informatics, Budapest, Hungary)and Gyula I. Szabó (Eötvös Loránd University, Faculty of Informatics, Budapest, Hungary)
Copyright: 2016
Volume: 27
Issue: 2
Pages: 22
Source title: Journal of Database Management (JDM)
Editor(s)-in-Chief: Keng Siau (City University of Hong Kong, Hong Kong SAR)
DOI: 10.4018/JDM.2016040102

Purchase

View Towards a Normal Form and a Query Language for Extended Relations Defined by Regular Expressions on the publisher's website for pricing and purchasing information.

Abstract

This paper introduces a generalized data base concept that unites relational and semi structured data models. As an important theoretical result we could find a quadratic decision algorithm for the implication problem of functional and join dependencies defined on the united data model. As practical contribution we presented a normal form for the new data model as a tool for data base design. With our novel representations of regular expressions, a more effective searching method could be developed. XML elements are described by XML schema languages such as a DTD or an XML Schema definition. The instances of these elements are semi-structured tuples. A semi-structured tuple is an ordered list of (attribute: value) pairs. We may think of a semi-structured tuple as a sentence of a formal language, where the values are the terminal symbols and the attribute names are the non-terminal symbols. In the authors' former work (Szabó and Benczúr, 2015) they introduced the notion of the extended tuple as a sentence from a regular language generated by a grammar where the non-terminal symbols of the grammar are the attribute names of the tuple. Sets of extended tuples are the extended relations. The authors then introduced the dual language, which generates the tuple types allowed to occur in extended relations. They defined functional dependencies (regular FD - RFD) over extended relations. In this paper they rephrase the RFD concept by directly using regular expressions over attribute names to define extended tuples. By the help of a special vertex labeled graph associated to regular expressions the specification of substring selection for the projection operation can be defined. The normalization for regular schemas is more complex than it is in the relational model, because the schema of an extended relation can contain an infinite number of tuple types. However, the authors can define selection, projection and join operations on extended relations too, so a lossless-join decomposition can be performed. They extended their previous model to deal with XML schema indicators too, e.g., with numerical constraints. They added line and set constructors too, in order to extend their model with more general projection and selection operators. This model establishes a query language with table join functionality for collected XML element data.

Related Content

Pasi Raatikainen, Samuli Pekkola, Maria Mäkelä. © 2024. 30 pages.
Zhongliang Li, Yaofeng Tu, Zongmin Ma. © 2024. 25 pages.
Zongmin Ma, Daiyi Li, Jiawen Lu, Ruizhe Ma, Li Yan. © 2024. 32 pages.
Lavlin Agrawal, Pavankumar Mulgund, Raj Sharman. © 2024. 37 pages.
Jizi Li, Xiaodie Wang, Justin Z. Zhang, Longyu Li. © 2024. 34 pages.
Amit Singh, Jay Prakash, Gaurav Kumar, Praphula Kumar Jain, Loknath Sai Ambati. © 2024. 25 pages.
Ruizhe Ma, Weiwei Zhou, Zongmin Ma. © 2024. 21 pages.
Body Bottom