Infraction Detection in Database Using Fragmentation

A.S. Pathma Priyaa*, S.Kiruthika**, S.Iyswarya***, N.Rajganesh****
*_**_*** Final Year B.Tech Student, Department of Information Technology, A.V.C College of Engineering, India.
**** Assistant Professor [SL], Department of Information Technology, A.V.C College of Engineering, India.
Periodicity:December - February'2015
DOI : https://doi.org/10.26634/jit.4.1.3278

Abstract

The Real life data is often dirty. To clean the data, efficient algorithms for detecting errors have to be in place. Errors in the data are typically detected as violations of constraints (Data quality rules), such as Functional Dependencies (FDs), Denial Constraints, and Conditional Functional Dependencies (CFDs). When the data is in a centralized database, it is known that two SQL queries be adequate to detect its violations of a set of CFDs. It is increasingly common to find data partitioned vertically or horizontally, and distributed across different sites. This is highlighted by the recent interests in SaaS of Cloud computing, Map Reduce, and columnar DBMS. In the distributed settings, however, it is much harder to detect errors in the data. To find violations in both settings, it is necessary to ship data from one site to another. It is NP-complete, to find violations of CFDs, with minimum data shipment, in a distributed relation that is partitioned either horizontally or vertically. So, the proposed work introduces such incremental algorithms for vertically and horizontally partitioned data, and show that the algorithms are absolute. Further, propose an optimization technique for the incremental algorithm over vertical partitions to reduce data shipment for error detection.

Keywords

Conditional Functional Dependencies, Incremental Algorithm, and Optimization Technique.

How to Cite this Article?

Priyaa. A. S. P, Kiruthika. S, Iyswarya. S and Rajganesh. N (2015). Infraction Detection in Database Using Fragmentation. i-manager’s Journal on Information Technology, 4(1), 6-10. https://doi.org/10.26634/jit.4.1.3278

References

[1]. A. Bernstein and D.-M. W. Chiu. (1981). “Using semijoins to solve relational queries”. ACM, Vol. 28(1).
[2]. A. Gupta and J. Widom. (1993). “Local verification of global integrity constraints in distributed databases”. In SIGMOD.
[3]. A. Kementsietsidis, F. Neven, D. Craen, and S. Vansummeren.(2008). “ Scalable multi-query optimization for exploratory queries over federated scientific databases”. In VLDB.
[4]. Dean and S. Ghemawat. (2004). “MapReduce: Simplified data processing on large clusters”. In OSDI.
[5]. D. Kossman. (2000). “The State of the Art in Distributed Query Processing”. ACM Comput. Surv., Vol 32(4).
[6]. J. Li, A. Deshpande, and S. Khuller. (2009). “Minimizing communication cost in distributed multi-query processing”. In ICDE.
[7]. M. Arenas, L. E. Bertossi, and J. Chomicki, (1999). “Consistent query answers in inconsistent databases,” in Proc. PODS, Philadelphia, PA, USA.
[8]. M. Stonebraker (2005). “C-store: A column- oriented DBMS”. In VLDB.
[9]. M. Garey and D. Johnson. (1979). Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Company.
[10]. S. Agrawal, S. Deb, K. V. M. Naidu, and R. Rastogi, (2007). “Efficient detection of distributed constraint violations,” in Proc. ICDE, Istanbul, Turkey.
[11]. T. Nykiel, M. Potamias, C. Mishra, G. Kollios, and N. Koudas. (2010). MR Share: Sharing across multiple queries in Map Reduce. PVLDB.
[12]. W. Fan, F. Geerts, S. Ma, and H. M uller. (2010). “Detecting inconsistencies in distributed data”. In ICDE.
[13]. W. Fan, J. Li, N. Tang, and W. Yu. (2012). “Incremental detection of inconsistencies in distributed data”. In ICDE.
[14]. W. Fan, F. Geerts, X. Jia, and A. Kementsietsidis, (2008). “Conditional functional dependencies for capturing data. inconsistencies,” ACM Trans. Database Syst., Vol. 33, No. 2, Article 6, Jun 2008.
[15]. W. Fan, F. Geerts, S. Ma, and H. Müller, (2010). “Detecting inconsistencies in distributed data,” in Proc. ICDE, Long Beach, CA, USA.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.