JIT_V4_N4_RP4
Preparing Data Sets by Using Horizontal Aggregations in SQL for Data Mining Analysis
K. Sentamilselvan
S. Vinoth Kumar
A. Jeevanantham
Journal on Information Technology
2277-5250
4
4
33
41
Data Mining, Data Set, SQL, Horizontal Aggregation, BY-LOGIC, CASE, GROUP BY, Query Evaluation, Vertical
Aggregation
Data Mining is one of the emerging fields in research. Preparing a Data set is one of the important tasks in Data Mining. To analyze data efficiently, Data Mining systems are widely using datasets with columns in horizontal tabular layout. Building a datasets for analysis is normally a most time consuming task. Existing SQL aggregations have limitation to build data sets because they return one column for aggregated group using group functions. A method is developed to generate SQL code to return aggregated columns in a horizontal tabular layout, returning a set of numbers instead of one number per row. This new class of functions are called horizontal aggregations. This method is termed as BY-LOGIC. SQL code generator generates automatic SQL code for producing horizontal aggregation. A fundamental method to evaluate horizontal aggregation called CASE (exploiting the case programming construct) is used. Basically, there are three parameters available namely: grouping, sub-grouping and aggregating fields for creating horizontal aggregation. Query evaluation shows that CASE method responses faster than BY-LOGIC method.
September - November 2015
Copyright © 2015 i-manager publications. All rights reserved.
i-manager Publications
http://www.imanagerpublications.com/Article.aspx?ArticleId=3646