JCC_V3_N1_RP2
A Methodology for WebLog Data analysis using HadoopMapReduce and PIG
Durga Prasad P.S.
T. Vivekanandan
A. Srinivasan
Journal on Cloud Computing
2350-1308
3
1
13
17
Hadoop, Embedded Pig, MapReduce, Web Log Data
In the recent time, world is severely facing the problem related to the data storage and processing. Especially, the size of weblog data is exponentially increasing in terms of petabytes and zettabytes. The dependency of weblog data shows its importance on the users' actions on web. To solve and improve the business in all aspects, web data is prominent and hence it is vital. The traditional data management system is not adequate to handle the data in very large size. The Map Reduce programming approach is introduced to deal with the large data processing. In this paper, the authors have proposed a large scale data processing system for analysing web log data through MapReduce programming in Hadoop framework using Pig script. The experimental results show the processing time for classification of different status code in the web log data is efficient, than the traditional techniques.
November 2015 - January 2016
Copyright © 2016 i-manager publications. All rights reserved.
i-manager Publications
http://www.imanagerpublications.com/Article.aspx?ArticleId=8074