JIT_V5_N2_A1
An Introduction to Data Lake
K.V.N. Rajesh
K.V.N. Ramesh
Journal on Information Technology
2277-5250
5
2
1
4
Data Lake, Schema-on-Write, Schema-on-Read
Now-a-days companies are concentrating on more data to take informed decisions. Companies that are able to effectively use data are the world leaders in terms of wealth, development and growth. Even to survive, operate and compete in this age, organizations need to be able to effectively use their data. Huge amount of investment is made in storing and processing large amounts of data to make better decisions. Data lake is a massive, easily accessible data store/repository that allows for collecting large volumes of structured and unstructured data in its native format from disparate data sources. This paper describes Data Lake, Schema-on-Write, Schema-on-Read, Characteristics and implementation of data lake.
March - May 2016
Copyright © 2016 i-manager publications. All rights reserved.
i-manager Publications
http://www.imanagerpublications.com/Article.aspx?ArticleId=5997