An Introduction to Data Lake

K.V.N. Rajesh*, K.V.N. Ramesh**
* HOD, Department of Information Technology, Vignan’s Institute of Information Technology, Visakhapatnam, India.
** Project Manager, Tech Mahindra, Visakhapatnam, India.
Periodicity:March - May'2016
DOI : https://doi.org/10.26634/jit.5.2.5997

Abstract

Now-a-days companies are concentrating on more data to take informed decisions. Companies that are able to effectively use data are the world leaders in terms of wealth, development and growth. Even to survive, operate and compete in this age, organizations need to be able to effectively use their data. Huge amount of investment is made in storing and processing large amounts of data to make better decisions. Data lake is a massive, easily accessible data store/repository that allows for collecting large volumes of structured and unstructured data in its native format from disparate data sources. This paper describes Data Lake, Schema-on-Write, Schema-on-Read, Characteristics and implementation of data lake.

Keywords

Data Lake, Schema-on-Write, Schema-on-Read.

How to Cite this Article?

Rajesh K. V. N and Ramesh. K. V. N (2016). An Introduction to Data Lake. i-manager’s Journal on Information Technology, 5(2), 1-4. https://doi.org/10.26634/jit.5.2.5997

References

[1]. Rajesh K.V.N. (2008). “Business Intelligence for Enterprises”. The IUP Journal of Information Technology, Vol. 4, No. 1, pp. 45-54.
[2]. Rajesh K.V.N. (2011). “Location Intelligence Mashup Using Open Source Software and Google Maps API”. The IUP Journal of Information Technology, Vol. 7, No. 1, pp. 35-46.
[3]. Rajesh K.V.N. (2013). “Big Data Analytics: Applications and Benefits”. The IUP Journal of Information Technology, Vol. 9, No. 4, pp. 41-51.
[4]. Rajesh K.V.N. (2014).“Business Analytics: Its Application in Various Industry Verticals from Banking to Government”. CSI Communications, Vol. 38, No. 4, pp. 7- 9.
[5]. Rajesh K.V.N. and Ramesh K.V.N. (2014). “A Brief Histor y of BIDW (Business Intelligence and Data Warehousing)”. CSI Communications, Vol. 38, No. 6, pp. 26-28.
[6]. Rajesh K.V.N. and Ramesh K.V.N. (2015). “Security in Business Intelligence Reporting Systems ”. CSI Communications, Vol. 39, No. 4, pp. 35-37.
[7]. James Dixon, (2010). Pentaho, Hadoop and Data Lakes. Retrieved from, https://jamesdixon.wordpress.com /2010/10/14/ pentaho-hadoop-and-data-lakes/
[8]. James Dixon, (2014). Data Lakes Revisited. Retrieved from, https://jamesdixon.wordpress.com/2014/09/25/ data-lakes-revisited/
[9]. James Dixon, (2015). Imagines a Data Lakes that Matter. Retrieved from, http://www.forbes.com/sites /dan woods/2015/01/26/james-dixon-imagines-a-data-lakethat- matters/
[10]. CITO Research: Putting the Data Lake to work A Guide to Best practices”. Retrieved from, http://hortonworks.com /wp-content/uploads/2014/05/Teradata Hortonworks_Da talake_White-Paper _20140410.pdf
[11]. Mark Jacobsohn and Michael Delurey, (2014). How the Data Lake Works? Retrieved from, https://www. Boozal len.com/content/dam/boozallen/documents/Data_ Lake.pdf
[12]. Andrew C. Oliver, (2014). How to create a Data Lake for Fun and Profit? Retrieved from, http://www.infoworld. com/article/2608490/application-development/how-tocreate- a-data-lake-for-fun-and-profit.html
[13]. Steve Jones, (2013). Why Business Needs a Lake for Data Not a wave house? Retrieved from, http:// www.capgemini.com/blog/capping-it-off /2013/12/whybusiness- needs-a-lake-for-data-not-a-warehouse
[14]. Brian Stein and Alan Morrison, (2014). The Enterprise Data Lake: Better Integration and Deeper Analytics. Retrieved from, https://www.pwc.com/us/en/healthindustries/ assets /pwc-tech-forecast-data-lakes.pdf
[15]. “The principles of the Business Data Lake”. Retrieved from, http://pivotal.io/big-data/white-paper/theprinciples -of-the-business-data-lake
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.