i-manager Publications

i-manager's Journal on Cloud Computing (JCC)

Current Issue Vol. 12 Issue 2

Ubiquitous Learning: Refining Education in the Digital Age
A Hybrid Approach for Resource Utilization Prediction in Cloud Data Center using Functional Link Neural Network and Convolutional Neural Network
An Intelligent QoS-Aware Resource Optimization Framework for Performance-Efficient Cloud Computing Environments
Federated Learning-Based IoT Security Model for Privacy Preserving Analytics
Pathways for Preserving Indigenous Knowledge through Cloud Technology for Sustainable Education

Most Cited

Volume 3 Issue 1 November - January 2016 [Open Access]

Research Paper

Clustering of Summarizing Multi-Documents (Large Data) by Using MapReduce Framework

K. Thirumalesh* , Srinivasulu Asadi**

* Research Scholar, Department of Information Technology, Sree Vidyanikethan Engineering College, Tirupathi, India.

** Associate Professor, Department of Information Technology, Sree Vidyanikethan Engineering College, Tirupathi, India

Thirumalesh, K., and Asadi, S. (2016). Clustering Of Summarizing Multi-Documents (Large Data) By Using MapReduce Framework. i-manager’s Journal on Cloud Computing.,3(1), 1-12.

Abstract

Multi document summarization differs from the single document. Issues of compression, speed, and redundancy and passage selection are critical in the form of useful summaries. A collection of different documents is given to a variety of summarization methods based on different strategies to extract the most important sentences from the original document. LDA (Latent Dirichlet Allocation) topic modeling technique is used to divide the documents topic wise for summarizing the large text collection over the MapReduce framework. Compression ratio, retention ratio, Rouge and Pyramid score are different summarization parameters used to measure the performance of the summarizing documents. Semantic similarity and clustering methods are used efficiently for generating the summary of large text collections from multiple documents. Summarizing multi documents is a time consuming problem and it is a basic tool for understanding the summary. The presented method is compared with the MapReduce framework based k-means clustering algorithm applied on Four Multi-document summarization methods. Support for multilingual text summarization is provided over the MapReduce framework in order to provide the summary generation from the text document collections available in different languages.

References

[1]. N.K Nagwani, (2015). “Summarizing Large Text Collection Using Topic Modeling and Clustering Based on MapReduce Framework”. Journal of Big Data, Springer Open Journal, DOI:10.1186\S40537-015-0020-5.

[2]. Zhang G., and Zhang M., (2013). “The Algorithm of Data Preprocessing in Web Log Mining Based on Cloud Computing”. In 2012 International Conference on Information Technology and Management Science (ICITMS 2012) Proceedings, Springer, Berlin, Heidelberg, Germany, pp. 467–474.

[3]. Morales GDF, Gionis A., and Sozio M., (2011). “Social Content Matching in MapReduce”. Proceedings of the VLDB Endowment, Vol. 4, No. 7, pp. 460-469.

[4]. Verma A., Llora X., Goldberg DE., and Campbell RH, (2009). “Scaling Genetic algorithms using MapReduce Intelligent Systems Design and Application (ISDA)”. Ninth International Conference, Pisa, Italy, pp 13–18.

[5]. Cambria E., Rajagopal D., Olsher D., and Das D., (2013). “Big Social Data Analysis”. Big Data Computing Chapter, Vol. 13, pp. 401-414.

[6]. Lieberman M., (2014). “Visualizing Big Data: Social Network Analysis”. Digital Research Conference, San Antonio, Texas, pp. 1-23.

[7]. López V., Río S.D., Benítez J.M, and Herrera F., (2014). “Cost-Sensitive Linguistic Fuzzy Rule Based Classification Systems Under the MapReduce Framework for Imbalanced Big Data”. Fuzzy Sets Syst, Vol. 1, pp. 1-34.

[8]. Blanas S., Patel J.M, Ercegovac V., Rao J., Shekita E.J, and Tian Y., (2010). “A Comparison of Join Algorithms for Log Processing in MapReduce”. Proc. of the 2010 ACM SIGMOD International Conference on Management of Data, New York, USA, pp. 975-986.

[9]. Hoi SCH, Wang J., Zhao P., and Jin R., (2012). “Online st Feature Selection for Mining Big Data”. Proc. of the 1 International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, ACM, New York, USA, pp. 93–100.

[10]. Chen S.Y, Li J.H, Lin K.C, Chen H.M, and Chen T.S., (2013). “Using MapReduce Framework for Mining Association Rules ”. In Information Technology Convergence Springer, Netherlands, pp. 723–731.

[11]. Urbani J., Maassen J., and Bal H., (2010). “Massive Semantic Web data compression with MapReduce”. th Proc. of the 19 ACM International Symposium on High Performance Distributed Computing, New York, USA, pp. 795–802.

[12]. Rajdho A., and Biba M., (2013). “Plugging Text Processing and Mining in a Cloud Computing Framework”. In Internet of Things and Inter-cooperative Computational Technologies for Collective Intelligence Springer, Berlin, Heidelberg, Germany, pp. 369–390.

[13]. Balkir A.S, Foster I., and Rzhetsky A., (2011). “A Distributed Look-up Architecture for Text Mining Applications using MapReduce”. High Performance. Computing, Networking, Storage and Analysis (SC), 2011 International Conference, Seattle, US, pp. 1–11

[14]. Zongzhen H., Weina Z., and Xiaojuan D., (2013). “A Fuzzy Approach to Clustering of Text Documents Based on MapReduce”. In Computational and Information Sciences (ICCIS), 2013 Fifth International Conference on IEEE. Shiyang, China, pp. 666-669.

[15]. Chen F., and Hsu M., (2013). “A Performance Comparison of Parallel DBMSs and MapReduce on Large Scale Text Analytics”. Proc. of the 16^th International Conference on Extending Database Technology ACM, New York, USA, pp. 613-624.

[16]. Das T.K, and Kumar P.M., (2013). “Big Data Analytics: A Framework for Unstructured Data Analysis”. International Journal of Engineering and Technology (IJET), Vol. 5, No. 1, pp. 153-156.

[17]. Momtaz A., and Amreen S., (2012). “Detecting Document Similarity in Large Document Collection using MapReduce and the Hadoop Framework”. BS Thesis. BRAC University, Dhaka, Bangladesh, pp. 1–54.

[18]. Lin J., and Dyer C., (2010). “Data-Intensive Text Processing with MapReduce”. Morgan & Claypool Publishers, Vol. 3, No. 1, pp. 1-177.

[19]. Elsayed T., Lin J., and Oard D.W., (2008). “Pairwise Document Similarity in Large Collections with MapReduce”. Proc. of the 46^th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies, Stroudsburg, US, pp. 265–268.

[20]. Galgani F., Compton P., and Hoffmann A., (2012). “Citation Based Summarisation of Legal Texts”. Proc. of 12^th Pacific Rim International Conference on Artificial Intelligence, Kuching, Malaysia, pp. 40–52

[21]. Hassel M., (2004). “Evaluation of Automatic Text Summarization”. Licentiate Thesis, Stockholm, Sweden, pp. 1–75.

[22]. Wang Y., Bai H., Stanton M., Chen W.Y, and Chang E.Y., (2009). “PLDA: Parallel Latent Dirichlet Allocation for Large-Scale Applications”. 5^th International Conference, A AIM (Algorithmic Aspects in Information and Management), San Francisco, CA, USA, pp. 309–322.

[23]. Hu Q., and Zou X., (2011). “Design and implementation of multi-document automati c summarization using MapReduce”. Computer Engineering and Applications, Vol. 47, No. 35, pp. 67–70.

[24]. Lai C., and Renals S., (2014). “Incorporating Lexical and Prosodic Information at Different Levels for Meeting Summarization”. Proceedings of the 15^th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014. ISCA, Singapore, pp. 1875–1879.

[25]. M. Cannataro and D. Talia, (2004). “Semantics and Knowledge Grids: Building the Next-generation Grid”. Intelligent Systems, IEEE, Vol. 19, No. 1, pp. 56–63.

[26]. S. Wang, H.-J. Wang, X.-P. Qin, and X. Zhou, (2011). “Architecting Big Data: Challenges, Studies and Forecasts”. Jisuanji Xuebao (Chinese Journal of Computers), Vol. 34, No. 10, pp. 1741–1752.

[27]. K. Chen and W.-M. Zheng, (2009). “Cloud Computing: System Instances and Current Research”. Journal of Software, Vol. 20, No. 5, pp. 1337–1348.

[28]. J. Dean and S. Ghemawat, (2010). “MapReduce: A Flexible Data Processing Tool”. Communications of the ACM, Vol. 53, No. 1, pp. 72–77.

[29]. W. Xi-Zhao, (2003). “Optimization of k-means Clustering by Feature Weight Learning”. Journal of Computer Research and Development, Vol. 6.

[30]. H.-G. Li, G.-Q. Wu, X.-G. Hu, J. Zhang, L. Li, and X. Wu, (2011). “K-means Clustering with Bagging and th MapReduce”. In System Sciences (HICSS), 2011 44 Hawaii International Conference on IEEE, pp. 1–8.

[31]. Steve L., (2012). “The Age of Big Data”. Big Data's Impact in the World, New York, USA, pp. 1–5.

[32]. Lee K.H, Lee Y.J, Choi H,, Chung Y.D, and Moon B., (2011). “Parallel Data Processing with MapReduce: A Survey”. ACM SIGMOD Record, Vol. 40, No. 4, pp.11–20.

[33]. Fowkes J., Ranca R., Allamanis M., Lapata M., and Sutton C., (2014). “Autofolding for Source Code Summarization”. Computing Research Repository, 1403(4503): pp. 1-12.

[34]. Tzouridis E., Nasir J.A, Lahore LUMS, and Brefeld U., (2014). “Learning to Summarise Related Sentences”. The 25^thInternational Conference on Computational Linguistics (COLING'14), Dublin, Ireland, pp. 1–12, ACL

[35]. Wang Y., Bai H., Stanton M., Chen W.Y, and Chang E.Y, (2009). “PLDA: Parallel Latent Dirichlet Allocation for Large-Scale Applications”. 5^th International Conference, A AIM (Algorithmic Aspects in Information and Management), San Francisco, CA, USA, pp. 309–322.

[36]. Miller G.A., (1995). “WordNet: A Lexical Database for English”. Commun ACM, Vol. 38, No. 11, pp. 39-41.

[37]. Blei D.M, Ng AY, and Jordan M.I, (2003). “Latent Dirichlet Allocation”. The Journal of Machine Learning Research, Vol. 3, pp. 993–1022.

[38]. Feldman R., and Sanger J., (2007). The Text Mining Handbook-Advanced Approaches in Analyzing Unstructured Data. Press, Cambridge University, ISBN 978- 0-521-83657-9

[39]. McCallum A.K., (2002). “Mallet: A Machine Learning for Language Toolkit”. Retrieved from http://mallet. cs.umass.edu/ on 10 May 2014.

[40]. Galgani F., Compton P., and Hoffmann A., (2012). “Combining Different Summarization Techniques for Legal Text”. Proc. of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data. Association for Computational Linguistics, Avignon, France, pp. 115–123.

[41]. Galgani F., Compton P., and Hoffmann A., (2014). “HAUSS: Incrementally Building a Summarizer Combining Multiple Techniques”. Int. J. Human-Computer Studies, Vol. 72, pp. 584–605.

[42]. Li W., (1992). “Random Texts Exhibit Zipf's-Law-Like Word Frequency Distribution”. IEEE Trans Inf Theory, Vol. 38, No. 6, pp. 1842–1845.

[43]. Reed W.J., (2001). “The Pareto, Zipf and Other Power Laws”. Econ Lett, Vol. 74, No. 1, pp.15–19.

[44]. Goldstein J., Mittal V., Carbonell J.G, and Kantrowitz M., (2000). “Multi-Document Summarization By Sentence Extraction”. School of Computer Science, Carnegie Mellon University, Research Showcase, pp. 40–48.

[45]. Lin C.Y., (2004). “Rouge: a Package for Automatic Evaluation of Summaries”. In: Out TSB (ed) Proceedings of the ACL-04 Workshop Association for Computational Linguistics, Barcelona, Spain, pp. 74–81.

[46]. Nenkova A., and Passonneau R., (2004). “Evaluating Content Selection in Summarization: The Pyramid Method”. Proc. Human Language Technology Conf. North Am, Chapter of the Assoc. for Computational Linguistics (HLT-NAACL), Boston, Massachusetts, pp. 145–152.

[47]. Harnly A., Nenkova A., Passonneau R., and Rambow O., (2005). “Automation of Summary Evaluation by the Pyramid Method”. In Recent Advances in Natural Language Processing (RANLP), Borovets, Bulgaria, pp. 226–232.

[48]. Qazvinian V., and Radev D.R., (2008). “Scientific Paper Summarization Using Citation Summary Networks”. nd Proceedings of the 22 International Conference on Computational Linguistics, Vol. 1, Stroudsburg, PA, pp. 689–696.

[49]. Wang D., and Li T., (2012). “Weighted Consensus Multi-document Summarization”. Inf Process Manag, Vol. 48, pp. 513–523.

[50]. Amdahl G.M., (1967). “Validity of the Single Processor Approach to Achieving Large Scale Computing Capabilities”. Proceedings of the April 18–20, 1967, Spring Joint Computer Conference, Atlantic City, New Jersey, USA, pp. 483–485.

Full Article (HTML)

Pdf

Research Paper

A Methodology for WebLog Data analysis using HadoopMapReduce and PIG

Durga Prasad P S* , T. Vivekanandan, A.Srinivasan*

* PG Scholar, Department of Computer Science and Engineering, SITAMS, Chittor, Andhra Pradesh, India.

-* Associate Professor, Department of Computer Science and Engineering, SITAMS, Chittor, Andhra Pradesh, India.

Prasad, P. S. D., Vivekanandan, T., and Srinivasan, A. (2016). A Methodology for WebLog Data analysis using HadoopMapReduce and PIG. i-manager’s Journal on Cloud Computing, 3(1), 13-17.

Abstract

In the recent time, world is severely facing the problem related to the data storage and processing. Especially, the size of weblog data is exponentially increasing in terms of petabytes and zettabytes. The dependency of weblog data shows its importance on the users' actions on web. To solve and improve the business in all aspects, web data is prominent and hence it is vital. The traditional data management system is not adequate to handle the data in very large size. The Map Reduce programming approach is introduced to deal with the large data processing. In this paper, the authors have proposed a large scale data processing system for analysing web log data through MapReduce programming in Hadoop framework using Pig script. The experimental results show the processing time for classification of different status code in the web log data is efficient, than the traditional techniques.

References

[1]. Siddharth Adhikari, Devesh Saraf, Mahesh Revanwar, and Nikhil Ankam, (2014). “Analysis of Log Data and Statistics Report Generation using Hadoop”. In IJIRCCE, Vol. 2, No. 4.

[2]. Thanakorn Pamutha, Siriporn Chimphlee and Chom Kimpan, (2012). “Data Pre-processing on Web Server Log Files for Mining Users Access Patterns”. International Journal of Research and Reviews in Wireless Communications, Vol. 2, No. 2, ISSN: 2046-6447.

[3]. Natheer Khasawneh and Chien-Chung Chan, (2006). “Active User-Based and Ontology-Based Web Log Data Pre-processing for Web Usage Mining”. Proceedings of the IEEE International Conference on Web Intelligence.

[4]. Murat Ali Bayir, and Ismail Hakki Toroslu, (2009). “Smart Miner: A New Framework for Mining Large Scale Web Usage Data”. WWW 2009, ACM, Madrid, Spain, 978- 1-60558-487-4/09/04, April 20–24, 2009.

[5]. P. Srinivasa Rao, K. Thammi Reddy and MHM. Krishna Prasad, (2013). “A Novel and Efficient Method for Protecting Internet Usage from Unauthorized Access using MapReduce”. International Journal of Information Technology and Computer Science, Vol. 3, pp. 49-55.

[6]. Sayalee Narkhede and Tripti Baraskar, (2013). “HMR Log Analyzer: Analyze Web Application Logs over Hadoop MapReduce”. International Journal of UbiComp (IJU), Vol. 4, No. 3.

[7]. Ramesh Rajamanickam and C. Kavitha, (2013). “Fast Real Time Analysis of Web Server Massive Log Files using an Improved Web Mining Architecture”. Journal of Computer Science, Vol. 9, No. 6, pp. 771-779, ISSN: 1549- 3636.

[8]. NASA-HTTP, Web Logs Files. Retrieved from Http://ita. ee.lbl.gov/html/contrib/Saskatchawan-HTTP. html

[9]. Tom White, (2015). Hadoop: The Definitive Guide, Fourth Edition, ISBN: 978-1-449-31152-0 1327616795, 2015.

[10]. Naseera Shaik, T. Vivekanandan and K V Madhu Murthy, (2008). “Data Replication using Experience Based Trust in a Data Grid Environment”. Distributed Computing and Internet Technology, Springer, Berlin, Heidelberg, Vol. 5375, pp. 39-50.

[11]. Economist, (2016). Data, Data Everywhere. th Retrieved from http://www.economist.com /node, on 13 July 2016.

[12]. Hadoop, (2016). Welcome to Apache Hadoop. Retrieved from https://hadoop.apache.org.

[13]. Doug Cutting, “Hadoop Overview”. Retrieved from http://research. yahoo.com/node/2116

[14]. “PIG”, https://pig.apache.org.

[15]. Alan Gates, (2011). Programming PIG, O'reilly- First Edition.

[16]. Naseera Shaik, T. Vivekanandan and K V Madhu Murthy, (2008). “Trust Based Data Replication Strategy in a Data Grid Environment”. In Proceedings of International Conference on Information processing (ICIP), Banglore.

Full Article (HTML)

Pdf

Research Paper

An Effective Feature Selection Technique for Mining High Dimensional Data on Bigdata

K. Bhaskar Naik* , S.P Sindhuja**

* Assistant Professor, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

** PG Scholar, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

Naik, K. B., and Sindhuja, S. P. (2016). An Effective Feature Selection Technique for Mining High Dimensional Data on Bigdata. i-manager’s Journal on Cloud Computing, 3(1), 18-23.

Abstract

In the recent years, many research innovations have come into foray in the area of big data analytics. Advanced analysis of the big data stream is bound to become a key area of data mining research as the number of applications requiring such processing increases. Big data sets are now collected in many fields eg., Finance, business, medical systems, internet and other scientific research. Data sets rapidly increase their size as they are often generated in the form of incoming stream. Feature selection has been used to lighten the processing load in inducing a data mining model, but mining a high dimensional data becomes a tough task due to its exponential growth of size. This paper aims to compare the two algorithms, namely Particle Swarm Optimization and FAST algorithm in the feature selection process. The proposed algorithm FAST is used in order to reduce the irrelevant and redundant data, while streaming high dimensional data which would further increase the analytical accuracy for a reasonable processing time.

References

[1]. J. Kennedy and Eberhart, (1995). “Particle Swarm Optimization”. IEEE Transactions on International Conference on Neural Networks, Piscataway, NJ, pp. 1942-1948.

[2]. David Waha and Richard L. Bankert, (1996). “A Comparative Evaluation of Sequential Feature Selection Algorithms”. Springer-Verlag, pp. 199-206.

[3]. Anil Jain and Douglas Zongker, (1997). “Feature Selection: Evaluation, Application and Small Sample Performance”. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.19, No. 2, pp. 153-158.

[4]. M. Dash, and H. Liu, (1997). “Feature Selection for Classification”. Elsevier, Intelligent Data Analysis, Vol. 1, pp. 131-156.

[5]. Huan Liu and Rudy Setiono, (1997). “Chi2: Feature Selection and Discretization of Numeric Attributes”. IEEE Transactions, pp. 388-391.

[6]. Yuhui Shi and Russell Eberhart, (1998). “A Modified Particle Swarm Optimizer”. IEEE Transactions, Evolutionary Computation Proceedings, pp. 69-73.

[7]. MohdSaberi Mohamad, Safaai Deris, Safie Mat Yatim, and Muhammad Razib Othman, (2004). “Feature Selection Method using Genetic Algorithm for Classification of Small and High Dimension Data”. First International Symposium on Information and Communications Technologies, pp. 7-8.

[8]. Muhammad Imran, Rathiah Hashima and Noor Elaiza AbdKhalidb, (2013). “An Overview of Particle Swarm Optimization”. Elsevier, Procedia Engineering , Vol. 53, pp. 491-496.

[9]. S Gracia Galan, R P Prado, Je Munoz Esposito, (2015). “Rules Discovery in Fuzzy Classifier System with PSO for Scheduling in Grid Computational Infrastructure”. Elsevier, Applied Soft Computing, Vol. 29, pp. 424-435.

[10]. Simon Fong, Raymond Wong, and Athanasios V. Vasilakos, (2015). “Accelerated PSO Swarm Search Feature Selection for Data Stream Mining Big Data”. IEEE Transactions on Service Computing, Vol. 9, No. 1, pp. 33- 45.

Full Article (HTML)

Pdf

Research Paper

Enhanced E-tree for Mining High Dimensional Data

S. Salam* , M. Roja, T. V. Rao*

* Associate Professor, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

PG Scholar, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

* Professor, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

Salam, S., Roja, M., and Rao, T. V. (2016). Enhanced E-tree for Mining High Dimensional Data. i-manager’s Journal on Cloud Computing, 3(1), 24-29.

Abstract

Data Stream classification is one of the critical tasks in data mining. At the point when DataStream touches the base at a pace of GB/sec, we need to recognize spam, web observing and capacity. It is a troublesome operation and falls flat in the existing System. Actualizing two Algorithms namely, E-tree Algorithm (Ensemble-tree) and Avaricious Algorithm and Executing E-tree algorithm, the authors have maintained a strategic distance from the existing issues. Ensemble tree (Etree) takes care of extensive volumes of stream data and drifting. E-tree, Classifies and groups the Data Stream and stores the data effectively. Furthermore, foresee web checking and spam identification precisely. Controlling the web movement, the authors have actualized the greedy algorithm.

References

[1]. ChuanZhou, (2015). “E-Tree: An Efficient Indexing Structure for Ensemble Models on Data streams”. IEEE Transactions on Knowledge and Data Engineering, Vol.27, No.2.

[2]. J. Gao, R. Sebastiao, and P. Rodrigues, (2009). “Issues in Evaluation of Stream Learning Algorithms”. In KDD 2009. pp.329-338.

[3]. H. Yu, L. Ko, K. Y, S. Hwang, and W. Han, (2011). “Exact Indexing for Support Vector Machines”. In SIGMOD 2011. pp. 709-720.

[4]. Z. Lu, X. Wu, X. Zhu, and J. Bongard, (2010). “Ensemble Pruning Via Individual Contribution Ordering”. In KDD 2010. pp. 871-880.

[5]. A. Machanavajjhala, E. Vee, M. Garofalakis, and J. Shanmugasundaram, (2008). “Scalable Ranked Publish Subscribe”. In VLDB 2008. Vol. 1, No. 1, pp. 451-462.

[6]. Y. Zhang, S. Burer, and W. Street, (2007). “Ensemble Pruning Via Semi Definite Programming”. Journal of Machine Learning Research, Vol. 7, pp. 1315-1338.

[7]. C. Domeniconi and D. Gunopulos, (2011). “Incremental Support Vector Machine Construction”. In ICDM 2011, pp. 589-592.

[8]. Y. Tao and D. Papadias, (2014). “Performance Analysis of R*-trees with Arbitrary Node Extents”. IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No. 6, pp. 653-668.

[9]. A. Guttman, (1984). “R-Trees: A Dynamic Index Structure for Spatial Searching”. Proc. ACM SIGMOD, pp. 47-57.

[10]. P. Domingos and G. Hulten, (2000). “Mining High- Speed Data Streams”. Proc. Sixth ACM SIGKDD Int'l Conf. Knowledge Discovery and DataMining (KDD), pp. 71-80.

[11]. C. Domeniconi and D. Gunopulos, (2001). “Incremental Support Vector Machine Construction”, Proc. IEEE Int'l Conf. Data Mining (ICDM).

Full Article (HTML)

Pdf

Review Paper

A Survey on Energy Aware Job Scheduling Algorithms in Cloud Environment

Shaik Naseera* , P. Jyotheeswai**

* Associate Professor, Department of Computing Science and Engineering, VIT University, Vellore, India.

** Associate Professor, Department of Computing Science and Engineering, SVCET, Chittoor, India.

Naseera, S., and Jyotheeswai, P. (2016). A Survey on Energy Aware Job Scheduling Algorithms in Cloud Environment. i-manager’s Journal on Cloud Computing, 3(1), 30-36.

Abstract

Now-a-days there is a lot of attention to cloud computing by the Research community. Cloud computing is a platform that supports the sharing of resources, communication and storage capacity over the internet. The primary benefit of moving to the Clouds is application scalability. It provides virtualized resources and are built on the base of Grid & distributed computing. Cloud computing is also environmental friendly framework. It benefits from the efficient utilization of resources and optimal scheduling algorithms. The growth of internet based applications demands the need for the development of algorithms that cope with the escalation in energy consumption and reduce the operational cost and emission of CO gases. In this paper, the authors present a review on energy aware job scheduling algorithms existing 2 in the literature. This paper helps the readers to understand the functionality and parameters focus of various energy aware scheduling algorithms available in the literature.

References

[1]. A Quarati, A Clematis, A Galizia, and D D'Agostino, (2013). “Hybrid Clouds Brokering: Business Opportunities, QoS and Energy-saving Issues”. Simul. Model. Pract. Theory, Vol. 39, No. 2, pp. 121-134.

[2]. A Quarati, D D'Agostino, A Galizia, M Mangini, and A Clematis, (2012). “Delivering cloud services with QoS th requirements: an opportunity for ICT SMEs”. In 9 International Conference on Economics of Grids, Clouds, Systems, and Services, (Springer, Berlin, 2012), pp. 197- 211.

[3]. R. Brown, (2007). Report to Congress on Server and Data Center Energy Efficiency Public Law 109-431. U.S. Environ. Protection Agency, Washington, DC, USA.

[4]. J. Koomey, (2007). Growth in Data Center Electricity Use 2005 to 2010. Oakland, CA, USA: Analytics Press.

[5]. G. Meijer, (2010). “Cooling Energy-Hungry Data Centers”. Science, Vol. 328, No. 5976, pp. 318–319.

[6]. LalShriVratt Singh, Jawed Ahmed, and AsifKhan, (2014). ”An Algorithm to Optimize the Traditional Backfill Algorithm Using Priority of Jobs for Task Scheduling Problems in Cloud Computing”. International Journal of Computer Science and Information Technologies, Vol. 5, No. 2, pp. 1671-1674.

[7]. Jinn-Tsong Tsai, Jia-Cen Fang and Jyh-Horng Chou, (2013). “Optimized Task Scheduling and resources allocation on cloud Computing environment using improved differential evolution Algorithm”. Elsevier, Computer of operations Research, Vol. 40, pp. 3045- 3055.

[8]. Chia-Ming Wu, Ruay-Shiung Chang, and Hsin-Yu Chan, (2014). “A Green Energy-Efficient Scheduling Algorithm Using the DVFS Technique for Cloud Data Centers”. Future Generation Computer Systems, Vol. 37, pp. 141–147.

[9]. Jiayin Li, Meikang Qiu, Zhong Ming , Gang Quan, Xiao Qin, and Zonghua Gu, (2012). “Online Optimization for Scheduling Preemptive Tasks on IAAS Cloud Systems”. J. Parallel Distribute Computing, Vol. 72, pp. 666-677.

[10]. Baomin Xu, Chunyan Zhao, Enzhao Hu, and Bin Hu, (2011). “Job Scheduling Algorithm Based on Berger Model in Cloud Environment”. Advances in Engineering Software, Vol. 42, pp. 419-425.

[11]. Deepak Poola, Kotagiri Ramamohanarao, and Raj kumar Buyya, (2014). “Fault-Tolerant Workflow Scheduling th Using Spot Instances on Clouds”. ICCS 2014, 14 International Conference on Computational Science, Vol. 29, pp. 523- 533.

[12]. Wei Liu, Wei Du, Jing Chen, Wei Wang, and Guo Sun Zeng, (2014). “Adaptive Energy-Efficient Scheduling Algorithm for Parallel Tasks on Homogeneous Clusters” Journal of Network and Computer Applications, Vol. 41, pp. 101-113.

[13]. Li, K., et al. (2011). “Cloud Task Scheduling Based on th Load Balancing Ant Colony Optimization”. 6 Annual China Grid Conference, Dalian, pp. 22-23.

[14]. Dutta, D. and Joshi, R.C. (2011). “A Genetic- Algorithm Approach to Cost-Based Multi-QoS Job Scheduling in Cloud Computing Environment ”. International Conference and Workshop on Emerging Trends in Technology (ICWET 2011)- TCET, Mumbai, pp. 25- 26.

[15]. Palmieri, F., Buonanno, L., Venticinque, S., Aversa, R. and Di Martino, B., (2013). “A Distributed Scheduling Framework Based on Selfish Autonomous Agents for Federated Cloud Environments”. Future Generation Computer Systems, Vol. 29, pp. 1461-1472. http://dx.doi.org /10.1016/j.future.2013.01.012 http://dx.doi.org/10.1016/j.proeng.2011.08. 626

[16]. Ghanbari, S. and Othman, M. (2012). “A Priority Based Job Scheduling Algorithm in Cloud Computing”. Procedia Engineering, Vol. 50, pp. 778-785.

[17]. Zhang, Y.H., Feng, L. and Yang, Z. (2011). “Optimization of Cloud Database Route Scheduling Based on Combination of Genetic Algorithm and Ant Colony Algorithm”. Procedia Engineering, Vol. 15, pp. 3341-3345.

[18]. Sen Su, Jian Li, Qingjia Huang, Xiao Huang, Kai Shuang, and Jie Wang, (2013). “Cost-Efficient Task Scheduling for Executing Large Programs in the Cloud”. Science Direct, Parallel Computing, Vol. 39, pp. 177-188.

[19]. Ying feng B, Lin Zhang A, and T.W. Liao, (2014). “CLPS-GA: A Case Library and Pareto Solution-based Hybrid Genetic Algorithm for Energy Aware Cloud Service Scheduling”. Science Direct, Applied Soft Computing, Vol. 19, pp. 264–279.

[20]. Cui Lin, and Shiyong Lu, (2011). “Scheduling Scientific Workflows Elastically for Cloud Computing”. IEEE th 4 International Conference on Cloud Computing.

[21]. Dhinesh Babu L.D.A, P. Venkata Krishnab et al., (2013). “Honey Bee Behavior Inspired Load Balancing of Tasks in Cloud Computing Environments”. Science Direct, Ap

i-manager's Journal on Cloud Computing (JCC)

Current Issue Vol. 12 Issue 2

Most Read

Most Cited

Volume 3 Issue 1 November - January 2016 [Open Access]

Clustering of Summarizing Multi-Documents (Large Data) by Using MapReduce Framework

K. Thirumalesh* , Srinivasulu Asadi**

* Research Scholar, Department of Information Technology, Sree Vidyanikethan Engineering College, Tirupathi, India. ** Associate Professor, Department of Information Technology, Sree Vidyanikethan Engineering College, Tirupathi, India

Thirumalesh, K., and Asadi, S. (2016). Clustering Of Summarizing Multi-Documents (Large Data) By Using MapReduce Framework. i-manager’s Journal on Cloud Computing.,3(1), 1-12.

Abstract

References

Full Article (HTML)

Pdf

A Methodology for WebLog Data analysis using HadoopMapReduce and PIG

Durga Prasad P S* , T. Vivekanandan**, A.Srinivasan***

* PG Scholar, Department of Computer Science and Engineering, SITAMS, Chittor, Andhra Pradesh, India. **-*** Associate Professor, Department of Computer Science and Engineering, SITAMS, Chittor, Andhra Pradesh, India.

Prasad, P. S. D., Vivekanandan, T., and Srinivasan, A. (2016). A Methodology for WebLog Data analysis using HadoopMapReduce and PIG. i-manager’s Journal on Cloud Computing, 3(1), 13-17.

Abstract

References

Full Article (HTML)

Pdf

An Effective Feature Selection Technique for Mining High Dimensional Data on Bigdata

K. Bhaskar Naik* , S.P Sindhuja**

* Assistant Professor, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India. ** PG Scholar, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

Naik, K. B., and Sindhuja, S. P. (2016). An Effective Feature Selection Technique for Mining High Dimensional Data on Bigdata. i-manager’s Journal on Cloud Computing, 3(1), 18-23.

Abstract

References

Full Article (HTML)

Pdf

Enhanced E-tree for Mining High Dimensional Data

S. Salam* , M. Roja**, T. V. Rao***

Salam, S., Roja, M., and Rao, T. V. (2016). Enhanced E-tree for Mining High Dimensional Data. i-manager’s Journal on Cloud Computing, 3(1), 24-29.

Abstract

References

Full Article (HTML)

Pdf

A Survey on Energy Aware Job Scheduling Algorithms in Cloud Environment

Shaik Naseera* , P. Jyotheeswai**

* Associate Professor, Department of Computing Science and Engineering, VIT University, Vellore, India. ** Associate Professor, Department of Computing Science and Engineering, SVCET, Chittoor, India.

Naseera, S., and Jyotheeswai, P. (2016). A Survey on Energy Aware Job Scheduling Algorithms in Cloud Environment. i-manager’s Journal on Cloud Computing, 3(1), 30-36.

Abstract

References

Full Article (HTML)

Pdf

* Research Scholar, Department of Information Technology, Sree Vidyanikethan Engineering College, Tirupathi, India.

** Associate Professor, Department of Information Technology, Sree Vidyanikethan Engineering College, Tirupathi, India

Durga Prasad P S* , T. Vivekanandan, A.Srinivasan*

* PG Scholar, Department of Computer Science and Engineering, SITAMS, Chittor, Andhra Pradesh, India.

-* Associate Professor, Department of Computer Science and Engineering, SITAMS, Chittor, Andhra Pradesh, India.

* Assistant Professor, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

** PG Scholar, Department of Computer Science and Engineering, Sree Vidyanikethan Engineering College, Tirupati, India.

S. Salam* , M. Roja, T. V. Rao*

* Associate Professor, Department of Computing Science and Engineering, VIT University, Vellore, India.

** Associate Professor, Department of Computing Science and Engineering, SVCET, Chittoor, India.