International Journal of Scientific & Technology Research

Home About Us Scope Editorial Board Blog/Latest News Contact Us
10th percentile
Powered by  Scopus
Scopus coverage:
Nov 2018 to May 2020


IJSTR >> Volume 9 - Issue 5, May 2020 Edition

International Journal of Scientific & Technology Research  
International Journal of Scientific & Technology Research

Website: http://www.ijstr.org

ISSN 2277-8616

Enhancing Data Processing And Management For Big-Data Intensive Applications Based On Cloud Computing

[Full Text]






Big data; Data management; Cloud Computing; Map-Reduce; Hadoop; Data Workflow



An important factor that acts as a hindrance to the greater adoption of clouds for scientific computing is data management. The reason behind this is that data-intensive scientific workflow does not possess support for handling data workflow. At present, cloud computing handles data workflow by employing application overlays which map the output of a specific task to another input of specific tasks which may be in pipeline order and this technique was enhanced by MapReduce programming such as Amazon Elastic MapReduce, Hadoop on Azure - HDInsight. To resolve these challenges for managing data, an approach is proposed which can enhance virtual disks. The proposed approach enhances data workflow and management based on the cloud platform.



[1]. Cloud Service : https://rajivramachandran.wordpress.com/2012/06/19/cloud-service-models-iaas-vs-paas-vs-saas/
[2]. Big Data Architecture https://en.wikipedia.org/wiki/Big_data#Architecture
[3]. Tianqi Chen and Carlos Guestrin. Xgboost: A scalable tree boosting system." In: arXiv preprint arXiv:1603.02754 (2016).
[4]. K. Liu, H. Jin, J. Chen, X. Liu, D. Yuan, Y. Yang, A Compromised-Time-Cost Scheduling Algorithm in SwinDeWC for Instance-Intensive Cost-Constrained Workflows on a Cloud Computing Platform, Int. J. High Perform. Comput.Appl.
[5]. K. Kaur, A. Chhabra, G. Singh, Heuristics based genetic algorithm for scheduling static tasks in homogeneous parallel system, Int. J. of Comp. Sci. and Sec.
[6]. R.Rengasamy and M.Chidambaram, “A Novel Predictive Resource Allocation Framework for Cloud Computing”, In Proceedings of International Conference on Advanced Computing and Communications Systems (ICACCS), 2019.
[7]. R.Rengasamy and M.Chidambaram “Challenges and Oppurtunities of Resource Allocation Frameworks for Big data Tools in Cloud Computing”, International Journal of Computer Sciences and Engineering, Vol.6, Issue 12. Dec-2018, e-ISSN-2347-2693.
[8]. P. Sempolinski, D. Thain, A Comparison and Critique of Eucalyptus, OpenNebula and Nimbus, in: Proceedings of the 2010 IEEE Second International Conference on Cloud Computing Technology and Science, CLOUDCOM ’10, IEEE Computer Society, Washington, DC, USA, ISBN 978-0-7695-4302-4, 417–426, doi: 10.1109/CloudCom.2010.42, URL http://dx.doi.org/10.1109/CloudCom.2010.42, 2010.population, Future Generation Computer Systems 27 (8) (2011) 1035–1046, ISSN 0167-739X, doi:10.1016/j.future.2011.04.011, URL http://dx.doi.org/10.1016/j.future.2011.04.011.
[9]. B. Palanisamy, A. Singh, L. Liu, and B. Jain. Purlieus: locality-aware resource allocation for MapReduce in a cloud. In Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2011.
[10]. J. Dean and S. Ghemawat, “Mapreduce: Simplified data processing on large clusters,” ACM Commun., vol. 51, Jan. 2008, pp. 107-113.
[11]. M. Zaharia, D. Borthakur, J. Sen Sarma, K. Elmeleegy, S. Shenker, and I. Stoica. Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling. In Proceedings of the 5th European Conference on Computer systems (EuroSys), 2010.
[12]. Udendhran R, “A Hybrid Approach to Enhance Data Security in Cloud Storage”, ICC '17 Proceedings of the Second International Conference on Internet of things and Cloud Computing at Cambridge University, United Kingdom — March 22 - 23, 2017, ISBN:978-1-4503-4774-7 https://dl.acm.org/citation.cfm?doid=3018896.3025138.
[13]. Suresh, A., Udendhran, R., Balamurgan, M. et al. "A Novel Internet of Things Framework Integrated with Real Time Monitoring for Intelligent Healthcare Environment "Springer-Journal of Medical System (2019) 43: 165. https://doi.org/10.1007/s10916-019-1302-9.
[14]. Udendhran R., Balamurgan M. (2020) An Effective Hybridized Classifier Integrated with Homomorphic Encryption to Enhance Big Data Security. In: Haldorai A., Ramu A., Mohanram S., Onn C. (eds) EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing. EAI/Springer Innovations in Communication and Computing. Springer, Cham
[15]. Suresh, A., Udendhran, R. & Balamurgan, M. " Hybridized neural network and decision tree based classifier for prognostic decision making in breast cancers " Springer - Journal of Soft Computing (2019). https://doi.org/10.1007/s00500-019-04066-4.
[16]. D. Kreuter, “Where server virtualization was born”, Virtual Strategy Magazine, July 2004
[17]. Hive Performance Benchmarks. https://issues.apache.org/jira/browse/HIVE-396 .
[18]. S. L. Faraz Ahmad and T. V. Mithuna Thottethodi. MapReduce with communication overlap (marco). http://docs.lib.purdue.edu/cgi/viewcontentcgi?article=1412&context=ecetr, 2007.
[19]. Jon Kleinberg. The small-world phenomenon: An algorithmic perspective. In Proceedings of the Thirty-second Annual ACM Symposium on Theory of Computing, STOC ’00, pages 163–170, New York, NY, USA, 2000. ACM
[20]. Lun Li, David Alderson, John C Doyle, and Walter Willinger. Towards a theory of scale-free graphs: Definition, properties, and implications. Internet Mathematics, 2(4):431–523, 2005. .
[21]. Tim Mather, Subra Kumaraswamy, and Shahed Latif. Cloud Security and Pri­ vacy: An Enterprise Perspective on Risks and Compliance. O’Reilly Media, Inc., 2009. .
[22]. Xiaoqiao Meng, Vasileios Pappas, and Li Zhang. Improving the scalability of data center networks with traffic-aware virtual machine placement. In INFOCOM, 2010 Proceedings IEEE, pages 1–9, 2010
[23]. Issawi, S. F., Halees, A. A., & Radi, M. (2015). An efficient adaptive load-balancing algorithm for cloud computing under bursty workloads. Engineering, Technology, & Applied Science Research, 5(3), 795-800.
[24]. Jena, S. R., & Ahmad, Z. (2013). Response time minimization of different load balancing algorithms in cloud computing environment. International Journal of Computer Applications, 69(17), 22-27.
[25]. LaCurts, K. L. (2014, June). Application workload prediction and placement in cloud computing systems (Unpublished doctoral dissertation). Massachusetts Institute of Technology, Cambridge Massachusetts.
[26]. Lee, R., & Jeng, B. (2011). Load-balancing tactics in cloud. In Proceedings of the International Conference on Cyber-Enabled Distributed Computing and Knowledge CyberC Discovery, pp. 447-454.
[27]. Mahmood, Z. (2011). Cloud computing: characteristics and deployment approaches. In the 11th IEEE International Conference on Computer and Information Technology, pp. 121-126.
[28]. Mathur, S., Larji, A. A., & Goyal, A. (2017). Static load balancing using SA Max-Min algorithm. International Journal for Research in Applied Science & Engineering Technology, 5(4), 1886-1893.
[29]. Nema, R., & Edwin, S. T. (2016). A new efficient balancing algorithm for a cloud computing environment. International Journal of Latest Research in Engineering and Technology, 2(2), 69-75.
[30]. What is the effective way to handle Big Data? https://www.zarantech.com/blog/effective-way-handle-big-data/