International Journal of Scientific & Technology Research

Home About Us Scope Editorial Board Blog/Latest News Contact Us
10th percentile
Powered by  Scopus
Scopus coverage:
Nov 2018 to May 2020


IJSTR >> Volume 9 - Issue 1, January 2020 Edition

International Journal of Scientific & Technology Research  
International Journal of Scientific & Technology Research

Website: http://www.ijstr.org

ISSN 2277-8616

Empirical Aspects to Analyze Population of India using Apache Pig in Evolutionary of Big Data Environment

[Full Text]



Yogesh Kumar Gupta, Tanusha Mittal



Big data, Hadoop, Apache Pig, and Gender Ratio



Big data contains great variety of data arrives in incrementing volume and with high velocity. The data sets are so voluminous that the conventional data processing software just can’t able to manage them. Hence, big data tools i.e. Hadoop came into the glare due to its high scalability, availability and the cluster environment mechanism which provides the facility to work in the distributed manner. One of the important components of Hadoop is MapReduce which is able to handle the unstructured data but to use this, high programming skills are needed. Therefore, due to the reason of high programming skill, users are now a days moving towards the tool i.e. Apache Pig, as we can analyze the data simply by executing the queries. In this paper, we analyze the gender ratio of India according to the age group of 0 to 24 from the year of 2001-2018 that is further analyzed through Pig Latin scripts and results are represented in the pictorial form. The government of India introduces a policy i.e. Two-Child Policy. The policies are implemented by disallowing the people with more than two children from serving the government. Firstly, the policy was implemented by Assam in 2017. The motive of this paper is to analyze whether the introduced policy of government is fulfilling or not.



[1] Z. Zhuoyao, C. Ludmila, V. Abhishek and L.T. Boon, “Meeting service level objective of Pig programs”. Proceedings of the 2nd International Workshop on Cloud Computing Platforms. ACM, 2012.
[2] Dhawan, S. and Rathee, S. “Big Data Analytics using Hadoop Components like pig and hive”. American International Journal of Research in Science, Technology, Engineering & Mathematics, pp.88-93, March-May, 2013.
[3] Kataria, M. And Mittal, P. “Big Data and Hadoop with components like Flume, Pig, Hive and Jaql”, International Journal of Computer Science and Mobile Computing, Vol. 3 Issue 7, pp. 759-765, July 2014.
[4] Swarna, C. and Ansari, Z. “Apache Pig- A Data Flow Framework, Based on Hadoop MapReduce”, International Journal of Engineering Trends and Technology (IJETT)- Vol. 50, Number 5, pp. 271-275, August 2017.
[5] Jain, A. and Bhatnagar, V. “Crime Data Analysis Using Pig with Hadoop”, International Conference on Information Security & Privacy (ICISP2015), 11-12 December, 2015, Procedia Computer Science 78, Pp. 571-578 (2016).
[6] R. A. Preethi and J. Elavarasi. “Big data analytics using Hadoop tools – Apache Hive vs Apache Pig.” Int. J. Emerg. Technol. Computer. Sci. Electron, Vol. 24, 2017.
[7] F.G. Alan, D. Jianyong and N. Thejas “Apache Pig’s Optimizer.” IEEE Data Eng. Bull. pp. 34-45, 2013.
[8] Gupta, Y.K. and Sharma, S. “Impact of Big Data to Analyze Stock Exchange Data Using Apache Pig”. International Journal of Innovative Technology and Exploring Engineering (IJITEE), ISSN: 2278-3075, Volume-8, Issue-7. pp. 1428-1433, May 2019
[9] Gema Bello-Orgaz, Jason J. Jung, David Camacho, “Social big data: Recent achievements and new challenges”. Information Fusion, Volume 28. Pp. 45-59, March 2016.
[10] E. Nada and E. Ahmed “Big data analytics: a literature review paper.” Industrial Conference on Data Mining. Springer, Cham, pp. 214-227, 2014.
[11] Ouakine, Keren, Michael Carey, and Scott Kirkpatrick. “The PigMix Benchmark on Pig, MapReduce and HPCC systems.” Big Data (Big Data Congress), International Congress on. IEEE, 2015.
[12] Shvachko, Konstantin, “The Hadoop distributed file system.” Mass storage systems and technologies (MSTT), IEEE 26th symposium on. IEEE, 2010.
[13] Agarwal, S. and Khanam, Z. “Map Reduce: A Survey Paper on Recent Expansion.” International Journal of Advanced Computer Science and Applications 6.8, pp.209-215 (2015).
[14] Olshannikova, Ekaterina. “Conceptualizing Big Social Data.” Journal of Big Data 4.1 (2017).
[15] Gupta, Y.K. and Barhaiya, G. “Analysis of Crime Rates of Different States in India Using Apache Pig in HDFS Environment”, Recent Patents on Engineering 13: 1. https://doi.org/10.2174/1872212113666190227162314, ISSN 2212-4047 (2019).