IJSTR

International Journal of Scientific & Technology Research

Home About Us Scope Editorial Board Blog/Latest News Contact Us
0.2
2019CiteScore
 
10th percentile
Powered by  Scopus
Scopus coverage:
Nov 2018 to May 2020

CALL FOR PAPERS
AUTHORS
DOWNLOADS
CONTACT

IJSTR >> Volume 9 - Issue 4, April 2020 Edition



International Journal of Scientific & Technology Research  
International Journal of Scientific & Technology Research

Website: http://www.ijstr.org

ISSN 2277-8616



Expanding Queries in the Information Retrieval System Using Stemming Approach

[Full Text]

 

AUTHOR(S)

Yellepeddi Vijayalakshmi, B. ArunKumar, & G. K. D. PrasannaVenkatesan

 

KEYWORDS

Information Retrieval, text retrieval, Multi-source, bunching, scoring, stemming

 

ABSTRACT

Numerous approaches have been made to find solutions for constructing text information stemmers. These stemmers are generally utilized in perception of application-oriented projects, specifically when they deal with the development of information retrieval (IR) schemes. Moreover, text stemming, as an approach for stripping sets of their suffixes or prefixes is considered as task suffering, when there are problems like single solution, vocalization ambiguity, incorrect removal and so on. However, many investigators claim that stemming approach has reached a high level for accuracy and precision while retrieving texts. In some cases, these stemmer algorithms are measured as black boxes, and it is not probable to access either source code or for corpora to validate estimation, which is used to attain accuracy. As stemmer algorithms are extremely significant for researchers, its comparison and estimation are more significant to facilitate choice of stemming approach to use in information retrieval. Here, stemming algorithm is anticipated based on feature reduction from multi-source-Ant Colony (MS) by performing word bunching and scoring (W-CS), so as to provide solutions to the drawbacks mentioned above. Here, an automatic approach for pre-processing the text has been carried out for evaluation of and comparison of text retrieval from queries that consider performance metrics like accuracy and evaluation period for the stemming algorithm. Simulation was carried out in MATLAB environment. The suggested model outperforms prevailing approaches in terms of accuracy.

 

REFERENCES

[1] Michalski, R. S., Carbonell, J. G., and Mitchell, T. M. (Eds.), Machine learning: An artificial intelligence approach. Springer Science and Business Media, 2013.
[2] Kotsiantis, S. B., Zaharakis, I., and Pintelas, P., Supervised machine learning: A review of classification techniques, 2007.
[3] Bijalwan, V., Kumar, V., Kumari, P., and Pascual, J., KNN based machine learning approach for text and document mining. International Journal of Database Theory and Application, 7(1), 61-70, 2014.
[4] F. Harrag, and E. El-Qawasmeh, “Neural Network for Arabic Text Classification”, Proceedings of ICADIWT’09, pp. 778-783, 2009.
[5] R.M. Duwairi, M.N. Al-Refai, N. Khasawneh, “Feature Reduction Techniques for Arabic Text Categorization”, Journal of the American society for information science and technology, 60(11), pp. 2347-2352, 2009.
[6] M. N. Singh, “The Improved K-Means with Particle Swarm Optimization,” Journal of Information Engineering and Applications, Vol.3, No.11, 2013.
[7] O. Ghanem and W. Ashour, “Stemming Effectiveness in Clustering of Arabic Documents,” International Journal of Computer Applications (0975 – 8887), 2012.
[8] S. Alghamdi and Shahriza., “Arabic web pages clustering and annotation using semantic class features,” Journal of King Saud University, Computer and Information Sciences, 2014.
[9] Kelaiaia and H. Merouani, “Clustering with Probabilistic Topic Models on Arabic Texts: A Comparative Study of LDA and K-Means,” The International Arab Journal of Information
Technology VOL. 13, NO. 2, 2016.
[10] Ismael and N. Hashimah, “Text Document Preprocessing and Dimension Reduction Techniques for Text Document Clustering,” International Conference on Artificial Intelligence with Applications in Engineering and Technology, 2014.
[11] M.M. Syiam, Z.T. Fayed, and M.B. Habib, “An Intelligent System for Arabic Text Categorization”, IJICIS, 6(1), pp. 1-19, 2006.
[12] A. A. Mesleh, “Chi Square Feature Extraction Based Svms Arabic Language Text Categorization System”, Journal of Computer Science, 3(6), pp. 430-435, 2007.
[13] R.M. Duwairi, M.N. Al-Refai, N. Khasawneh, “Feature Reduction Techniques for Arabic Text Categorization”, Journal of the American society for information science and technology, 60(11), pp. 2347-2352, 2009.
[14] F. Harrag, and E. El-Qawasmeh, “Neural Network for Arabic Text Classification”, Proceedings of ICADIWT’09, pp. 778-783, 2009.
[15] Rajendran T et al, “Recent Innovations in Soft Computing Applications”, Current Signal Transduction Therapy, Vol. 14, No. 2, pp. 129 – 130, 2019.
[16] Emayavaramban G et al, “Indentifying User Suitability in sEMG based Hand Prosthesis for using Neural Networks”, Current Signal Transduction Therapy, Vol. 14, No. 2, pp. 158 – 164, 2019.
[17] Rajendran T & Sridhar K P, “Epileptic seizure classification using feed forward neural network based on parametric features”. International Journal of Pharmaceutical Research, 10(4): 189-196, 2018.
[18] Hariraj V et al, “Fuzzy multi-layer SVM classification of breast cancer mammogram images”, International Journal of Mechanical Engineering and Technology, Vol. 9, No.8, pp. 1281-1299, 2018.
[19] Muthu F et al, “Design of CMOS 8-bit parallel adder energy efficient structure using SR-CPL logic style”, Pakistan Journal of Biotechnology, Vol. 14, No. Special Issue II, pp. 257-260, 2017.
[20] Yuvaraj P et al, “Design of 4-bit multiplexer using Sub-ThresholdAdiabatic Logic (STAL)”, Pakistan Journal of Biotechnology, Vol. 14,No. Special Issue II, pp. 261-264, 2017.
[21] Keerthivasan S et al, “Design of low intricate 10-bit current steering digital to analog converter circuitry using full swing GDI”, Pakistan Journal of Biotechnology, Vol. 14, No. Special Issue II, pp. 204-208, 2017.
[22] Vijayakumar P et al, “Efficient implementation of decoder using modified soft decoding algorithm in Golay (24, 12) code”, Pakistan Journal of Biotechnology, Vol. 14, No. Special Issue II, pp. 200-203, 2017.
[23] Rajendran T & Sridhar K P, “Epileptic Seizure-Classification using Probabilistic Neural Network based on Parametric Features”, International Journal of Scientific & Technological Research, Vol.9, No. 3, 2020 (Accepted for Publication).
[24] Rajendran T et al, “Performance analysis of fuzzy multilayer support vector machine for epileptic seizure disorder classification using auto regression features”, Open Biomedical Engineering Journal, Vol. 13, pp. 103-113, 2019.
[25] Rajendran T et al, “Advanced algorithms for medical image processing”, Open Biomedical Engineering Journal, Vol. 13, 102, 2019.
[26] Anitha T et al, “Brain-computer interface for persons with motor disabilities - A review”, Open Biomedical Engineering Journal, Vol. 13, pp. 127-133, 2019.