ASSOCIATION OF IDENTICAL PAIRS USING NATURAL LANGUAGE PROCESSING

Authors

  • Saravanan , Alagarsamy Department of Computer Science and Engineering. Kalasalingam Academy of Research and Education, Anand Nagar, Author

DOI:

https://doi.org/10.61841/83dpa732

Keywords:

- Nature Language Processing, Vector Space Modeling,, Artificial Intelligence.

Abstract

Question duplication is the serious issue experienced by question and answer discussion forum like Quora, Stack-flood, Reddit, and so on. Answers get divided across various adaptations of a similar inquiry because of the repetition of inquiries in these gatherings. In the end, this outcome in absence of a reasonable pursuit, answer weakness, isolation of data and the lack of reaction to the examiners. The copied questions can be identified utilizing Machine Learning and Natural Language Processing. Dataset of in excess of 400,000 inquiries sets gave by Quora are preprocessed through tokenization, lemmatization and evacuation of stop words. This pre- handled dataset is utilized for the element extraction. Fake Neural Network is then planned and the highlights thus removed, are fit into the model. This neural system gives exactness of 86.09%. More or less, this examination predicts the semantic fortuitous event between the inquiry sets removing profoundly prevailing aspects and consequently, decide the likelihood of inquiry being copy.

 

 

Downloads

Download data is not yet available.

References

1. Alagarsamy, S., Kamatchi, K., Govindaraj, V., A Novel Technique for identification of tumor region in MR Brain Image,(pp-1061-1066), in proceedings of the IEEE Third International Conference on Electronics Communication and Aerospace Technology,2019.

2. Alagarsamy, S., Kamatchi, K., Govindaraj, V., Thiyagarajan, A., A fully automated hybrid methodology using Cuckoo-based fuzzy clustering technique for magnetic resonance brain image segmentation, Vol.27 (pp.317-332), International journal of Imaging systems and technology,2017.

3. Alagarsamy. S.,, Kamatchi, K., Govindaraj, V., Zhang, YD., Thiyagarajan, A., Multi-channeled MR brain image segmentation: A new automated approach combining BAT and clustering technique for better identification of heterogeneous tumors,Vol.39 (pp.1005-1035), Biocybernetics and Biomedical Engineering,2019.

4. Bowman, SR., Angeli, G., Potts, C., Manning, CD., A large annotated corpus for learning natural language inference, 2015.

5. Howland, P., Park, H., Generalizing discriminant analysis using the generalized singular value decomposition,Vol.26(pp. 995 – 1006), IEEE Transactions on Pattern Analysis and Machine Intelligence,2004.

6. Liu ,M., Lang, B., Zepeng, G., Zeeshan, A., Measuring similarity of academic articles with semantic profile and joint word embedding,Vol.22(pp. 619 – 632), Tsinghua Science and Technology, 2017.

7. Medjahed, B., Bouguettaya, A., A multilevel composability model for semantic Web services, Vol.(pp. 954 – 968), IEEE Transactions on Knowledge and Data Engineering, 2005.

8. Miller, A., WordNet: a lexical database for English, Vol.38 (pp.39-41),Communications of the ACM,1995.

9. Xie, X., Cai, X., Zhou, J., Cao, N., Wu, Y., A Semantic-Based Method for Visualizing Large Image Collections,Vol.25(pp. 2362 – 2377), IEEE Transactions on Visualization and Computer Graphics,2019.

10. Zhou, P., Shi, W., Tian, J., Qi ,Q, Li,B., Hao,H., Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification,(pp.2017-212), Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics,2016.

11. Prasanthi, E., & Deepa, N. (2019). Real time web based information using natural language processing (NLP) algorithm. Test Engineering and Management, 81(11-12), 5616-5620. Retrieved from www.scopus.com

Downloads

Published

30.06.2020

How to Cite

Alagarsamy, S. ,. (2020). ASSOCIATION OF IDENTICAL PAIRS USING NATURAL LANGUAGE PROCESSING. International Journal of Psychosocial Rehabilitation, 24(6), 7320-7327. https://doi.org/10.61841/83dpa732