ASSOCIATION OF IDENTICAL PAIRS USING NATURAL LANGUAGE PROCESSING
DOI:
https://doi.org/10.61841/83dpa732Keywords:
- Nature Language Processing, Vector Space Modeling,, Artificial Intelligence.Abstract
Question duplication is the serious issue experienced by question and answer discussion forum like Quora, Stack-flood, Reddit, and so on. Answers get divided across various adaptations of a similar inquiry because of the repetition of inquiries in these gatherings. In the end, this outcome in absence of a reasonable pursuit, answer weakness, isolation of data and the lack of reaction to the examiners. The copied questions can be identified utilizing Machine Learning and Natural Language Processing. Dataset of in excess of 400,000 inquiries sets gave by Quora are preprocessed through tokenization, lemmatization and evacuation of stop words. This pre- handled dataset is utilized for the element extraction. Fake Neural Network is then planned and the highlights thus removed, are fit into the model. This neural system gives exactness of 86.09%. More or less, this examination predicts the semantic fortuitous event between the inquiry sets removing profoundly prevailing aspects and consequently, decide the likelihood of inquiry being copy.
Downloads
References
1. Alagarsamy, S., Kamatchi, K., Govindaraj, V., A Novel Technique for identification of tumor region in MR Brain Image,(pp-1061-1066), in proceedings of the IEEE Third International Conference on Electronics Communication and Aerospace Technology,2019.
2. Alagarsamy, S., Kamatchi, K., Govindaraj, V., Thiyagarajan, A., A fully automated hybrid methodology using Cuckoo-based fuzzy clustering technique for magnetic resonance brain image segmentation, Vol.27 (pp.317-332), International journal of Imaging systems and technology,2017.
3. Alagarsamy. S.,, Kamatchi, K., Govindaraj, V., Zhang, YD., Thiyagarajan, A., Multi-channeled MR brain image segmentation: A new automated approach combining BAT and clustering technique for better identification of heterogeneous tumors,Vol.39 (pp.1005-1035), Biocybernetics and Biomedical Engineering,2019.
4. Bowman, SR., Angeli, G., Potts, C., Manning, CD., A large annotated corpus for learning natural language inference, 2015.
5. Howland, P., Park, H., Generalizing discriminant analysis using the generalized singular value decomposition,Vol.26(pp. 995 – 1006), IEEE Transactions on Pattern Analysis and Machine Intelligence,2004.
6. Liu ,M., Lang, B., Zepeng, G., Zeeshan, A., Measuring similarity of academic articles with semantic profile and joint word embedding,Vol.22(pp. 619 – 632), Tsinghua Science and Technology, 2017.
7. Medjahed, B., Bouguettaya, A., A multilevel composability model for semantic Web services, Vol.(pp. 954 – 968), IEEE Transactions on Knowledge and Data Engineering, 2005.
8. Miller, A., WordNet: a lexical database for English, Vol.38 (pp.39-41),Communications of the ACM,1995.
9. Xie, X., Cai, X., Zhou, J., Cao, N., Wu, Y., A Semantic-Based Method for Visualizing Large Image Collections,Vol.25(pp. 2362 – 2377), IEEE Transactions on Visualization and Computer Graphics,2019.
10. Zhou, P., Shi, W., Tian, J., Qi ,Q, Li,B., Hao,H., Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification,(pp.2017-212), Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics,2016.
11. Prasanthi, E., & Deepa, N. (2019). Real time web based information using natural language processing (NLP) algorithm. Test Engineering and Management, 81(11-12), 5616-5620. Retrieved from www.scopus.com
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.