DIGITAL LIBRARY
NATURAL LANGUAGE PROCESSING FOR PLAGIARISM DETECTION – A SURVEY
1 Riga Technical University (LATVIA)
2 University of Latvia (LATVIA)
About this paper:
Appears in: INTED2022 Proceedings
Publication year: 2022
Pages: 9624-9629
ISBN: 978-84-09-37758-9
ISSN: 2340-1079
doi: 10.21125/inted.2022.2527
Conference name: 16th International Technology, Education and Development Conference
Dates: 7-8 March, 2022
Location: Online Conference
Abstract:
Plagiarism has become a serious problem in academic writing over the years. The rise of the internet has improved the accessibility of information and facilitated the sharing of media, this in turn has made plagiarizing academic work something trivial. The huge availability of sources of material that can be found on the internet make finding a source to plagiarism from as easy as finding a grain of salt on a desert, and conversely detecting whether a paper has been plagiarized is equivalent to finding where that grain of sand used to belong to.

However not all hope is gone. With the years not only has the availability of information improved but also computer's computational power, storage and clever algorithms such as Deep Learning which allows us to use that computational power and storage to train models capable of learning and automating task that were previously though to be almost impossible.

In this survey paper we analyze the evolution of different natural language processing algorithms being used for extrinsic and intrinsic plagiarism detection, the types of plagiarism they excel at finding and the ones they struggle with, their limitations, performance and in which direction the future of plagiarism detection seems to be heading.
Keywords:
Deep Learning, Natural Language Processing, Neural Networks, Plagiarism Detection.