METHODS OF CHECKING THE AUTHENTICITY OF TEXTS

Authors

  • V. Voitsekh Ukrainian American Concordia University

DOI:

https://doi.org/10.30890/2567-5273.2022-23-02-026

Keywords:

Authenticity of texts, Shingles method, Plagiarism, Text analysis, Document Indexing

Abstract

A general overview of the available methods for verifying the authenticity of texts was performed, the advantages and disadvantages of each were analyzed. The shingle method has been implemented, as well as a modified string-matching algorithm, which all

Metrics

Metrics Loading ...

References

Jaccard Similarity and Shingling, https://www.cs.utah.edu/~jeffp/teaching/cs5955/L4-Jaccard+Shingle.pdf

The Shingles algorithm, https://en.ryte.com/wiki/Shingle_Algorithm

WordNet. George A. Miller (1995). WordNet: A Lexical Database for English. Communications of the ACM Vol. 38, No. 11: 39-41. Christiane Fellbaum (1998, ed.) WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press, https://wordnet.princeton.edu/

Apache Lucene, https://lucene.apache.org/

Stanford Log-linear Part-Of-Speech Tagger, https://nlp.stanford.edu/software/tagger.shtml

Stein, Benno; Koppel, Moshe; Stamatatos, Efstathios (Dec 2007), "Plagiarism Analysis, Authorship Identification, and Near-Duplicate Detection PAN'07" (PDF), SIGIR Forum, 41, https://www.uni-weimar.de/medien/webis/publications/papers/stein_2007o.pdf

Dreher, Heinz (2007), "Automatic Conceptual Analysis for Plagiarism Detection" (PDF), Information and Beyond: The Journal of Issues in Informing Science and Information Technology, 4: 601–614, http://proceedings.informingscience.org/InSITE2007/IISITv4p601-614Dreh383.pdf

Stylometry-based Fraud and Plagiarism Detection for Learning at Scale, https://www.researchgate.net/publication/271836873_Stylometry-based_Fraud_and_Plagiarism_Detection_for_Learning_at_Scale

Top 10 Free Plagiarism Detection Tools, https://elearningindustry.com/top-10-free-plagiarism-detection-tools-for-teachers

The Winning Approach to Text Alignment for Text Reuse Detection at PAN 2014, http://ceur-ws.org/Vol-1180/CLEF2014wn-Pan-SanchezPerezEt2014.pdf

Finding near-duplicate documents, http://www.cs.princeton.edu/courses/archive/spr08/cos435/Class_notes/duplicateDocs_corrected.pdf

Published

2022-10-30

How to Cite

Войцех, В. (2022). METHODS OF CHECKING THE AUTHENTICITY OF TEXTS. Modern Engineering and Innovative Technologies, 2(23-02), 128–132. https://doi.org/10.30890/2567-5273.2022-23-02-026

Issue

Section

Articles