Plagiarism Detection in arXiv

Connotea Imports 2012-05-15

Summary:

Abstract: We describe a large-scale application of methods for finding plagiarism in research document collections. The methods are applied to a collection of 284,834 documents collected by arXiv.org over a 14-year period, covering a few different research disciplines. The methodology efficiently detects a variety of problematic author behaviors, and heuristics are developed to reduce the number of false positives. The methods are also efficient enough to implement as a real-time submission screen for a collection many times larger.

Link:

http://arxiv.org/abs/cs.DB/0702012

From feeds:

Open Access Tracking Project (OATP) » Connotea Imports

Tags:

oa.physics oa.repositories.disciplinary oa.arxiv oa.plagiarism oa.repositories

Authors:

petersuber

Date tagged:

05/15/2012, 12:42

Date published:

05/13/2012, 21:15