Health & Environmental Research Online (HERO)


7317558 
Journal Article 
Deja vu: a database of highly similar citations in the scientific literature 
Errami, M; Sun, Z; Long, TC; George, AC; Garner, HR 
2009 
Nucleic Acids Research
ISSN: 0305-1048
EISSN: 1362-4962 
37 
Database issue 
D921-D924 
English 
In the scientific research community, plagiarism and covert multiple publication of the same data are considered unacceptable because they undermine public confidence in scientific integrity. Yet little has been done to help authors and editors identify highly similar citations, which may sometimes represent cases of unethical duplication. For this reason, we have released Déjà vu, a publicly available database of highly similar Medline citations identified by the text similarity search engine eTBLAST. Following manual verification, highly similar citation pairs are classified into categories ranging from duplicates with different authors to sanctioned duplicates. Déjà vu records also contain user-provided commentary and supporting information to substantiate each document's categorization. Déjà vu and eTBLAST are available to authors, editors, reviewers, ethicists and sociologists to study, intercept, annotate and deter questionable publication practices. These tools are part of a sustained effort to enhance the quality of Medline as 'the' biomedical corpus. The Déjà vu database is freely accessible at http://spore.swmed.edu/dejavu. The tool eTBLAST is also freely available at http://etblast.org.
access to information; article; bibliographic database; computer interface; data analysis; information processing; Internet; medical research; MEDLINE; priority journal; publishing; research ethics; scientific literature; search engine; Databases, Bibliographic; Duplicate Publication as Topic; MEDLINE; User-Computer Interface