Software library for authorship identification

Ivan Ivanov, Cvetina Hantova, Maria Nisheva, Peter L. Stanchev, Phillip Ein-Dor

Research output: Contribution to journal › Conference article › peer-review

Abstract

The aim of this paper is to review some methods for text authorship attribution and to discuss the development of a software library with tools for automatic authorship attribution. The presentation is focused on an analysis of two groups of tools oriented to: (1) methods for extraction of features and (2) methods for computing the distance between character strings based on data compression algorithms.
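
As a rough illustration of the two groups of tools named in the abstract, the sketch below counts character n-grams (a simple extracted feature) and computes the normalized compression distance, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed size. It is not the library described in the paper: the function names `char_ngrams`, `compressed_size`, and `ncd` are hypothetical, and zlib is assumed as the compressor only because it ships with Python.

```python
import zlib
from collections import Counter


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams (hypothetical feature extractor)."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def compressed_size(data: bytes) -> int:
    """Size in bytes of the zlib-compressed data (assumed compressor)."""
    return len(zlib.compress(data, 9))


def ncd(x: str, y: str) -> float:
    """Normalized compression distance between two strings."""
    bx, by = x.encode("utf-8"), y.encode("utf-8")
    cx, cy = compressed_size(bx), compressed_size(by)
    cxy = compressed_size(bx + by)  # compressed size of the concatenation
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    known = "the quick brown fox jumps over the lazy dog " * 20
    other = "lorem ipsum dolor sit amet consectetur adipiscing elit " * 20
    print(char_ngrams("abab", 2))   # n-gram counts for a toy string
    print(ncd(known, known))        # distance of a text to itself
    print(ncd(known, other))        # distance between unrelated texts
```

In an attribution setting, a disputed text would typically be assigned to the candidate author whose known writing yields the smallest distance. With a real compressor the distance of a text to itself is small but generally not exactly zero.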

Original language: English
Pages (from-to): 91-97
Number of pages: 7
Journal: Digital Presentation and Preservation of Cultural and Scientific Heritage
Volume: 5
State: Published - 16 Feb 2017
Event: 5th International Conference on Digital Presentation and Preservation of Cultural and Scientific Heritage, DiPP 2015 - Veliko Tarnovo, Bulgaria
Duration: 28 Sep 2015 → 30 Sep 2015

Keywords

  • Compression algorithms
  • N-grams
  • Natural frequency zoned word distribution
  • Normalized compression distance
  • Text authorship identification
