Back to articles
Articles
Volume: 16 | Article ID: art00036
Image
Cluster-based Unsupervised Automatic Keyphrases Extraction algorithms: experimentations on Cultural Heritage datasets
  DOI :  10.2352/issn.2168-3204.2019.1.0.36  Published OnlineMay 2019
Abstract

Automatic keyword extraction is the process of identifying key terms and key phrases from documents that can appropriately represent the subject of the documents. We present here a work-in-progress, an experimentation done on unsupervised keyword extraction, with the aim of automatically associating scored keyphrases to texts, using (standard or innovative) cluster based methods, and integrating word embedding to enhance semantic relatedness of keyphrases. In the paper we present the datasets used, the state-of-theart for unsupervised automatic extraction algorithms, based on cluster methods, and we describe in details the algorithms implemented and preliminary results obtained. The results obtained are discussed, commented, and compared with those obtained, in previous experimentations, using TextRank, RAKE and Tf-idf.

Subject Areas :
Views 8
Downloads 2
 articleview.views 8
 articleview.downloads 2
  Cite this article 

Maria Teresa Artese, Isabella Gagliardi, "Cluster-based Unsupervised Automatic Keyphrases Extraction algorithms: experimentations on Cultural Heritage datasetsin Proc. IS&T Archiving 2019,  2019,  pp 156 - 160,  https://doi.org/10.2352/issn.2168-3204.2019.1.0.36

 Copy citation
  Copyright statement 
Copyright © Society for Imaging Science and Technology 2019
72010361
Archiving Conference
archiving
2161-8798
Society for Imaging Science and Technology