Thesaurus based term ranking for keyword extraction

Authors Christian Wartena, Rogier Brussee, Luit Gazendam
Publication date 3 September 2010
Type Lecture


A common strategy for assigning keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for selecting a word as a keyword is its relevance to the text. The tf.idf score of a term is a widely used relevance measure. Although it is easy to compute and gives quite satisfactory results, this measure does not take (semantic) relations between words into account.
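
For readers unfamiliar with the tf.idf baseline mentioned above, the following is a minimal sketch of one common variant (relative term frequency multiplied by the logarithm of the inverse document frequency). The function names `tf_idf` and `top_keywords` and the toy corpus are illustrative assumptions and are not taken from the lecture; other weighting variants exist.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: score} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents does each term occur?
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def top_keywords(docs, index, k=3):
    """Rank the terms of docs[index] by tf.idf; ties are broken alphabetically."""
    ranked = sorted(tf_idf(docs)[index].items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Hypothetical toy corpus, tokenized by whitespace.
docs = [
    "the thesaurus links related terms".split(),
    "the keyword ranking uses term relevance".split(),
    "the radio broadcast covered the news".split(),
]
print(top_keywords(docs, 0))
```

In this sketch a word such as "the", which occurs in every document, receives a score of zero, whereas "terms" and "term" are scored as unrelated words. The latter illustrates the limitation noted in the abstract: the measure treats each word in isolation and ignores relations between words.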

Language English
Key words Information retrieval, keywords, Keyword extraction, Crossmedialab