July 1, 2003Open Access

An approach for measuring semantic similarity between words using multiple information sources

Key Points

Key points are not available for this paper at this time.

Abstract

Semantic similarity between words is becoming a generic problem for many applications of computational linguistics and artificial intelligence. This paper explores the determination of semantic similarity by a number of information sources, which consist of structural semantic information from a lexical taxonomy and information content from a corpus. To investigate how information sources could be used effectively, a variety of strategies for using various possible information sources are implemented. A new measure is then proposed which combines information sources nonlinearly. Experimental evaluation against a benchmark set of human similarity ratings demonstrates that the proposed measure significantly outperforms traditional similarity measures.

Read Full Paperexternally

AI에게 질문

Bookmark

View Full Paper