Li, Yuhua, Bandar, Zuhair A. and McLean, David A. (2003) An approach for measuring semantic similarity between words using multiple information sources. IEEE transactions on knowledge and data engineering, 15 (4). pp. 871-882. ISSN 1041-4347
File not available for download.Abstract
Semantic similarity between words is becoming a generic problem for many applications of computational linguistics and artificial intelligence. This paper explores the determination of semantic similarity by a number of information sources, which consist of structural semantic information from a lexical taxonomy and information content from a corpus. To investigate how information sources could be used effectively, a variety of strategies for using various possible information sources are implemented. A new measure is then proposed which combines information sources nonlinearly. Experimental evaluation against a benchmark set of human similarity ratings demonstrates that the proposed measure significantly outperforms traditional similarity measures.
Impact and Reach
Statistics
Additional statistics for this dataset are available via IRStats2.