National Research Council CanadaSkip all menusSkip first menu Menu
National Research Council CanadaGovernment of Canada
NRC-IIT - Institute for Information Technology
NRC-IIT - Institute for Information Technology
List by Year
List by Group
List by Author
List by Project
Patents

Printable version Printable
version
Home | Publications

Publications

New

Similarity of Semantic Relations

Author: Turney, P.
Source: Computational Linguistics Journal. Volume 32, Issue 3. September 2006. pp. 379-416.
NRC Publication Number: NRC 48775.

Abstract

There are at least two kinds of similarity. Relational similarity is correspondence between relations, in contrast with attributional similarity, which is correspondence between attributes. When two words have a high degree of attributional similarity, we call them synonyms. When two pairs of words have a high degree of relational similarity, we say that their relations are analogous. For example, the word pair mason:stone is analogous to the pair carpenter:wood. This paper introduces Latent Relational Analysis (LRA), a method for measuring relational similarity. LRA has potential applications in many areas, including information extraction, word sense disambiguation, and information retrieval. Recently the Vector SpaceModel (VSM) of information retrieval has been adapted to measuring relational similarity, achieving a score of 47% on a collection of 374 college-level multiple-choice word analogy questions. In the VSMapproach, the relation between a pair of words is characterized by a vector of frequencies of predefined patterns in a large corpus. LRA extends the VSM approach in three ways: (1) the patterns are derived automatically from the corpus, (2) the Singular Value Decomposition (SVD) is used to smooth the frequency data, and (3) automatically generated synonyms are used to explore variations of the word pairs. LRA achieves 56% on the 374 analogy questions, statistically equivalent to the average human score of 57%. On the related problem of classifying semantic relations, LRA achieves similar gains over the VSM.

Download NRC-IIT's publication NRC-48775.pdf (353 KB) in PDF format.

Disclaimer

Please note that all downloadable files in this section of the Web site are provided in their original language and in PDF format.

If you do not have Adobe Acrobat Reader installed on your computer, you may also download it from this page, or you may convert the file to HTML or ASCII text by using one of the access services provided by Adobe.


This database is updated continuously.
Top of Page