Expressing Implicit Semantic Relations without Supervision

doi:10.48550/arXiv.cs/0607120

Expressing Implicit Semantic Relations without Supervision

Turney, Peter D.

We present an unsupervised learning algorithm that mines large text corpora for patterns that express implicit semantic relations. For a given input word pair X:Y with some unspecified semantic relations, the corresponding output list of patterns <P1,...,Pm> is ranked according to how well each pattern Pi expresses the relations between X and Y. For example, given X=ostrich and Y=bird, the two highest ranking output patterns are "X is the largest Y" and "Y such as the X". The output patterns are intended to be useful for finding further pairs with the same relations, to support the construction of lexicons, ontologies, and semantic networks. The patterns are sorted by pertinence, where the pertinence of a pattern Pi for a word pair X:Y is the expected relational similarity between the given pair and typical pairs for Pi. The algorithm is empirically evaluated on two tasks, solving multiple-choice SAT word analogy questions and classifying semantic relations in noun-modifier pairs. On both tasks, the algorithm achieves state-of-the-art results, performing significantly better than several alternative pattern ranking algorithms, based on tf-idf.

Publication:

arXiv e-prints

Pub Date:

July 2006

DOI:

10.48550/arXiv.cs/0607120

arXiv:

arXiv:cs/0607120

Bibcode:

2006cs........7120T

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence;
Computer Science - Information Retrieval;
Computer Science - Machine Learning;
H.3.1;
I.2.6;
I.2.7

E-Print:

8 pages, related work available at http://purl.org/peter.turney/

ADS

Expressing Implicit Semantic Relations without Supervision

Abstract