Evaluation of Sense Embeddings with Optimized Vector-Meaning Correspondence

Tomoaki, Yamazaki; Seiya, Ito; Kouzou, Ohara

Evaluation of Sense Embeddings with Optimized Vector-Meaning Correspondence

Lemon: fruit🍋 or color🟡?
How well do sense embeddings, including from LLMs, capture multiple meaning (multi-sense, polysemy)?

Tomoaki Yamazaki¹, Seiya Ito², Kouzou Ohara¹,

¹Aoyama Gakuin University
²National Institute of Information and Communications Technology (NICT)
IEICE Transactions on Information and Systems

Paper Dataset (Please contact 1st author and must read BabelNet lisence) Code (partial)

Abstract

A word sense is an essential element for understanding what a sentence means and can be interpreted as a concept on its own. To realize this cognition in Computational Linguistics, embedding methods have been proposed to map words to dense vectors. Among them, sense embeddings assign multiple vectors to each word to represent its distinct meanings. Their special feature is that the boundary between the meanings of each word explicitly exists. However, their qualities are evaluated using a conventional approach to word embeddings that implicitly addresses meaning. In precise, these evaluations adopt datasets composed from combinations of pairs of words and similarities between two words, where the number of meanings to be evaluated is limited compared to the number of words. Moreover, their evaluation metrics reflect only a part of the relationships between multi-sense words. To overcome these problems, in this paper, we propose a novel evaluation method to sense embeddings that covers rich meanings and addresses the combinations arising from polysemy, such as the uniqueness and redundancy of vectors. Our key idea is a vector, appropriately representing its meanings, has neighbors that can be considered to be similar words in a vector space. Based on this idea, we automatically construct an evaluation dataset with similar words for each meaning by combining information from two reliable concept hierarchies; one is manually managed, and the other is automatically created and manually managed. Then, based on the constructed dataset, we devise three kinds of evaluation metrics that associate vectors of a multi-sense word with its meanings in the dataset in different manners. Through an experiment, we empirically show that the proposed evaluation method can adequately reflect the quality of sense embeddings compared to the conventional method.

An overview of the proposed evaluation method.

An overview of the evaluation flow and the difference of values subject to be evaluated between three kinds of the evaluation metrics.

Examples of ambiguity and redundancy vectors have in an embedding.

BibTeX

@inproceedings{Yamazaki2025,
    author    = {Tomoaki Yamazaki, Seiya Ito, Kouzou Ohara},
    title     = {Evaluation of Sense Embeddings with Optimized Vector-Meaning Correspondence},
    journal   = {IEICE Transactions on Information and Systems},
    year      = {2025}
  }

Evaluation of Sense Embeddings with Optimized Vector-Meaning Correspondence

Lemon: fruit🍋 or color🟡? How well do sense embeddings, including from LLMs, capture multiple meaning (multi-sense, polysemy)?

Abstract

An overview of the proposed evaluation method.

An overview of the evaluation flow and the difference of values subject to be evaluated between three kinds of the evaluation metrics.

Examples of ambiguity and redundancy vectors have in an embedding.

Examples of ambiguity and redundancy vectors have in an embedding.

BibTeX

Lemon: fruit🍋 or color🟡?
How well do sense embeddings, including from LLMs, capture multiple meaning (multi-sense, polysemy)?