1 Search Results for "Sousa, Tiago"


Document
Exploring Different Methods for Solving Analogies with Portuguese Word Embeddings

Authors: Tiago Sousa, Hugo Gonçalo Oliveira, and Ana Alves

Published in: OASIcs, Volume 83, 9th Symposium on Languages, Applications and Technologies (SLATE 2020)


Abstract
A common way of assessing static word embeddings is to use them for solving analogies of the kind "what is to king as man is to woman?". For this purpose, the vector offset method (king - man + woman = queen), also known as 3CosAdd, has been effectively used for solving analogies and assessing different models of word embeddings in different languages. However, some researchers pointed out that this method is not the most effective for this purpose. Following this, we tested alternative analogy solving methods (3CosMul, 3CosAvg, LRCos) in Portuguese word embeddings and confirmed the previous statement. Specifically, those methods are used to answer the Portuguese version of the Google Analogy Test, dubbed LX-4WAnalogies, which covers syntactic and semantic analogies of different kinds. We discuss the accuracy of different methods applied to different models of embeddings and take some conclusions. Indeed, all methods outperform 3CosAdd, and the best performance is consistently achieved with LRCos, in GloVe.

Cite as

Tiago Sousa, Hugo Gonçalo Oliveira, and Ana Alves. Exploring Different Methods for Solving Analogies with Portuguese Word Embeddings. In 9th Symposium on Languages, Applications and Technologies (SLATE 2020). Open Access Series in Informatics (OASIcs), Volume 83, pp. 9:1-9:14, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2020)


Copy BibTex To Clipboard

@InProceedings{sousa_et_al:OASIcs.SLATE.2020.9,
  author =	{Sousa, Tiago and Gon\c{c}alo Oliveira, Hugo and Alves, Ana},
  title =	{{Exploring Different Methods for Solving Analogies with Portuguese Word Embeddings}},
  booktitle =	{9th Symposium on Languages, Applications and Technologies (SLATE 2020)},
  pages =	{9:1--9:14},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-165-8},
  ISSN =	{2190-6807},
  year =	{2020},
  volume =	{83},
  editor =	{Sim\~{o}es, Alberto and Henriques, Pedro Rangel and Queir\'{o}s, Ricardo},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2020.9},
  URN =		{urn:nbn:de:0030-drops-130229},
  doi =		{10.4230/OASIcs.SLATE.2020.9},
  annote =	{Keywords: analogies, word embeddings, semantic relations, syntactic relations, Portuguese}
}
  • Refine by Author
  • 1 Alves, Ana
  • 1 Gonçalo Oliveira, Hugo
  • 1 Sousa, Tiago

  • Refine by Classification
  • 1 Computing methodologies → Lexical semantics

  • Refine by Keyword
  • 1 Portuguese
  • 1 analogies
  • 1 semantic relations
  • 1 syntactic relations
  • 1 word embeddings

  • Refine by Type
  • 1 document

  • Refine by Publication Year
  • 1 2020

Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail