Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology

Authors Dagmar Gromann , Thierry Declerck



PDF
Thumbnail PDF

File

OASIcs.LDK.2019.21.pdf
  • Filesize: 436 kB
  • 15 pages

Document Identifiers

Author Details

Dagmar Gromann
  • University of Vienna, Vienna, Austria
Thierry Declerck
  • DFKI GmbH, Saarbrücken, Germany
  • ACDH-OEAW, Vienna, Austria

Acknowledgements

We would like to thank the anonymous reviewers for their very helpful comments on the original submission of this paper.

Cite As Get BibTex

Dagmar Gromann and Thierry Declerck. Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology. In 2nd Conference on Language, Data and Knowledge (LDK 2019). Open Access Series in Informatics (OASIcs), Volume 70, pp. 21:1-21:15, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2019) https://doi.org/10.4230/OASIcs.LDK.2019.21

Abstract

Semantic shifts caused by derivational morphemes is a common subject of investigation in language modeling, while inflectional morphemes are frequently portrayed as semantically more stable. This study is motivated by the previously established observation that inflectional morphemes can be just as variable as derivational ones. For instance, the English plural "-s" can turn the fabric silk into the garments of a jockey, silks. While humans know that silk in this sense has no plural, it takes more for machines to arrive at this conclusion. Frequently utilized computational language resources, such as WordNet, or models for representing computational lexicons, like OntoLex-Lemon, have no descriptive mechanism to represent such inflectional semantic shifts. To investigate this phenomenon, we extract word pairs of different grammatical number from WordNet that feature additional senses in the plural and evaluate their distribution in vector space, i.e., pre-trained word2vec and fastText embeddings. We then propose an extension of OntoLex-Lemon to accommodate this phenomenon that we call inflectional morpho-semantic variation to provide a formal representation accessible to algorithms, neural networks, and agents. While the exact scope of the problem is yet to be determined, this first dataset shows that it is not negligible.

Subject Classification

ACM Subject Classification
  • Information systems
Keywords
  • Inflectional morphology
  • semantic shift
  • embeddings
  • formal lexical modeling

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Oded Avraham and Yoav Goldberg. The Interplay of Semantics and Morphology in Word Embeddings. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pages 422-426. Association for Computational Linguistics, 2017. Google Scholar
  2. Jiang Bian, Bin Gao, and Tie-Yan Liu. Knowledge-Powered Deep Learning for Word Embedding. In Joint European conference on machine learning and knowledge discovery in databases, pages 132-148. Springer, 2014. Google Scholar
  3. Steven Bird. NLTK: The Natural Language Toolkit. In In Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics. Association for Computational Linguistics, 2002. Google Scholar
  4. Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. Enriching Word Vectors with Subword Information. Transactions of the Association for Computational Linguistics, 5:135-146, 2017. Google Scholar
  5. Greville G Corbett. Canonical Typolgy, Suppletion, and Possible Words. Language, pages 8-42, 2007. Google Scholar
  6. Ryan Cotterell and Hinrich Schütze. Joint Semantic Synthesis and Morphological Analysis of the Derived Word. Transactions of the Association of Computational Linguistics, 6:33-48, 2018. Google Scholar
  7. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR, abs/1810.04805, 2018. URL: http://arxiv.org/abs/1810.04805.
  8. Christiane Fellbaum, editor. WordNet: An Electronic Lexical Database. The MIT Press, Cambridge, MA ; London, May 1998. Google Scholar
  9. Anna Gladkova, Aleksandr Drozd, and Satoshi Matsuoka. Analogy-based Detection of Morphological and Semantic Relations with Word Embeddings: What Works and What doesn't. In Proceedings of the NAACL Student Research Workshop, pages 8-15, 2016. Google Scholar
  10. Paul Kiparsky and Judith Tonhauser. Semantics of Inflection. Handbook of Semantics, 3:2070-2097, 2012. Google Scholar
  11. Tal Linzen. Issues in Evaluating Semantic Spaces Using Word Analogies. In In Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, pages 13-18, 2016. Google Scholar
  12. Robert Malouf. Abstractive Morphological Learning with a Recurrent Neural Network. Morphology, 27(4):431-458, 2017. Google Scholar
  13. John McCrae, Guadalupe Aguado-de Cea, Paul Buitelaar, Philipp Cimiano, Thierry Declerck, Asuncion Gomez-Perez, Jorge Garcia, Laura Hollink, Elena Montiel-Ponsoda, Dennis Spohr, and Tobias Wunner. Interchanging Lexical Resources on the Semantic Web. Language Resources and Evaluation, 46(4):701-719, 2012. Google Scholar
  14. John P. McCrae, Paul Buitelaar, and Philipp Cimiano. The OntoLex-Lemon Model: Development and Applications. In Iztok Kosem, Jelena Kallas, Carole Tiberius, Simon Krek, Miloš Jakubíček, and Vít Baisa, editors, Proceedings of eLex 2017, pages 587-597. INT, Trojína and Lexical Computing, Lexical Computing CZ s.r.o., September 2017. Google Scholar
  15. Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. Efficient Estimation of Word Representations in Vector Space. CoRR, abs/1301.3781, 2013. URL: http://arxiv.org/abs/1301.3781.
  16. Sebastian Padó, Aurélie Herbelot, Max Kisselew, and Jan Šnajder. Predictability of Distributional Semantics in Derivational Word Formation. In Proceedings of the 26th International Conference on Computational Linguistics, pages 1285-1296, 2016. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail