Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology

Gromann, Dagmar; Declerck, Thierry

doi:10.4230/OASIcs.LDK.2019.21

File

Author Details

Dagmar Gromann

University of Vienna, Vienna, Austria

Thierry Declerck

DFKI GmbH, Saarbrücken, Germany
ACDH-OEAW, Vienna, Austria

Cite As Get BibTex

Dagmar Gromann and Thierry Declerck. Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology. In 2nd Conference on Language, Data and Knowledge (LDK 2019). Open Access Series in Informatics (OASIcs), Volume 70, pp. 21:1-21:15, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2019) https://doi.org/10.4230/OASIcs.LDK.2019.21

Abstract

Semantic shifts caused by derivational morphemes is a common subject of investigation in language modeling, while inflectional morphemes are frequently portrayed as semantically more stable. This study is motivated by the previously established observation that inflectional morphemes can be just as variable as derivational ones. For instance, the English plural "-s" can turn the fabric silk into the garments of a jockey, silks. While humans know that silk in this sense has no plural, it takes more for machines to arrive at this conclusion. Frequently utilized computational language resources, such as WordNet, or models for representing computational lexicons, like OntoLex-Lemon, have no descriptive mechanism to represent such inflectional semantic shifts. To investigate this phenomenon, we extract word pairs of different grammatical number from WordNet that feature additional senses in the plural and evaluate their distribution in vector space, i.e., pre-trained word2vec and fastText embeddings. We then propose an extension of OntoLex-Lemon to accommodate this phenomenon that we call inflectional morpho-semantic variation to provide a formal representation accessible to algorithms, neural networks, and agents. While the exact scope of the problem is yet to be determined, this first dataset shows that it is not negligible.

Subject Classification

ACM Subject Classification

Information systems

Keywords

Inflectional morphology
semantic shift
embeddings
formal lexical modeling

Metrics

Access Statistics
Total Accesses (updated on a weekly basis)

0

PDF Downloads

0

Metadata Views

References

Oded Avraham and Yoav Goldberg. The Interplay of Semantics and Morphology in Word Embeddings. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pages 422-426. Association for Computational Linguistics, 2017.
Jiang Bian, Bin Gao, and Tie-Yan Liu. Knowledge-Powered Deep Learning for Word Embedding. In Joint European conference on machine learning and knowledge discovery in databases, pages 132-148. Springer, 2014.
Steven Bird. NLTK: The Natural Language Toolkit. In In Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics. Association for Computational Linguistics, 2002.
Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. Enriching Word Vectors with Subword Information. Transactions of the Association for Computational Linguistics, 5:135-146, 2017.
Greville G Corbett. Canonical Typolgy, Suppletion, and Possible Words. Language, pages 8-42, 2007.
Ryan Cotterell and Hinrich Schütze. Joint Semantic Synthesis and Morphological Analysis of the Derived Word. Transactions of the Association of Computational Linguistics, 6:33-48, 2018.
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR, abs/1810.04805, 2018. URL: http://arxiv.org/abs/1810.04805.
Christiane Fellbaum, editor. WordNet: An Electronic Lexical Database. The MIT Press, Cambridge, MA ; London, May 1998.
Anna Gladkova, Aleksandr Drozd, and Satoshi Matsuoka. Analogy-based Detection of Morphological and Semantic Relations with Word Embeddings: What Works and What doesn't. In Proceedings of the NAACL Student Research Workshop, pages 8-15, 2016.
Paul Kiparsky and Judith Tonhauser. Semantics of Inflection. Handbook of Semantics, 3:2070-2097, 2012.
Tal Linzen. Issues in Evaluating Semantic Spaces Using Word Analogies. In In Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, pages 13-18, 2016.
Robert Malouf. Abstractive Morphological Learning with a Recurrent Neural Network. Morphology, 27(4):431-458, 2017.
John McCrae, Guadalupe Aguado-de Cea, Paul Buitelaar, Philipp Cimiano, Thierry Declerck, Asuncion Gomez-Perez, Jorge Garcia, Laura Hollink, Elena Montiel-Ponsoda, Dennis Spohr, and Tobias Wunner. Interchanging Lexical Resources on the Semantic Web. Language Resources and Evaluation, 46(4):701-719, 2012.
John P. McCrae, Paul Buitelaar, and Philipp Cimiano. The OntoLex-Lemon Model: Development and Applications. In Iztok Kosem, Jelena Kallas, Carole Tiberius, Simon Krek, Miloš Jakubíček, and Vít Baisa, editors, Proceedings of eLex 2017, pages 587-597. INT, Trojína and Lexical Computing, Lexical Computing CZ s.r.o., September 2017.
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. Efficient Estimation of Word Representations in Vector Space. CoRR, abs/1301.3781, 2013. URL: http://arxiv.org/abs/1301.3781.
Sebastian Padó, Aurélie Herbelot, Max Kisselew, and Jan Šnajder. Predictability of Distributional Semantics in Derivational Word Formation. In Proceedings of the 26th International Conference on Computational Linguistics, pages 1285-1296, 2016.

Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology

Authors Dagmar Gromann , Thierry Declerck

File

Document Identifiers

Author Details

Acknowledgements

Cite As Get BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

References

Thanks for your feedback!

Could not send message

Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology

Authors Dagmar Gromann , Thierry Declerck

File

Document Identifiers

Author Details

Funding

Acknowledgements

Cite As Get BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

References

Thanks for your feedback!

Could not send message