Search Results

Documents authored by Croft, William


Document
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 23191)

Authors: Timothy Baldwin, William Croft, Joakim Nivre, Agata Savary, Sara Stymne, and Ekaterina Vylomova

Published in: Dagstuhl Reports, Volume 13, Issue 5 (2023)


Abstract
The Dagstuhl Seminar 23191 entitled "Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics" took place May 7-12, 2023. Its main objectives were to deepen the understanding of language universals and linguistic idiosyncrasy, to harness idiosyncrasy in treebanking frameworks in computationally tractable ways, and to promote a higher degree of convergence in universalism-driven initiatives to natural language morphology, syntax and semantics. Most of the seminar was devoted to working group discussions, covering topics such as: representations below and beyond word boundaries; annotation of particular kinds of constructions; semantic representations, in particular for multiword expressions; finding idiosyncrasy in corpora; large language models; and methodological issues, community interactions and cross-community initiatives. Thanks to the collaboration of linguistic typologists, NLP experts and experts in different annotation frameworks, significant progress was made towards the theoretical, practical and networking objectives of the seminar.

Cite as

Timothy Baldwin, William Croft, Joakim Nivre, Agata Savary, Sara Stymne, and Ekaterina Vylomova. Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 23191). In Dagstuhl Reports, Volume 13, Issue 5, pp. 22-70, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2023)


Copy BibTex To Clipboard

@Article{baldwin_et_al:DagRep.13.5.22,
  author =	{Baldwin, Timothy and Croft, William and Nivre, Joakim and Savary, Agata and Stymne, Sara and Vylomova, Ekaterina},
  title =	{{Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 23191)}},
  pages =	{22--70},
  journal =	{Dagstuhl Reports},
  ISSN =	{2192-5283},
  year =	{2023},
  volume =	{13},
  number =	{5},
  editor =	{Baldwin, Timothy and Croft, William and Nivre, Joakim and Savary, Agata and Stymne, Sara and Vylomova, Ekaterina},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/entities/document/10.4230/DagRep.13.5.22},
  URN =		{urn:nbn:de:0030-drops-193648},
  doi =		{10.4230/DagRep.13.5.22},
  annote =	{Keywords: computational linguistics, morphosyntax, multiword expressions, language universals, idiosyncrasy}
}
Document
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 21351)

Authors: Timothy Baldwin, William Croft, Joakim Nivre, and Agata Savary

Published in: Dagstuhl Reports, Volume 11, Issue 7 (2021)


Abstract
Computational linguistics builds models that can usefully process and produce language and that can increase our understanding of linguistic phenomena. From the computational perspective, language data are particularly challenging notably due to their variable degree of idiosyncrasy (unexpected properties shared by few peer objects), and the pervasiveness of non-compositional phenomena such as multiword expressions (whose meaning cannot be straightforwardly deduced from the meanings of their components, e.g. red tape, by and large, to pay a visit and to pull one’s leg) and constructions (conventional associations of forms and meanings). Additionally, if models and methods are to be consistent and valid across languages, they have to face specificities inherent either to particular languages, or to various linguistic traditions. These challenges were addressed by the Dagstuhl Seminar 21351 entitled "Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics", which took place on 30-31 August 2021. Its main goal was to create synergies between three distinct though partly overlapping communities: experts in typology, in cross-lingual morphosyntactic annotation and in multiword expressions. This report documents the program and the outcomes of the seminar. We present the executive summary of the event, reports from the 3 Working Groups and abstracts of individual talks and open problems presented by the participants.

Cite as

Timothy Baldwin, William Croft, Joakim Nivre, and Agata Savary. Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 21351). In Dagstuhl Reports, Volume 11, Issue 7, pp. 89-138, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2021)


Copy BibTex To Clipboard

@Article{baldwin_et_al:DagRep.11.7.89,
  author =	{Baldwin, Timothy and Croft, William and Nivre, Joakim and Savary, Agata},
  title =	{{Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 21351)}},
  pages =	{89--138},
  journal =	{Dagstuhl Reports},
  ISSN =	{2192-5283},
  year =	{2021},
  volume =	{11},
  number =	{7},
  editor =	{Baldwin, Timothy and Croft, William and Nivre, Joakim and Savary, Agata},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/entities/document/10.4230/DagRep.11.7.89},
  URN =		{urn:nbn:de:0030-drops-155911},
  doi =		{10.4230/DagRep.11.7.89},
  annote =	{Keywords: computational linguistics, morphosyntax, multiword expressions, language universals, idiosyncrasy}
}
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail