Syntactic Transformations in Rule-Based Parsing of Support Verb Constructions: Examples from European Portuguese

Authors Jorge Baptista , Nuno Mamede



PDF
Thumbnail PDF

File

OASIcs.SLATE.2020.11.pdf
  • Filesize: 492 kB
  • 14 pages

Document Identifiers

Author Details

Jorge Baptista
  • University of Algarve, Campus de Gambelas, Faro, Portugal
  • INESC-ID, Lisboa, Portugal
Nuno Mamede
  • Universidade de Lisboa, Instituto Superior Técnico, Portugal
  • INESC-ID, Lisboa, Portugal

Acknowledgements

The authors would like to thank Sónia Reis (U.Algarve and INESC-ID Lisboa for her help in the linguistic data compilation and revision, as well as her attentive reading of initial versions of this manuscript. Naturally, any errors herein are our own responsibility alone.

Cite AsGet BibTex

Jorge Baptista and Nuno Mamede. Syntactic Transformations in Rule-Based Parsing of Support Verb Constructions: Examples from European Portuguese. In 9th Symposium on Languages, Applications and Technologies (SLATE 2020). Open Access Series in Informatics (OASIcs), Volume 83, pp. 11:1-11:14, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2020)
https://doi.org/10.4230/OASIcs.SLATE.2020.11

Abstract

This paper reports on-going work on building a rule-based grammar for (European) Portuguese, incorporating support verb constructions (SVC). The paper focuses on parsing sentences resulting from syntactic transformations of SVC, and presents a methodology to automatically generate testing examples directly from the SVC Lexicon-Grammar matrix where their linguistic properties are represented. These examples allow both to improve the linguistic description of these constructions and to test intrinsically the system parser, spotting unforeseen issues due to previous natural language processing steps.

Subject Classification

ACM Subject Classification
  • Computing methodologies → Natural language processing
  • Computing methodologies → Natural language generation
  • Computing methodologies → Language resources
Keywords
  • Support verb constructions
  • Rule-based parsing
  • syntactic transformations
  • language resources
  • European Portuguese

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. S. Ait-Mokhtar, J. Chanod, and C. Roux. Robustness beyond shallowness: incremental dependency parsing. Natural Language Engineering, 8(2/3):121-144, 2002. Google Scholar
  2. M. F. Athayde. Construções com verbo-suporte (funktionsverbgefuge) do português e do alemão. Cadernos do CIEG Centro Interuniversitário de Estudos Germanísticos, 1, 2001. Google Scholar
  3. Jorge Baptista. Conversão, nomes parte-do-corpo e restruturação dativa. In Ivo Castro, editor, Actas do XII Encontro da Associação Portuguesa de Linguística, volume 1, pages 51-59, Lisboa, 30 de setembro a 2 de outubro de 1996, Braga-Guimarães, Portugal 1997. Associação Portuguesa de Linguística, APL/Colibri. Google Scholar
  4. Jorge Baptista. Sermão, tareia e facada: uma classificação das expressões conversas dar-levar. Seminários de Linguística 1, pages 5-37, 1997. Google Scholar
  5. Jorge Baptista. Construções simétricas: argumentos e complementos. In Olga Figueiredo, Graça Rio-Torto, and F. Silva, editors, Estudos de homenagem a Mário Vilela, pages 353-367. Faculdade de Letras da Universidade do Porto, 2005. Google Scholar
  6. Jorge Baptista. Sintaxe dos Nomes Predicativos com verbo-suporte SER DE. Fundação para a Ciência e a Tecnologia/Fundação Calouste Gulbenkian, Lisboa, 2005. Google Scholar
  7. Jorge Baptista. Viper: A Lexicon-Grammar of European Portuguese verbs. In Jam Radimsky, editor, Actes du 31e Colloque International sur le Lexique et la Grammaire, pages 10-17, République Tchèque, 2012. Université de Bohême du Sud. Google Scholar
  8. Jorge Baptista. ViPEr: uma base de dados de construções léxico-sintáticas de verbos do Português Europeu. In Fátima Silva, Isabel Falé, and Isabel Pereira, editors, Actas do XXVIII Encontro da APL - Textos Selecionados, pages 111-129, Lisboa, 2013. APL/Colibri. Google Scholar
  9. Jorge Baptista, Graça Fernandes, Rui Talhadas, Francisco Dias, and Nuno Mamede. Implementing European Portuguese verbal idioms in a natural language processing system. In Gloria Corpas Pastor, editor, Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives / Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües, pages 102-115. Proceedings of Conference of the European Society of Phraseology (EUROPHRAS 2015), June 28-July 2, 2015, Málaga, Spain, Geneva, Switzerland 2016. Google Scholar
  10. Jorge Baptista and Nuno Mamede. Reciprocal Echo Complements in Portuguese: Linguistic Description in view of Rule-based Parsing. In Jorge Baptista and Mario Monteleone, editors, Proceedings of the 32nd International Conference on Lexis and Grammar (CLG'2013), pages 33-40, Faro, Portugal, September 10-14, 2013 2013. CLG'2103, Universidade do Algarve - FCHS. Google Scholar
  11. Jorge Baptista and Nuno Mamede. Dicionário Gramatical de Verbos do Português Europeu. Universidade do Algarve, December 2020. Google Scholar
  12. Jorge Baptista, Nuno Mamede, and Ilia Markov. Integrating verbal idioms into an nlp system. In Jorge Baptista, Nuno Mamede, Sara Candeias, Ivandré Paraboni, Thiago Pardo, and Maria das Graças Volpe Nunes, editors, Computational Processing of the Portuguese Language, volume 8775 of Lecture Notes in Computer Science / Lecture Notes in Artificial Intelligence, pages 251-256, Berlin, 2014. 11^th International Conference PROPOR'2014, October 8-10, 2014, São Carlos - SP, Brazil, Springer. Google Scholar
  13. Eckhard Bick. Noun sense tagging: Semantic prototype annotation of a Portuguese treebank. In Proceedings of TLT, pages 127-138, 2006. Google Scholar
  14. Nathalia Calcia. Descrição e classificação das construções conversas no português do Brasil. Master’s thesis, Universidade Federal de São Carlos, São Carlos-SP, Brasil, 2016. Google Scholar
  15. Nicoletta Calzolari, Charles J. Fillmore, Ralph Grishman, Nancy Ide, Alessandro Lenci, Catherine Macleod, and Antonio Zampolli. Towards best practices for Multiword Expressions in Computational Lexicons. In Proceedings of LREC'02, pages 1934-1940, Las Palmas, Spain, May 2002. Google Scholar
  16. Lucília Chacoto. O Verbo Fazer em Construções Nominais Predicativas. PhD thesis, Universidade do Algarve, Faro, 2005. Google Scholar
  17. Noam Chomsky. Remarks on nominalization. Linguistics Club, Indiana University, 1968. Google Scholar
  18. Mathieu Constant, G. Eryigit, Joana Monti, L. van der Plas, Carlos Ramisch, Michael Rosner, and A. Todirascu. Multiword expression processing: A survey. Computational Linguistics, pages 837-892, 2017. Google Scholar
  19. Gloria Corpas Pastor, Ruslan Mitkov, Maria Kunilovskaya, and María Araceli Losey León, editors. Processing European Portuguese Verbal Idioms: From the Lexicon-Grammar to a Rule-based Parser, Malaga (Spain), September, 25-27 2019. Tradulex. Google Scholar
  20. Maria Francisca Mendes Queiroz-Pinto de Athayde. A estrutura semântica das construções com verbo-suporte preposicionadas do português e do alemão. PhD thesis, Faculdade de Letras da Universidade de Coimbra, Coimbra, 2000. Google Scholar
  21. Cláudia Dias de Barros. Descrição e classificaçãoo de predicados nominais com o verbo-suporte FAZER: especificidades do Português do Brasil. PhD thesis, Universidade Federal de São Carlos, São Carlos-SP, Brasil, 2014. Google Scholar
  22. Graça; Fernandes and Jorge Baptista. Frozen sentences with obligatory negation: Linguistic challenges for natural language processing. In Carmen Mellado-Blanco, editor, Colocaciones y fraseología en los diccionarios, pages 85-96. Peter Lang, Frankfurt, 2008. Google Scholar
  23. Ana Galvão, Jorge Baptista, and Nuno Mamede. New developments on processing European Portuguese verbal idioms. In Carlos Augusto Prolo and Leandro Henrique Mendonça de Oliveira, editors, 12th Symposium in Information and Human Language Technology, pages 229-238, Salvador, BA (Brazil), October, 15-18 2019. Google Scholar
  24. Gaston Gross. Les construction converses du français. Droz, Genève, 1989. Google Scholar
  25. Maurice Gross. Les bases empiriques de la notion de prédicat sémantique. Langages, 15(63):7-52, 1981. Google Scholar
  26. Maurice Gross. Methods and tactics in the construction of a lexicon-grammar. In Linguistics in the Morning Calm, Selected Papers from SICOL, pages 177-197, Seoul, 1988. Hanshin Pub. Co. Google Scholar
  27. Maurice Gross. Grammaire transformationnelle du français: 3 - Syntaxe de l'adverbe. ASSTRIL, Paris, 1996. Google Scholar
  28. Maurice Gross. Lexicon-grammar. In Keith Brown and Jim Miller, editors, Concise Encyclopedia of Syntactic Theories, pages 244-259. Pergamon, Cambridge, 1996. Google Scholar
  29. Alain Guillet and Christian Leclère. Restructuration du groupe nominal. Langages, 15e année(63):99-125, 1981. Google Scholar
  30. Caroline; Hagège, Jorge; Baptista, and Nuno João Mamede. Reconhecimento de entidades mencionadas com o xip: Uma colaboração entre o inesc-l2f e a xerox. In Cristina; Mota and Diana Santos, editors, Desafios na avaliação conjunta do reconhecimento de entidades mencionadas: Actas do Encontro do Segundo HAREM (Aveiro, 11 de Setembro de 2008). Linguateca, 2009. Google Scholar
  31. Zellig Harris. The elementary transformations. In Henry Hiz, editor, Papers on Syntax, pages 211-235. D. Reidel Publishing Company, 1964. Google Scholar
  32. Adam Kilgarriff, Pavel Rychly, Vojtech Kovar, and Vit Baisa. Finding multiwords of more than two words. In EURALEX, Oslo, 2012. Google Scholar
  33. Béatrice; Lamiroy. Le lexique-grammaire. In Travaux de Linguistique, volume 37. Duculot, 1998. Google Scholar
  34. Christian Leclère. Sur une restructuration dative. Language Research, 31:179-198, 1995. Google Scholar
  35. Nuno Mamede, Jorge Baptista, Cláudio Diniz, and Vera Cabarrão. STRING - A Hybrid Statistical and Rule-Based Natural Language Processing Chain for Portuguese. In Alberto Abad, editor, International Conference on Computational Processing of Portuguese (PROPOR 2012) - Demo Session, Coimbra, Portugal, April, 17-20 2012. Google Scholar
  36. Diogo Oliveira. Extraction and classification of named entities. Master’s thesis, Instituto Superior Técnico - Universidade Técnica de Lisboa Universidade Técnica de Lisboa, Lsuperscript2F/INESC-ID, Lisboa, 2010. Google Scholar
  37. Carlos Ramisch, Silvio Cordeiro, Agata Savary, Veronika Vincze, Verginica Mititelu, Archna Bhatia, Maja Buljan, Marie Candito, Polona Gantar, Voula Giouli, et al. Edition 1.1 of the parseme shared task on automatic identification of verbal multiword expressions. In Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions, 2018. Google Scholar
  38. Elisabete Ranchhod. Sintaxe dos predicados nominais com "estar". Instituto Nacional de Investigação Científica (INIC), 1990. Google Scholar
  39. Amanda Rassi, Nuno Mamede, Jorge Baptista, and Oto Vale. I. Integrating support verb constructions into a parser. In Proceedings of the Symposium in Information and Human Language Technology (STIL'2015), pages 57-62, 2015. Google Scholar
  40. Amanda Rassi, Cristina Santos-Turati, Jorge Baptista, Nuno Mamede, and Oto Vale. The fuzzy boundaries of operator verb and support verb constructions with dar "give" and ter "have" in Brazilian Portuguese. In Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014), COLING 2014, Dublin, Ireland, August 2014. Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014), COLING 2014, Dublin, August 24, 2014, COLING 2014. Google Scholar
  41. Amanda P. Rassi. Descrição, classificação e processamento automático das construções com o verbo dar em português brasileiro. PhD thesis, Universidade Federal de São Carlos, São Carlos-SP, Brasil, 2015. Google Scholar
  42. Amanda. P. Rassi, Jorge Baptista, and Oto Araújo Vale. Um corpus anotado de construções com verbo-suporte em português. Gragoatá, 39(1):207-230, June, 2015 2015. Google Scholar
  43. Amanda. P. Rassi, N. P. Calcia, Oto A. Vale, and Jorge Baptista. Análise comparativa das construções conversas em português do brasil e português europeu. In Abstracts from I Congresso Internacional de Estudos do Léxico e suas Interfaces (CINELI), page 45, Araraquara, SP (Brazil), March 2014. Congresso Internacional de Estudos do Léxico e suas Interfaces (CINELI), FCL-UNESP. Google Scholar
  44. I. A. Sag, T. Baldwin, F. Bond, A. Copestake, and D. Flickinger. Multiword Expressions: A Pain in the Neck for NLP. In Proceedings of Computational Linguistics and Intelligent Text Processing, volume 2276 of LNAI/LNCS, pages 1-15, Berlin, 2002. 3rd International Conference CICLing-2002, Springer. Google Scholar
  45. Cristina Santos. Construções com verbo-suporte ter no Português do Brasilrasil. PhD thesis, Universidade Federal de São Carlos, São Carlos-SP, Brasil, 2015. Google Scholar
  46. Agata Savary, Carlos Ramisch, Silvio Cordeiro, Federico Sangati, Veronika Vincze, Behrang QasemiZadeh, Marie Candito, Fabienne Cap, Voula Giouli, and Ivelina Stoyanova. The PARSEME shared task on automatic identification of verbal multiword expressions. In 13th Workshop on Multiword Expressions (MWE), pages 31-47, 2017. URL: https://doi.org/10.18653/v1/W17-1704.
  47. Agata Savary, Manfred Sailer, Yannick Parmentier, Michael Rosner, Victoria Rosén, Adam Przepiórkowski, Cvetana Krstev, Veronika Vincze, Beata Wójtowicz, Gyri Smørdal Losnegaard, et al. PARSEME-PARSing and multiword expressions within a European multilingual network. In Multiword expressions at length and in depth: Extended papers from the MWE workshop, 2015. Google Scholar
  48. Rui Talhadas. Semantic Role Labelling in European Portuguese. Master’s thesis, Universidade do Algarve, Faculdade de Ciências Humanas e Sociais, Faro, Portugal, 2014. Google Scholar
  49. Rui Talhadas, Jorge Baptista, and Nuno Mamede. Semantic roles annotation guidelines. Technical report, L2F/INESC ID Lisboa, 2013. Google Scholar
  50. Rui Talhadas, Nuno Mamede, and Jorge Baptista. Semantic Roles for Portuguese Verbs. In Jorge Baptista and Mario Monteleone, editors, Proceedings of the 32nd International Conference on Lexis and Grammar (CLG'2013), pages 127-132, Faro, Portugal, September 10-14, 2013 2013. CLG'2103, Universidade do Algarve - FCHS. Google Scholar
  51. Aldina Vaza. Estruturas com nomes predicativos e verbo-suporte dar. Master’s thesis, Faculdade de Letras da Universidade de Lisboa, Lisboa, Portugal, 1988. Google Scholar
  52. Nicolas Zampieri, Carlos Ramisch, and Géraldine Damnati. The impact of word representations on sequential neural MWE identification. In Joint Workshop on Multiword Expressions and WordNet (MWE-WN), pages 169-175, 2019. URL: https://doi.org/10.18653/v1/W19-5121.
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail