Document Open Access Logo

Towards a Morphological Analyzer for the Umbundu Language

Authors Alberto Simões , Bernardo Sacanene , Álvaro Iriarte , José João Almeida , Joaquim Macedo



PDF
Thumbnail PDF

File

OASIcs.SLATE.2020.10.pdf
  • Filesize: 0.75 MB
  • 11 pages

Document Identifiers

Author Details

Alberto Simões
  • 2Ai, School of Technology, IPCA, Barcelos, Portugal
Bernardo Sacanene
  • Centro de Estudos Humanísticos da Universidade do Minho, Braga, Portugal
Álvaro Iriarte
  • Centro de Estudos Humanísticos da Universidade do Minho, Braga, Portugal
José João Almeida
  • Algoritmi, Departamento de Informática, Universidade do Minho, Braga, Portugal
Joaquim Macedo
  • Algoritmi, Departamento de Informática, Universidade do Minho, Braga, Portugal

Cite AsGet BibTex

Alberto Simões, Bernardo Sacanene, Álvaro Iriarte, José João Almeida, and Joaquim Macedo. Towards a Morphological Analyzer for the Umbundu Language. In 9th Symposium on Languages, Applications and Technologies (SLATE 2020). Open Access Series in Informatics (OASIcs), Volume 83, pp. 10:1-10:11, Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2020)
https://doi.org/10.4230/OASIcs.SLATE.2020.10

Abstract

In this document we present the first developments on an Umbundu dictionary for a jSpell, a morphological analyzer. Initially some comments are performed regarding the Umbundu language morphology, followed by the discussion on jSpell dictionaries structure and its environment. Last, we describe the Umbundu dictionary bootstrap process and perform some final experiments on its coverage.

Subject Classification

ACM Subject Classification
  • Computing methodologies → Language resources
Keywords
  • Umbundu
  • Angolan Languages
  • Morphological Analysis
  • Spell Checking

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. José João Almeida and Ulisses Pinto. Jspell - um módulo para análise léxica genérica de linguagem natural. In Actas do X Encontro da Associação Portuguesa de Linguística (APL'1994), pages 1-15, 1995. Google Scholar
  2. Lluís V. Aracil. Papers de sociolingüística. Edicions La Magrana, Barcelona, 1982. Google Scholar
  3. Boubacar Diarra. Choice and description of national languages with regard to their utility in literacy and education in Angola. In A paper delivered at the UNESCO Expert Pool Meeting on Language issues in Literacy and Basic Education, 1992. Google Scholar
  4. Charles Ferguson. Diglossia. Word, 15:325-340, 1959. Google Scholar
  5. João Fernandes and Zavoni Ntondo. Angola: povos e línguas. Editorial Nzila, Luanda, 2002. Google Scholar
  6. Malcolm Guthrie. The classification of the Bantu languages. Oxford University Press, 1948. Google Scholar
  7. INE. Resultados definitivos do recenseamento geral da população e da habitação de angola. Technical report, Instituto Nacional de Estatística, Gabinete Central do Censo, Subcomissão de Difusão de Resultados, Luanda, 2016. Google Scholar
  8. Botelho Isalino Jimbi. A reflection on the umbundu corpus planning for the Angola education system: towards the harmonization of the catholic and the protestant orthographies. In Actas Do XIII Congress Internacional de Linguistica Xeral, page 475–482, 2018. Retrieved from URL: http://cilx2018.uvigo.gal/actas/pdf/661789.pdf.
  9. Francis Katamba. Bantu nominal morphology. In D. Nurse and G. Philippson, editors, The Bantu Language. Routledge Tayloer & Francis Group, London, 2014. Google Scholar
  10. Gregoire Le Guenec and José Francisco Valente. Dicionário Português-Umbundu. Escolar Editora, Lobito, 2010. Google Scholar
  11. Alberto Manuel Simões and José João Almeida. jspell.pm - um módulo de análise morfológica para uso em processamento de linguagem natural. In Actas da Associação Portuguesa de Linguística (APL'2001), pages 485-495, 2002. Google Scholar
  12. Universal Declaration of Linguistic Rights, 1996. (Barcelona Declaration) World Conference on Linguistic Rights, Barcelona, Espanha, Junho de 1996. Retrieved January 31, 2020, from URL: https://unesdoc.unesco.org/ark:/48223/pf0000104267.
  13. João Francisco Valente. Gramática Umbundu. A língua do centro de Angola. Junta de Investigação do Ultramar, Lisboa, 1964. Google Scholar
  14. Rui Vilela. Geração de dicionários para correcção ortográfica do português. Master’s thesis, Escola de Engenharia, Universidade do Minho, 2009. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail