DROPS

Document

Short Paper

DOI: 10.4230/OASIcs.SLATE.2023.10

Large Language Models: Compilers for the 4^{th} Generation of Programming Languages? (Short Paper)

Authors: Francisco S. Marcondes, José João Almeida, and Paulo Novais

Published in: OASIcs, Volume 113, 12th Symposium on Languages, Applications and Technologies (SLATE 2023)

Abstract

This paper explores the possibility of large language models as a fourth generation programming language compiler. This is based on the idea that large language models are able to translate a natural language specification into a program written in a particular programming language. In other words, just as high-level languages provided an additional language abstraction to assembly code, large language models can provide an additional language abstraction to high-level languages. This interpretation allows large language models to be thought of through the lens of compiler theory, leading to insightful conclusions.

Cite as

Francisco S. Marcondes, José João Almeida, and Paulo Novais. Large Language Models: Compilers for the 4^{th} Generation of Programming Languages? (Short Paper). In 12th Symposium on Languages, Applications and Technologies (SLATE 2023). Open Access Series in Informatics (OASIcs), Volume 113, pp. 10:1-10:8, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2023)

Copy BibTex To Clipboard

@InProceedings{s.marcondes_et_al:OASIcs.SLATE.2023.10,
  author =	{S. Marcondes, Francisco and Almeida, Jos\'{e} Jo\~{a}o and Novais, Paulo},
  title =	{{Large Language Models: Compilers for the 4^\{th\} Generation of Programming Languages?}},
  booktitle =	{12th Symposium on Languages, Applications and Technologies (SLATE 2023)},
  pages =	{10:1--10:8},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-291-4},
  ISSN =	{2190-6807},
  year =	{2023},
  volume =	{113},
  editor =	{Sim\~{o}es, Alberto and Ber\'{o}n, Mario Marcelo and Portela, Filipe},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2023.10},
  URN =		{urn:nbn:de:0030-drops-185240},
  doi =		{10.4230/OASIcs.SLATE.2023.10},
  annote =	{Keywords: programming language, compiler, large language model}
}

Document

DOI: 10.4230/OASIcs.SLATE.2018.5

eOS: The Exercise Operating System

Authors: Rui Mendes and José João Almeida

Published in: OASIcs, Volume 62, 7th Symposium on Languages, Applications and Technologies (SLATE 2018)

Abstract

We present an architecture for a system for creating, adapting and evaluating programming exercises for students. The system is capable of generating exercise skeletons, automatically creating inputs and outputs, provide a way of creating a large number of exercises programmatically and allowing students to solve them while giving them feedback. Furthermore, it allows the creation of special comparators that can check whether the output of a given submission is equivalent to the expected one or simply check whether the above mentioned output corresponds to a correct solution.

Cite as

Rui Mendes and José João Almeida. eOS: The Exercise Operating System. In 7th Symposium on Languages, Applications and Technologies (SLATE 2018). Open Access Series in Informatics (OASIcs), Volume 62, pp. 5:1-5:13, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2018)

Copy BibTex To Clipboard

@InProceedings{mendes_et_al:OASIcs.SLATE.2018.5,
  author =	{Mendes, Rui and Almeida, Jos\'{e} Jo\~{a}o},
  title =	{{eOS: The Exercise Operating System}},
  booktitle =	{7th Symposium on Languages, Applications and Technologies (SLATE 2018)},
  pages =	{5:1--5:13},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-072-9},
  ISSN =	{2190-6807},
  year =	{2018},
  volume =	{62},
  editor =	{Henriques, Pedro Rangel and Leal, Jos\'{e} Paulo and Leit\~{a}o, Ant\'{o}nio Menezes and Guinovart, Xavier G\'{o}mez},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2018.5},
  URN =		{urn:nbn:de:0030-drops-92630},
  doi =		{10.4230/OASIcs.SLATE.2018.5},
  annote =	{Keywords: domain specific language, code generation, automatic evaluation, testing}
}

Document

Short Paper

DOI: 10.4230/OASIcs.SLATE.2018.8

Abcl: Abc music notation with rich chord support (Short Paper)

Authors: José João Almeida

Published in: OASIcs, Volume 62, 7th Symposium on Languages, Applications and Technologies (SLATE 2018)

Abstract

It is well known the relevance of accompany chords but there is a lack of tools capable of automatically generating sound from them. In this paper we describe a domain specific language (Abcl) aimed to be a prototyping environment for new experimental music operators. Currently Abcl: (1) adds support for accompany chords (chordmode, instruments, chord-lines); (2) adds clearer support for percussion (drums, drum-machine) (3) adds a support for variables and functions. Abcl tool is a syntactic-preprocessor that produces Abc. The DSLToolkit, used to create Abcl, is also briefly presented and discussed in the paper.

Cite as

José João Almeida. Abcl: Abc music notation with rich chord support (Short Paper). In 7th Symposium on Languages, Applications and Technologies (SLATE 2018). Open Access Series in Informatics (OASIcs), Volume 62, pp. 8:1-8:8, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2018)

Copy BibTex To Clipboard

@InProceedings{almeida:OASIcs.SLATE.2018.8,
  author =	{Almeida, Jos\'{e} Jo\~{a}o},
  title =	{{Abcl: Abc music notation with rich chord support}},
  booktitle =	{7th Symposium on Languages, Applications and Technologies (SLATE 2018)},
  pages =	{8:1--8:8},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-072-9},
  ISSN =	{2190-6807},
  year =	{2018},
  volume =	{62},
  editor =	{Henriques, Pedro Rangel and Leal, Jos\'{e} Paulo and Leit\~{a}o, Ant\'{o}nio Menezes and Guinovart, Xavier G\'{o}mez},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2018.8},
  URN =		{urn:nbn:de:0030-drops-92662},
  doi =		{10.4230/OASIcs.SLATE.2018.8},
  annote =	{Keywords: music, Abc music notation, domain specific language}
}

Document

Short Paper

DOI: 10.4230/OASIcs.SLATE.2018.19

Predicting Performance Problems Through Emotional Analysis (Short Paper)

Authors: Ricardo Martins, José João Almeida, Pedro Rangel Henriques, and Paulo Novais

Published in: OASIcs, Volume 62, 7th Symposium on Languages, Applications and Technologies (SLATE 2018)

Abstract

In the cartoons, every time a character is nervous he/she begins to count to ten to keep calm. This is a technique, among hundreds, that helps to control the emotional state. However, what would be the impact if the emotions would not be controlled? Are the emotions important in terms of impairing the ability to perform tasks correctly? Using a case study of typing text, this paper is about a process to predict the number of writing errors from a person based on the emotional state and some characteristics of the writing process. Using preprocessing techniques, lexicon-based approaches and machine learning, we achieved a percentage of 80% of correct values, when considering the emotional profile on the writing style.

Cite as

Ricardo Martins, José João Almeida, Pedro Rangel Henriques, and Paulo Novais. Predicting Performance Problems Through Emotional Analysis (Short Paper). In 7th Symposium on Languages, Applications and Technologies (SLATE 2018). Open Access Series in Informatics (OASIcs), Volume 62, pp. 19:1-19:9, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2018)

Copy BibTex To Clipboard

@InProceedings{martins_et_al:OASIcs.SLATE.2018.19,
  author =	{Martins, Ricardo and Almeida, Jos\'{e} Jo\~{a}o and Henriques, Pedro Rangel and Novais, Paulo},
  title =	{{Predicting Performance Problems Through Emotional Analysis}},
  booktitle =	{7th Symposium on Languages, Applications and Technologies (SLATE 2018)},
  pages =	{19:1--19:9},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-072-9},
  ISSN =	{2190-6807},
  year =	{2018},
  volume =	{62},
  editor =	{Henriques, Pedro Rangel and Leal, Jos\'{e} Paulo and Leit\~{a}o, Ant\'{o}nio Menezes and Guinovart, Xavier G\'{o}mez},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2018.19},
  URN =		{urn:nbn:de:0030-drops-92773},
  doi =		{10.4230/OASIcs.SLATE.2018.19},
  annote =	{Keywords: emotion analysis, machine learning, natural processing language}
}

Document

DOI: 10.4230/DagRep.7.12.18

Multi-Level Modelling (Dagstuhl Seminar 17492)

Authors: João Paulo A. Almeida, Ulrich Frank, and Thomas Kühne

Published in: Dagstuhl Reports, Volume 7, Issue 12 (2018)

Abstract

This report documents the program and the outcomes of Dagstuhl Seminar 17492 "Multi-Level Modelling". This seminar brought together researchers and industry practitioners from the fields of conceptual modeling, ontologies, and formal foundations to discuss and share the benefits of Multi-Level Modelling (MLM), to develop an agreement on MLM terminology and scope, and to drive future research directions in MLM. Some foundational presentations were given by the seminar organizers to ground the discussions and provide an initial set of open questions which would lead to the formation of the working groups. In addition, six industry representatives gave talks explaining the needs, challenges, utility, and possible issues with adoption of MLM in industry. Based on the original seminar goals, the talks, and the resulting discussions, four working groups were established to investigate: the formal and ontological "Foundations"of MLM; promising "Applications" and potential evaluation criteria for MLM methods; the "Dynamic Aspects" of MLM, such as processes and behaviour; and, the use of and impact on "Model Transformations" in the context of MLM.

Cite as

João Paulo A. Almeida, Ulrich Frank, and Thomas Kühne. Multi-Level Modelling (Dagstuhl Seminar 17492). In Dagstuhl Reports, Volume 7, Issue 12, pp. 18-49, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2018)

Copy BibTex To Clipboard

@Article{almeida_et_al:DagRep.7.12.18,
  author =	{Almeida, Jo\~{a}o Paulo A. and Frank, Ulrich and K\"{u}hne, Thomas},
  title =	{{Multi-Level Modelling (Dagstuhl Seminar 17492)}},
  pages =	{18--49},
  journal =	{Dagstuhl Reports},
  ISSN =	{2192-5283},
  year =	{2018},
  volume =	{7},
  number =	{12},
  editor =	{Almeida, Jo\~{a}o Paulo A. and Frank, Ulrich and K\"{u}hne, Thomas},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/DagRep.7.12.18},
  URN =		{urn:nbn:de:0030-drops-86758},
  doi =		{10.4230/DagRep.7.12.18},
  annote =	{Keywords: metamodeling, multi-level modeling}
}

Document

DOI: 10.4230/OASIcs.SLATE.2017.21

Adapting Speech Recognition in Augmented Reality for Mobile Devices in Outdoor Environments

Authors: Rui Pascoal, Ricardo Ribeiro, Fernando Batista, and Ana de Almeida

Published in: OASIcs, Volume 56, 6th Symposium on Languages, Applications and Technologies (SLATE 2017)

Abstract

This paper describes the process of integrating automatic speech recognition (ASR) into a mobile application and explores the benefits and challenges of integrating speech with augmented reality (AR) in outdoor environments. The augmented reality allows end-users to interact with the information displayed and perform tasks, while increasing the user’s perception about the real world by adding virtual information to it. Speech is the most natural way of communication: it allows hands-free interaction and may allow end-users to quickly and easily access a range of features available. Speech recognition technology is often available in most of the current mobile devices, but it often uses Internet to receive the corresponding transcript from remote servers, e.g., Google speech recognition. However, in some outdoor environments, Internet is not always available or may be offered at poor quality. We integrated an off-line automatic speech recognition module into an AR application for outdoor usage that does not require Internet. Currently, speech interaction is used within the application to access five different features, namely: to take a photo, shoot a film, communicate, messaging related tasks, and to request information, either geographic, biometric, or climatic. The application makes available solutions to manage and interact with the mobile device, offering good usability. We have compared the online and off-line speech recognition systems in order to assess their adequacy to the tasks. Both systems were tested under different conditions, commonly found in outdoor environments, such as: Internet access quality, presence of noise, and distractions.

Cite as

Rui Pascoal, Ricardo Ribeiro, Fernando Batista, and Ana de Almeida. Adapting Speech Recognition in Augmented Reality for Mobile Devices in Outdoor Environments. In 6th Symposium on Languages, Applications and Technologies (SLATE 2017). Open Access Series in Informatics (OASIcs), Volume 56, pp. 21:1-21:14, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2017)

Copy BibTex To Clipboard

@InProceedings{pascoal_et_al:OASIcs.SLATE.2017.21,
  author =	{Pascoal, Rui and Ribeiro, Ricardo and Batista, Fernando and de Almeida, Ana},
  title =	{{Adapting Speech Recognition in Augmented Reality for Mobile Devices in Outdoor Environments}},
  booktitle =	{6th Symposium on Languages, Applications and Technologies (SLATE 2017)},
  pages =	{21:1--21:14},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-056-9},
  ISSN =	{2190-6807},
  year =	{2017},
  volume =	{56},
  editor =	{Queir\'{o}s, Ricardo and Pinto, M\'{a}rio and Sim\~{o}es, Alberto and Leal, Jos\'{e} Paulo and Varanda, Maria Jo\~{a}o},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2017.21},
  URN =		{urn:nbn:de:0030-drops-79541},
  doi =		{10.4230/OASIcs.SLATE.2017.21},
  annote =	{Keywords: Speech Recognition, Natural Language Processing, Sphinx for Mobile Devices, Augmented Reality, Outdoor Environments}
}

@InProceedings{pascoal_et_al:OASIcs.SLATE.2017.21,
  author =	{Pascoal, Rui and Ribeiro, Ricardo and Batista, Fernando and de Almeida, Ana},
  title =	{{Adapting Speech Recognition in Augmented Reality for Mobile Devices in Outdoor Environments}},
  booktitle =	{6th Symposium on Languages, Applications and Technologies (SLATE 2017)},
  pages =	{21:1--21:14},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-056-9},
  ISSN =	{2190-6807},
  year =	{2017},
  volume =	{56},
  editor =	{Queir\'{o}s, Ricardo and Pinto, M\'{a}rio and Sim\~{o}es, Alberto and Leal, Jos\'{e} Paulo and Varanda, Maria Jo\~{a}o},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2017.21},
  URN =		{urn:nbn:de:0030-drops-79541},
  doi =		{10.4230/OASIcs.SLATE.2017.21},
  annote =	{Keywords: Speech Recognition, Natural Language Processing, Sphinx for Mobile Devices, Augmented Reality, Outdoor Environments}
}

Document

DOI: 10.4230/OASIcs.SLATE.2016.10

Context-Free Grammars: Exercise Generation and Probabilistic Assessment

Authors: José João Almeida, Eliana Grande, and Georgi Smirnov

Published in: OASIcs, Volume 51, 5th Symposium on Languages, Applications and Technologies (SLATE'16) (2016)

Abstract

In this paper we present a metagrammar based algorithm for exercise generation in the domain of context-free grammars. We also propose a probabilistic assessment algorithm based on a new identity theorem for formal series, a matrix version of the well-known identity theorem from the theory of analytic functions.

Cite as

José João Almeida, Eliana Grande, and Georgi Smirnov. Context-Free Grammars: Exercise Generation and Probabilistic Assessment. In 5th Symposium on Languages, Applications and Technologies (SLATE'16). Open Access Series in Informatics (OASIcs), Volume 51, pp. 10:1-10:8, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2016)

Copy BibTex To Clipboard

@InProceedings{almeida_et_al:OASIcs.SLATE.2016.10,
  author =	{Almeida, Jos\'{e} Jo\~{a}o and Grande, Eliana and Smirnov, Georgi},
  title =	{{Context-Free Grammars: Exercise Generation and Probabilistic Assessment}},
  booktitle =	{5th Symposium on Languages, Applications and Technologies (SLATE'16)},
  pages =	{10:1--10:8},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-006-4},
  ISSN =	{2190-6807},
  year =	{2016},
  volume =	{51},
  editor =	{Mernik, Marjan and Leal, Jos\'{e} Paulo and Gon\c{c}alo Oliveira, Hugo},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2016.10},
  URN =		{urn:nbn:de:0030-drops-60159},
  doi =		{10.4230/OASIcs.SLATE.2016.10},
  annote =	{Keywords: Exercise generation, context-free grammars, assessment}
}

Document

DOI: 10.4230/OASIcs.SLATE.2016.14

Building a Dictionary using XML Technology

Authors: Alberto Simões, José João Almeida, and Ana Salgado

Published in: OASIcs, Volume 51, 5th Symposium on Languages, Applications and Technologies (SLATE'16) (2016)

Abstract

In this article we describe the workflow implemented to convert a dictionary saved as a PDF file into an XML document and posterior importation into an XML aware database, and the process to edit, add and delete new entries. The conversion process was challenging given the format of the PDF file, and the fine grained detail of the XML schema that was used. For that, an iterative filtering approach was used. To store the dictionary we decided to use an XML aware database (eXist-DB), that stores each dictionary entry as a separate resource. It can be queried used a web interface developed using XQuery. The lexicographers can edit entries using the oXygen XML editor, reading and storing them directly in the database. In order to guarantee incremental backups, it was defined a mechanism to import the XML database into a GIT repository. Finally, a couple of programs were created in order to prepare regular reports on the dictionary revision process, as well as to backup it in a GIT repository.

Cite as

Alberto Simões, José João Almeida, and Ana Salgado. Building a Dictionary using XML Technology. In 5th Symposium on Languages, Applications and Technologies (SLATE'16). Open Access Series in Informatics (OASIcs), Volume 51, pp. 14:1-14:8, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2016)

Copy BibTex To Clipboard

@InProceedings{simoes_et_al:OASIcs.SLATE.2016.14,
  author =	{Sim\~{o}es, Alberto and Almeida, Jos\'{e} Jo\~{a}o and Salgado, Ana},
  title =	{{Building a Dictionary using XML Technology}},
  booktitle =	{5th Symposium on Languages, Applications and Technologies (SLATE'16)},
  pages =	{14:1--14:8},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-006-4},
  ISSN =	{2190-6807},
  year =	{2016},
  volume =	{51},
  editor =	{Mernik, Marjan and Leal, Jos\'{e} Paulo and Gon\c{c}alo Oliveira, Hugo},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2016.14},
  URN =		{urn:nbn:de:0030-drops-60193},
  doi =		{10.4230/OASIcs.SLATE.2016.14},
  annote =	{Keywords: XML databases, dictionaries, XQuery, PDF files}
}

Document

DOI: 10.4230/OASIcs.SLATE.2014.19

Conclave: Writing Programs to Understand Programs

Authors: Nuno Ramos Carvalho, José João Almeida, Maria João Varanda Pereira, and Pedro Rangel Henriques

Published in: OASIcs, Volume 38, 3rd Symposium on Languages, Applications and Technologies (2014)

Abstract

Software maintainers are often challenged with source code changes to improve software systems, or eliminate defects, in unfamiliar programs. To undertake these tasks a sufficient understanding of the system, or at least a small part of it, is required. One of the most time consuming tasks of this process is locating which parts of the code are responsible for some key functionality or feature. This paper introduces Conclave, an environment for software analysis, that enhances program comprehension activities. Programmers use natural languages to describe and discuss the problem domain, programming languages to write source code, and markup languages to have programs talking with other programs, and so this system has to cope with this heterogeneity of dialects, and provide tools in all these areas to effectively contribute to the understanding process. The source code, the problem domain, and the side effects of running the program are represented in the system using ontologies. A combination of tools (specialized in different kinds of languages) create mappings between the different domains. Conclave provides facilities for feature location, code search, and views of the software that ease the process of understanding the code, devising changes. The underlying feature location technique explores natural language terms used in programs (e.g. function and variable names); using textual analysis and a collection of Natural Language Processing techniques, computes synonymous sets of terms. These sets are used to score relatedness between program elements, and search queries or problem domain concepts, producing sorted ranks of program elements that address the search criteria, or concepts respectively.

Cite as

Nuno Ramos Carvalho, José João Almeida, Maria João Varanda Pereira, and Pedro Rangel Henriques. Conclave: Writing Programs to Understand Programs. In 3rd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 38, pp. 19-34, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2014)

Copy BibTex To Clipboard

@InProceedings{carvalho_et_al:OASIcs.SLATE.2014.19,
  author =	{Carvalho, Nuno Ramos and Almeida, Jos\'{e} Jo\~{a}o and Pereira, Maria Jo\~{a}o Varanda and Henriques, Pedro Rangel},
  title =	{{Conclave: Writing Programs to Understand Programs}},
  booktitle =	{3rd Symposium on Languages, Applications and Technologies},
  pages =	{19--34},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-68-2},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{38},
  editor =	{Pereira, Maria Jo\~{a}o Varanda and Leal, Jos\'{e} Paulo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2014.19},
  URN =		{urn:nbn:de:0030-drops-45561},
  doi =		{10.4230/OASIcs.SLATE.2014.19},
  annote =	{Keywords: software maintenance, software evolution, program comprehension, feature location, concept location, natural language processing}
}

@InProceedings{carvalho_et_al:OASIcs.SLATE.2014.19,
  author =	{Carvalho, Nuno Ramos and Almeida, Jos\'{e} Jo\~{a}o and Pereira, Maria Jo\~{a}o Varanda and Henriques, Pedro Rangel},
  title =	{{Conclave: Writing Programs to Understand Programs}},
  booktitle =	{3rd Symposium on Languages, Applications and Technologies},
  pages =	{19--34},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-68-2},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{38},
  editor =	{Pereira, Maria Jo\~{a}o Varanda and Leal, Jos\'{e} Paulo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2014.19},
  URN =		{urn:nbn:de:0030-drops-45561},
  doi =		{10.4230/OASIcs.SLATE.2014.19},
  annote =	{Keywords: software maintenance, software evolution, program comprehension, feature location, concept location, natural language processing}
}

Document

DOI: 10.4230/OASIcs.SLATE.2014.77

A Workflow Description Language to Orchestrate Multi-Lingual Resources

Authors: Rui Brito and José João Almeida

Published in: OASIcs, Volume 38, 3rd Symposium on Languages, Applications and Technologies (2014)

Abstract

Texts aligned alongside their translation, or Parallel Corpora, are a very widely used resource in Computational Linguistics. Processing these resources, however, is a very intensive, time consuming task, which makes it a suitable case study for High Performance Computing (HPC). HPC underwent several recent changes, with the evolution of Heterogeneous Platforms, where multiple devices with different architectures are able to share workload to increase performance. Several frameworks/toolkits have been under development, in various fields, to aid the programmer in extracting more performance from these platforms. Either by dynamically scheduling the workload across the available resources or by exploring the opportunities for parallelism. However, there is no toolkit targeted at Computational Linguistics, more specifically, Parallel Corpora processing. Parallel Corpora processing can be a very time consuming task, and the field could definitely use a toolkit which aids the programmer in achieving not only better performance, but also a convenient and expressive way of specifying tasks and their dependencies.

Cite as

Rui Brito and José João Almeida. A Workflow Description Language to Orchestrate Multi-Lingual Resources. In 3rd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 38, pp. 77-83, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2014)

Copy BibTex To Clipboard

@InProceedings{brito_et_al:OASIcs.SLATE.2014.77,
  author =	{Brito, Rui and Almeida, Jos\'{e} Jo\~{a}o},
  title =	{{A Workflow Description Language to Orchestrate Multi-Lingual Resources}},
  booktitle =	{3rd Symposium on Languages, Applications and Technologies},
  pages =	{77--83},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-68-2},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{38},
  editor =	{Pereira, Maria Jo\~{a}o Varanda and Leal, Jos\'{e} Paulo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2014.77},
  URN =		{urn:nbn:de:0030-drops-45609},
  doi =		{10.4230/OASIcs.SLATE.2014.77},
  annote =	{Keywords: workflow, orchestration, parallelism, domain specific languages, corpora}
}

Document

DOI: 10.4230/OASIcs.SLATE.2014.251

Language Identification: a Neural Network Approach

Authors: Alberto Simões, José João Almeida, and Simon D. Byers

Published in: OASIcs, Volume 38, 3rd Symposium on Languages, Applications and Technologies (2014)

Abstract

One of the first tasks when building a Natural Language application is the detection of the used language in order to adapt the system to that language. This task has been addressed several times. Nevertheless most of these attempts were performed a long time ago when the amount of computer data and the computational power were limited. In this article we analyze and explain the use of a neural network for language identification, where features can be extracted automatically, and therefore, easy to adapt to new languages. In our experiments we got some surprises, namely with the two Chinese variants, whose forced us for some language-dependent tweaking of the neural network. At the end, the network had a precision of 95%, only failing for the Portuguese language.

Cite as

Alberto Simões, José João Almeida, and Simon D. Byers. Language Identification: a Neural Network Approach. In 3rd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 38, pp. 251-265, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2014)

Copy BibTex To Clipboard

@InProceedings{simoes_et_al:OASIcs.SLATE.2014.251,
  author =	{Sim\~{o}es, Alberto and Almeida, Jos\'{e} Jo\~{a}o and Byers, Simon D.},
  title =	{{Language Identification: a Neural Network Approach}},
  booktitle =	{3rd Symposium on Languages, Applications and Technologies},
  pages =	{251--265},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-68-2},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{38},
  editor =	{Pereira, Maria Jo\~{a}o Varanda and Leal, Jos\'{e} Paulo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2014.251},
  URN =		{urn:nbn:de:0030-drops-45749},
  doi =		{10.4230/OASIcs.SLATE.2014.251},
  annote =	{Keywords: language identification, neural networks, language models, trigrams}
}

Document

DOI: 10.4230/OASIcs.SLATE.2014.283

MLT-prealigner: a Tool for Multilingual Text Alignment

Authors: Pedro Carvalho and José João Almeida

Published in: OASIcs, Volume 38, 3rd Symposium on Languages, Applications and Technologies (2014)

Abstract

Parallel text alignment is a key procedure in the automated translation area. A large number of aligners have been presented along the years, but these require that the target resources have been pre-prepared for alignment (either manually or automatically). It is rather normal to encounter mixed language documents, that is, documents where the same information is written in many languages (Ex: manuals of electronic devices, touristic information, PhD thesis with dual language abstracts, etc). In this article we present MLT-prealigner: a tool aimed at helping those that need to process mixed texts in order to feed alignment tools and other related language systems.

Cite as

Pedro Carvalho and José João Almeida. MLT-prealigner: a Tool for Multilingual Text Alignment. In 3rd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 38, pp. 283-290, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2014)

Copy BibTex To Clipboard

@InProceedings{carvalho_et_al:OASIcs.SLATE.2014.283,
  author =	{Carvalho, Pedro and Almeida, Jos\'{e} Jo\~{a}o},
  title =	{{MLT-prealigner: a Tool for Multilingual Text Alignment}},
  booktitle =	{3rd Symposium on Languages, Applications and Technologies},
  pages =	{283--290},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-68-2},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{38},
  editor =	{Pereira, Maria Jo\~{a}o Varanda and Leal, Jos\'{e} Paulo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2014.283},
  URN =		{urn:nbn:de:0030-drops-45776},
  doi =		{10.4230/OASIcs.SLATE.2014.283},
  annote =	{Keywords: parallel corpora, multilingual text alignment, language detection, Perl, automated translation}
}

Document

DOI: 10.4230/OASIcs.SLATE.2013.203

ABC with a UNIX Flavor

Authors: Bruno M. Azevedo and José João Almeida

Published in: OASIcs, Volume 29, 2nd Symposium on Languages, Applications and Technologies (2013)

Abstract

abc is a simple, yet powerful, textual musical notation. This paper presents abc::dt, a rule-based domain-specific language (Perl embedded), designed to simplify the creation of abc processing tools. Inspired by the Unix philosophy, those tools intend to be simple and compositional in a Unix filters' way. From abc::dt's rules we obtain an abc processing tool whose main algorithm follows a traditional compiler architecture, thus consisting of three stages: 1) abc parser (based on abcmtops' parser), 2) abc semantic transformation (associated with abc attributes), 3) output generation (either a user defined or system provided abc generator).

Cite as

Bruno M. Azevedo and José João Almeida. ABC with a UNIX Flavor. In 2nd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 29, pp. 203-218, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2013)

Copy BibTex To Clipboard

@InProceedings{azevedo_et_al:OASIcs.SLATE.2013.203,
  author =	{Azevedo, Bruno M. and Almeida, Jos\'{e} Jo\~{a}o},
  title =	{{ABC with a UNIX Flavor}},
  booktitle =	{2nd Symposium on Languages, Applications and Technologies},
  pages =	{203--218},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-52-1},
  ISSN =	{2190-6807},
  year =	{2013},
  volume =	{29},
  editor =	{Leal, Jos\'{e} Paulo and Rocha, Ricardo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2013.203},
  URN =		{urn:nbn:de:0030-drops-40397},
  doi =		{10.4230/OASIcs.SLATE.2013.203},
  annote =	{Keywords: Music Processing, ABC Notation, Unix, Scripting, Compilers}
}

13 Search Results for "Almeida, Jo�o Paulo A."

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Abstract

Cite as

Thanks for your feedback!

Could not send message