RT Dissertation/Thesis
T1 Contribution to Natural Language Generation for Spanish
T2 Aportación a la Generación de Lenguaje Natural para español
A1 García Méndez, Silvia
K1 5701.04 Computational Linguistics
K1 5705.08 Semantics
K1 3304.16 Logic Design
AB In this thesis, we present our research in the field of Natural Language Generation (NLG). Our work represents an effort to bring NLG capabilities to the research community for the Spanish language. To this end, we present several contributions aimed at extending the state of the art in this research area. Accordingly, we give a detailed description of the resources created and the architectures designed for NLG, taking into consideration the main stages of the traditional pipeline: content determination, text structuring, lexicalisation and, finally, realisation. For this purpose, we created several linguistic resources, paying special attention to coverage and accuracy. They contain a wide range of linguistic data, that is, morphological, syntactic and semantic information: aLexiS (a Lexicon for Spanish), eLSA (Augmentative and Alternative Spanish Lexicon) and aLexiE (a Lexicon for English). This work is motivated by the lack of complete linguistic resources useful for real NLG applications, especially in the case of the Spanish language. Both aLexiS and aLexiE will be useful in many use cases, such as report generation. The eLSA lexicon, on the other hand, aims at improving NLG systems that help people diagnosed with communication disorders. In terms of libraries developed for NLG, we present several contributions. Firstly, we introduce the adaptation of the popular SimpleNLG library to Spanish and an enhanced version of it that automatically expands keywords into text. Both solutions can provide applications, such as web apps, with valuable NLG capabilities. Moreover, we present a modular and hybrid architecture for NLG. It combines linguistic knowledge and statistical information (a language model to infer prepositions) to address the NLG task automatically. As a result, our system is able to generate complete, coherent and grammatically and orthographically correct sentences in Spanish from the keywords provided by the users (such as adjectives, nouns and verbs). The main strength of the architecture is its modularity: its constituents (lexicon, grammar and realiser) can be reused or substituted to address other generation challenges or to improve the performance of the system. Moreover, our NLG architecture was designed to be efficient in terms of the time required to generate the output and to be easily extended to other languages, even ones that are not linguistically similar, such as Spanish and English. We demonstrate this feature by extending our NLG system to English. Both NLG systems presented, for Spanish and English, have been evaluated using metrics popular in the state of the art as well as manual annotations. Finally, the research results obtained are promising and encourage continued research in the field of automatic NLG systems.
YR 2021
FD 2021-02-19
LK http://hdl.handle.net/11093/1775
UL http://hdl.handle.net/11093/1775
LA eng
DS Investigo
RD 04-oct-2023