Fuente: Expert Systems with Applications ; 39 (2). Páginas 1671-1678. 1 de febrero de 2012. Doi: 10.1016/j.eswa.2011.06.058
Primera autora: Iria da Cunha.
Centro: Universidad Pompeu y Fabra.
Título: DiSeg 1.0: The first system for Spanish discourse segmentation
Resumen : Nowadays discourse parsing is a very prominent research topic. However, there is not a discourse parser for Spanish texts. The first stage in order to develop this tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and we evaluate its performance against a gold standard corpus, divided in a medical and a terminological subcorpus. We obtain promising results, which means that discourse segmentation is possible using shallow parsing
Autores : Iria da Cunha, Eric San Juan, Juan Manuel Torres-Moreno, Irene Castellone.
Direcciones :
1. Univ Pompeu Fabra, Inst Univ Linguist Aplicada, Barcelona 08018, Spain
2. Univ Avignon & Pays Vaucluse, Lab Informat Avignon, F-84911 Avignon 9, France
3. Univ Nacl Autonoma Mexico, Inst Ingn, Mexico City 04510, DF, Mexico
4. Ecole Polytech, Montreal, PQ H3C 3A7, Canada
5. Univ Barcelona, E-08007 Barcelona, Spain
Si eres periodista y quieres el contacto con los investigadores, regístrate en SINC como periodista.