Presentan un segmentador del discurso para analizar sintácticamente textos en castellano

Fuente: Expert Systems with Applications ; 39 (2). Páginas 1671-1678. 1 de febrero de 2012. Doi: 10.1016/j.eswa.2011.06.058

Primera autora: Iria da Cunha.

Centro: Universidad Pompeu y Fabra.

SINC | 07 febrero 2012 13:02

Título: DiSeg 1.0: The first system for Spanish discourse segmentation

Resumen : Nowadays discourse parsing is a very prominent research topic. However, there is not a discourse parser for Spanish texts. The first stage in order to develop this tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and we evaluate its performance against a gold standard corpus, divided in a medical and a terminological subcorpus. We obtain promising results, which means that discourse segmentation is possible using shallow parsing

Autores : Iria da Cunha, Eric San Juan, Juan Manuel Torres-Moreno, Irene Castellone.

Direcciones :

1. Univ Pompeu Fabra, Inst Univ Linguist Aplicada, Barcelona 08018, Spain

2. Univ Avignon & Pays Vaucluse, Lab Informat Avignon, F-84911 Avignon 9, France

3. Univ Nacl Autonoma Mexico, Inst Ingn, Mexico City 04510, DF, Mexico

4. Ecole Polytech, Montreal, PQ H3C 3A7, Canada

5. Univ Barcelona, E-08007 Barcelona, Spain

Si eres periodista y quieres el contacto con los investigadores, regístrate en SINC como periodista.

Zona geográfica: Internacional
Fuente: SINC

Comentarios

Queremos saber tu opinión