“HARTA” de novelesun corpus de español académico

  1. Milka Villayandre Llamazares 1
  1. 1 Universidad de León
    info

    Universidad de León

    León, España

    ROR https://ror.org/02tzt0b78

Revista:
CHIMERA: Romance Corpora and Linguistic Studies

ISSN: 2386-2629

Año de publicación: 2018

Número: 5

Páginas: 131-140

Tipo: Artículo

DOI: 10.15366/CHIMERA2018.5.1.011 DIALNET GOOGLE SCHOLAR lock_openAcceso abierto editor

Otras publicaciones en: CHIMERA: Romance Corpora and Linguistic Studies

Resumen

The aim of this review is to account for the process of compilation and codification of the corpus HARTA-Noveles. This corpus was created as part of the research project titled “Corpus-based study of lexical combinations of academic Spanish for the development of a computational tool for academic writing assistance” (HARTA), under the direction of Margarita Alonso Ramos (University of La Coruña). The corpus consists of representative samples of essays produced by Spanish university students and gathered with the purpose of studying academic lexical combinations (CLA), i.e., recurrent segments specific to the academic domain, along with collocations, discourse markers and other multiword expressions. Inspired by the BAWE corpus (British Academic Written English), our corpus is formed exclusively by final project texts (for the degrees) and dissertations (for the masters) selected from different public repositories of Spanish universities and from various scientific domains. These texts have been annotated with an specific system adapted from that followed by the Spanish Royal Academy (RAE) in CORPES XXI

Referencias bibliográficas

  • Alonso Ramos, M. (ed.) 2016. Spanish Learner Corpus Research: Current trends and future perspectives. Amsterdam: John Benjamins.
  • Alsop, S. y Nesi, H. 2009. Issues in the development of the British Academic Written English (BAWE) corpus. Corpora 4 (1) 71-83.
  • CLAE: Corpus de lenguaje académico en español. 2009. UCMexus-CONACYT (Consultado el 1 de junio de 2018).
  • Ebeling, S and Heuboeck, A. 2007. Encoding document information in a corpus of student writing: the experience of the British Academic Written English (BAWE) corpus. Encoding document. Corpora 2 (2) 241-256
  • Nesi, H. 2008a. BAWE: An introduction to a new resource. In Frankenberg-Garcia, A., Rkibi, T., Braga da Cruz, M., Carvalho, R., Direito, C. & Santos-Rosa, D. (eds) Proceedings of the 8th Teaching and Language Corpora Conference. Held 4-6 July 2008 at the Instituto Superior de Línguas e Administração,. Lisbon, Portugal: ISLA: 239-246.
  • Nesi, H. 2008b. ‘Introducing BAWE: a new lexicographical resource.’ In Bernal, E. and DeCesaris, J. (eds.) Proceedings of the XII EURALEX International Congress. Held 15-19 July 2008 at Universitat Pompeu Fabra. Barcelona: Institut Universari de Linguistica Aplicada, Universitat Pompeu Fabra: 737-752.
  • Parodi, G., Ed. 2007. Lingüística de corpus y discursos especializados: Puntos de Mira. Valparaíso: Ediciones Universitarias de Valparaíso.
  • Parodi, G. 2009. El corpus académico y profesional del español PUCV-2006: semejanzas y diferencias entre los géneros académicos y profesionales. Estudios Filológicos 44, 123-147.
  • Real Academia Española. 2013. Corpus del español del siglo XXI (CORPES). Descripción del sistema de codificación. Libros y prensa. Madrid: Real Academia Española.