The representativeness threshold for the CETA subcorpus of the Coruña Corpus
| UDC.coleccion | Investigación | es_ES |
| UDC.departamento | Letras | es_ES |
| UDC.endPage | 139 | es_ES |
| UDC.grupoInv | Research Group for Multidimensional Corpus-Based Studies in English (MUSTE) | es_ES |
| UDC.issue | 2 | es_ES |
| UDC.journalTitle | Revista de lenguas para fines específicos | es_ES |
| UDC.startPage | 125 | es_ES |
| UDC.volume | 27 | es_ES |
| dc.contributor.author | Alfaya Lamas, Elena | |
| dc.contributor.author | Garrote Espantoso, Menchu | |
| dc.date.accessioned | 2024-07-05T11:29:14Z | |
| dc.date.available | 2024-07-05T11:29:14Z | |
| dc.date.issued | 2021-09 | |
| dc.description.abstract | [Resumen] The concept of representativeness is the main distinguishing characteristic of specialised corpora in comparison to other sets of texts. The Coruña Corpus of English Scientific Writing currently comprises four published subcorpora (astronomy, life sciences, history, and philosophy) plus three others under compilation (physics, chemistry and linguistics). In this paper we aim to assess the lexical density of the text samples in CETA, the Corpus of English Texts on Astronomy, by means of the ReCor tool, a posteriori. The study is motivated by the following question: does quantitative representativeness analysis using ReCor provide, in the form of a cross-check, further validation of previous research on the representativeness of CETA? Previous work (Crespo and Moskowich, 2010) has indicated that the CETA corpus is well designed and valid for the purposes for which it was intended. We will here suggest metrics to measure these findings. The most important contribution of this study is to offer quantitative data collection results using the ReCor tool, which allows data triangulation and consequently ensures overall data quality. Results show that data analysis with the ReCor tool supports previous findings, and thus we are able to verify that CETA is indeed representative of the language of its time and register. | es_ES |
| dc.identifier.citation | Alfaya Lamas, Elena and Garrote Espantoso, Menchu. 2021. The representativeness threshold for the CETA subcorpus of the Coruña Corpus. Revista de lenguas para fines específicos 27.2, pp. 125-139 · https://doi.org/10.20420/rlfe.2021.440 | es_ES |
| dc.identifier.uri | http://hdl.handle.net/2183/37756 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Universidad de Las Palmas de Gran Canaria | es_ES |
| dc.relation.uri | https://doi.org/10.20420/rlfe.2021.440 | es_ES |
| dc.rights | Atribución-NoComercial-SinDerivadas 3.0 España | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | * |
| dc.subject | Representativeness | es_ES |
| dc.subject | ReCor | es_ES |
| dc.subject | Specialized Corpus | es_ES |
| dc.subject | Zipf's Law | es_ES |
| dc.subject | N-gram | es_ES |
| dc.subject | Coruña Corpus | es_ES |
| dc.subject | CETA | es_ES |
| dc.subject | Astronomy | es_ES |
| dc.title | The representativeness threshold for the CETA subcorpus of the Coruña Corpus | es_ES |
| dc.type | journal article | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 8d898a03-208f-4256-9d6e-827e77512932 | |
| relation.isAuthorOfPublication.latestForDiscovery | 8d898a03-208f-4256-9d6e-827e77512932 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Alfaya_Lamas_Elena_2021_Representativeness_threshold_CETA.pdf
- Size:
- 896.53 KB
- Format:
- Adobe Portable Document Format
- Description:

