The making of the Corpus of English Life Sciences Texts (CELiST), a bunch of disciplines

UDC.coleccionInvestigaciónes_ES
UDC.departamentoLetrases_ES
UDC.endPage19es_ES
UDC.grupoInvResearch Group for Multidimensional Corpus-Based Studies in English (MUSTE)es_ES
UDC.startPage2es_ES
dc.contributor.authorMoskowich, Isabel
dc.date.accessioned2024-01-15T08:45:50Z
dc.date.embargoEndDate9999-12-31es_ES
dc.date.embargoLift9999-12-31
dc.date.issued2021
dc.description.abstract[Abstract] Contrary to what happens with huge corpora automatically taken from the Internet by crawlers, the compilation of a smaller specialised corpus is a time-consuming, carefully planned task that must follow a protocol. The different subcorpora in the Coruña Corpus family have been built in a similar way and attending to what Kennedy (1998: 70-85) mentions as the five steps in corpus creation: design, planning a storage system and keeping records, obtaining permissions, text capture, and encoding. In the case of CELiST, the Corpus of English Life Sciences Texts, design has been certainly difficult. This chapter will explore and explain the reasons behind text selection for this particular corpus and will also address the decisions that had to be made regarding disciplines. As we intended to compile a corpus of texts dealing with biology, we found that the field, as such, did not exist in the eighteenth and nineteenth centuries, thus leading compilers to look for extracts and works in different sources and to extend our original selection to many more disciplines in the UNESCO classification of the fields of Science and Technology (1988). Therefore, the sampling frame was determined, first and foremost, by the field in question. Consequently, we had to move to something different and more inclusive as we learned more about the taxonomies of scientific fields across history. The chapter will provide the final portrait of CELiST in iys makinges_ES
dc.identifier.citationMoskowich, Isabel. 2021. “The making of the Corpus of English Life Sciences Texts (CELiST), a bunch of disciplines”. In Moskowich, Isabel; Lareo, Inés and Camiña Rioboó, Gonzalo (eds.), "All families and genera": Exploring the Corpus of English Life Sciences Texts. Amsterdam: John Benjamins. 2–19es_ES
dc.identifier.isbn9789027259622
dc.identifier.urihttp://hdl.handle.net/2183/34901
dc.language.isoenges_ES
dc.publisherJohn Benjaminses_ES
dc.relation.urihttps://doi.org/10.1075/z.237.01moses_ES
dc.rights.accessRightsembargoed accesses_ES
dc.subjectLate Modern Englishes_ES
dc.subjectCorpus linguisticses_ES
dc.subjectScientific discoursees_ES
dc.subjectLife Scienceses_ES
dc.titleThe making of the Corpus of English Life Sciences Texts (CELiST), a bunch of disciplineses_ES
dc.typebook partes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationc672d28b-685f-4ef4-88f1-ce65594795f0
relation.isAuthorOfPublication.latestForDiscoveryc672d28b-685f-4ef4-88f1-ce65594795f0

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Moskowich_Isabel_2021_Making_Corpus_English_Life_Sciences_Texts.pdf
Size:
120.76 KB
Format:
Adobe Portable Document Format
Description: