Persistent Identifier
|
doi:10.18710/RCG0ZH |
Publication Date
|
2024-01-17 |
Title
| Replication Data for: Dormivit et resurgit. A ‘language-ecology’ approach to the diachrony of the Latin ingressive perfect |
Author
| Aerts, Simon (Ghent University) - ORCID: 0000-0003-1852-9255 |
Point of Contact
|
Use email button above to contact.
Aerts, Simon (Ghent University) |
Description
| Dataset includes annotated corpus data from Latin texts from the 3rd c. BCE until the 6th c. CE. Attestations of 'perfectum' stem forms of a selection of common stative verbs were extracted from major online corpora (about 14.000 data points); a random sample (n = 234) that represents all text types and time periods as evenly as possible was then subjected to a close-reading analysis in order to ascertain the attestation rate of 'ingressive' meaning with the target observations. Only the data points that were annotated in full are included in the current dataset. (2023-08-18) |
Subject
| Arts and Humanities |
Keyword
| Latin linguistics
ingressive aspect
tense-aspect
corpus semantics
Latin tense system |
Related Publication
| Aerts, S. 2024. “Dormivit et resurgit. A ‘language-ecology’ approach to the diachrony of the Latin ingressive perfect.” Journal of Latin Linguistics 23 (1-2), 21-50. doi: 10.1515/joll-2024-2006 https://doi.org/10.1515/joll-2024-2006 |
Language
| English |
Producer
| Ghent University https://www.ugent.be/en |
Contributor
| Data Curator : Cluyse, Brian |
Funding Information
| Research Foundation - Flanders: Grant number: 1282722N |
Distributor
| The Tromsø Repository of Language and Linguistics (TROLLing) (TROLLing) https://trolling.uit.no/ |
Depositor
| Aerts, Simon |
Deposit Date
| 2023-08-21 |
Time Period
| Start Date: 200BCE ; End Date: 0600 |
Date of Collection
| Start Date: 2021-11-01 ; End Date: 2023-05-31 |
Data Type
| Annotated corpus data |
Series
| Tracing change and reaction in the Latin tense system: The datasets in this series contain the replication data for research papers published within the FWO-funded project "Tracing change and reaction in the Latin tense system: an empirical analysis of language-internal and language-external influences on the development of morphological innovations and form-function pairings from Early Latin to Early Romance". |
Software
| R |
Data Source
| The data contained in this dataset originate from the following sources:
- ECDS: Epigraphik-Datenbank Clauss/Slaby. Included in the used sample are texts from the collections CLE (Carmina Latina Epigraphica, i.e. Latin verse inscriptions), leges (i.e. laws), and tituli operum (i.e. public inscriptions describing buildings). ECDS does not provide a user license / Terms of Use, except for the following disclaimer: "All texts, pictures and graphics published on this website are subject to copyright and other laws for the protection of intellectual property".
- LASLA: Laboratoire d’Analyse Statistique des Langues Anciennes. Classical period, mostly literary texts. LASLA does not provide a user license / Terms of Use, except for the general copyright statement in the about section of the LASLA Opera Latina website: Copyright LASLA - CIPL 2014.
- LLT: Library of Latin texts. LLT1 (Antiquitas (until CE 200) from which the texts overlapping with the LASLA corpus were excluded in our corpus queries), LLT2 (Aetas Patrum I, ca. CE 200 - CE 500) and LLT5 (Aetas Patrum II, ca. CE 501 - 735), from which the texts overlapping with the PaLaFra corpus were excluded, LLT3 (transcripts from the Councils, ca. CE 500 - CE 800), LLT4 (the texts of the Vulgate, i.e. the Latin Old and New Testaments, translated by St Hieronymus and his team) and LLT6 (parabiblical texts). LLT is part of the BREPOLiS databases, for which the BREPOLiS Terms and Conditions apply. The BREPOLiS Terms and Conditions entitle users "to extract and re-utilize, for non-commercial purposes only, any insubstantial parts of the contents of the Database".
- PaLaFra: The transition from Latin to French: constitution and analysis of a Latin-French digital corpus. The subcorpus PaLaFraLat contains texts of various types from the Gallo-Roman area, mainly from the Merovingian period but also including hagiographic texts; it is accessible under the CC BY-NC-SA 4.0 license.
The extracted text fragments that are contained in the data file of this dataset only represent non-substantial portions of the sources listed above, and they do not represent coherent larger texts. Therefore, the reuse (including redistribution) of these excerpts is permitted by the exceptions rules in IPR and database protection regulations, such as Fair use (USA cf. US Copyright Act), Fair dealing (UK; cf. Exceptions to copyright), "lover, forskrifter, rettsavgjørelser og andre vedtak av offentlig myndighet" (Norway; cf. § 14 in Åndsverkloven), "uvesentlige deler av databaser" (Norway; cf. § 24 in Åndsverkloven), "sitatretten" (Norway; cf. § 29 in Åndsverkloven). As these excerpts do not represent substantial parts of the reused sources, the redistribution of these excerpts is according to Creative Commons (CC) also permitted if they are extracted from sources that are distributed under Creative Commons licenses (cf. question "Do I always have to comply with the license terms? If not, what are the exceptions?" in the Creative Commons Frequently Asked Questions). |