| 
                                            Persistent Identifier
                                            
                                         | 
                                        doi:10.18710/CRSJLY | 
                                    
                                        | 
                                            Publication Date
                                            
                                         | 
                                        2025-03-20 | 
                                    
                                        | 
                                            Title
                                            
                                         | Replication Data for: A corpus-based analysis of the Dat-Nom/Nom-Dat alternation in German | 
                                    
                                    
                                        | 
                                            Author
                                            
                                         | Somers, JorenGhent UniversityORCID0000-0002-3815-3139
                                                         Leuschner, TorstenGhent UniversityORCID0000-0003-4825-7510
                                                         De Cuypere, LudovicVrije Universiteit Brussel - Ghent UniversityORCID0000-0002-0050-1097
                                                         Barðdal, JóhannaORCID0000-0003-0164-4249 | 
                                    
                                    
                                        | 
                                            Point of Contact
                                            
                                         | 
                                                     Use email button above to contact.
                                                     De Cuypere, Ludovic (Vrije Universiteit Brussel - Ghent University)  | 
                                    
                                    
                                        | 
                                            Description
                                            
                                         | Dataset abstract 
The dataset includes an annotated sample of N = 13292 German written sentences with a Nominative and a Dative argument. The sentences comprise 76 different verbs taking two alternating object orders: 5591 sentences occur with the Dat-Nom order, 8701 sentences occur with the Nom-Dat order. Each sentence is annotated for Object order, the sentence Verb and several features related to both objects, including: (pro)nominality, pronoun type, referentiality, person, number, definiteness, animacy, and length. The sentences and the two objects are shared in a separate .csv-file. An R Notebook with the data analysis is provided as well as an html file with both the R code and output for the analysis.  (2023-10-23)
                                                        
 Article abstract 
A subgroup of German Nom-Dat verbs have received considerable attention in the literature due to the propensity of the dative to occur preverbally, which is unexpected on an object analysis of the dative (see references below). Here we argue for an alternative analysis, namely that the relevant verbs alternate between two different argument structures, Dat-Nom and Nom-Dat, and hence that either argument, the dative or the nominative, may be the syntactic subject. Earlier studies have shown that topicalisation of direct arguments is found in ca. 4–12% of the cases in German texts. For comparison, we have extracted 13,000 tokens of 76 verbs from the deTenTen13 corpus and coded them for ten different variables. Our findings support an alternating Dat-Nom/Nom-Dat analysis for these verbs, as 42% of the tokens instantiate the Dat-Nom order and the remnant 58% instantiate the Nom-Dat order. In contexts with full NPs only, the share of Dat-Nom tokens is even higher, 46% compared to 54% Nom-Dat, which altogether excludes a topicalisation analysis of the Dat-Nom word order. In order to throw further light on the alternation, we carry out a multivariate analysis which confirms the effect of topicality, definiteness, length, the animacy of the dative and the inanimacy of the nominative.  (2023-10-23) | 
                                    
                                    
                                        | 
                                            Subject
                                            
                                         | Arts and Humanities | 
                                    
                                    
                                        | 
                                            Keyword
                                            
                                         | German
                                                         Nominative
                                                         Dative
                                                         Alternating constructions
                                                         Argument structure construction
                                                         Corpus-based linguistics
                                                         Mixed-effects logistic regression
                                                         Dat-Nom alternation
                                                         experiential verb
                                                         experiential construction
                                                         unaccusative
                                                         unaccusative verb
                                                         mixed-effects logistic regression
                                                         dative subject
                                                         word order
                                                         argument structure | 
                                    
                                    
                                        | 
                                            Related Publication
                                            
                                         | Somers, J., Leuschner, T., De Cuypere, L., & Barðdal, J. (in press). A Corpus-Based Analysis of the Dat-Nom/Nom-Dat Alternation in German. Zeitschrift Für Sprachwissenschaft. | 
                                    
                                    
                                        | 
                                            Language
                                            
                                         | English | 
                                    
                                    
                                        | 
                                            Producer
                                            
                                         | Ghent University (UGent) https://www.ugent.be
                                                         Vrije Universiteit Brussel https://www.vub.be | 
                                    
                                    
                                        | 
                                            Production Location
                                            
                                         | Ghent, Belgium | 
                                    
                                    
                                        | 
                                            Distributor
                                            
                                         | The Tromsø Repository of Language and Linguistics (TROLLing) (TROLLing) https://trolling.uit.no/ | 
                                    
                                    
                                        | 
                                            Depositor
                                            
                                         | De Cuypere, Ludovic | 
                                    
                                    
                                        | 
                                            Deposit Date
                                            
                                         | 2023-10-04 | 
                                    
                                    
                                        | 
                                            Date of Collection
                                            
                                         | Start Date: 2020-06-01; End Date: 2022-11-01 | 
                                    
                                    
                                        | 
                                            Data Type
                                            
                                         | annotated corpus data | 
                                    
                                    
                                        | 
                                            Software
                                            
                                         | MS Excel, Version: 2309
                                                         R, Version: 4.2.3
                                                         RStudio (Posit Software, PBC), Version: 2023.09.0
                                                         SketchEngine, Version: 2023 | 
                                    
                                    
                                        | 
                                            Data Source
                                            
                                         | The data was retrieved from the deTenTen corpus, accessed through SketchEngine. For more information about the deTenTen corpus, see: sketchengine.eu/detenten-german-corpus. For more information about SketchEngine, see: sketchengine.eu. 
References for SketchEngine: 
 - Adam Kilgarriff, Vít Baisa, Jan Bušta, Miloš Jakubíček, Vojtěch Kovář, Jan Michelfeit, Pavel Rychlý, Vít Suchomel. The Sketch Engine: ten years on. Lexicography, 1: 7-36, 2014.
 
 - Adam Kilgarriff, Pavel Rychlý, Pavel Smrž, David Tugwell. The Sketch Engine. Proceedings of the 11th EURALEX International Congress: 105-116, 2004.
 
 
The extracted text fragments included in this dataset only represent insubstantial portions of the source listed above, and they do not represent coherent larger texts. Reuse of such excerpts is permitted under exceptions in IPR and database protection regulations, such as Fair use (cf. US Copyright Act), the EU Database Directive (cf. art 8 Rights and obligations of lawful users), and the Norwegian Copyright Act (cf. § 24 Eneretten til databaser).  |