241 to 250 of 4,721 Results
Nov 26, 2024
Sönning, Lukas, 2024, "Background data for: Advancing our understanding of dispersion measures in corpus research", https://doi.org/10.18710/FVHTFM, DataverseNO, V1
Dataset description: This dataset contains background data and supplementary material for Sönning (forthcoming), a study that looks at the behavior of dispersion measures when applied to text-level frequency data. For the literature survey reported in that study, which examines ho...
Plain Text - 14.9 KB - MD5: c23972673607ac093e7294d8d54c2286
File describing the dataset
Tab-Separated Values - 47.6 KB - MD5: 3364ac343f7cd4a8237e739aaf41f7b3
Tab-delimited data table containing the 730 research articles that entered our literature survey
Tab-Separated Values - 4.9 KB - MD5: 9bcf7eb3fc40c94c0649e39f8952fa4f
Tab-delimited data table containing annotations for the 38 studies in our survey that assessed dispersion
Tab-Separated Values - 47.8 MB - MD5: 9078aedb23cafe3679d0b0be2807cb59
Tab-delimited document-term matrix for the word forms in the Brown Corpus
Tab-Separated Values - 47.8 MB - MD5: a86da44e3100e3d6185c4929d8bc900b
Tab-delimited term-document matrix for the word forms in the Brown Corpus
Unknown - 6.1 KB - MD5: cc79a805eb71773729042bd7b77d5dd8
R Quarto script documenting retrieval of the data from the Brown XML files
Nov 25, 2024
Sönning, Lukas, 2024, "Background data for: Some obstacles to replication in corpus linguistics", https://doi.org/10.18710/7LNWJX, DataverseNO, V1
This dataset contains tabular files recording occurrences and frequencies of modal verbs in the Brown family corpora; nine modal verbs (can, could, may, might, must, shall, should, will, would) and six corpora are considered (Brown, LOB, Frown, FLOB, BE06, AmE06). Tokens were ret...
Plain Text - 26.7 KB - MD5: e22385f8fd2817c4fbd19dbf5cd5e488
Documentation
File describing the dataset
Tab-Separated Values - 12.7 MB - MD5: a0b8a87b0d8b93b970794b46c1e30378
Data
Tab-delimited data table containing the data in case form