241 to 250 of 4,721 Results
Nov 26, 2024
Sönning, Lukas, 2024, "Background data for: Advancing our understanding of dispersion measures in corpus research", https://doi.org/10.18710/FVHTFM, DataverseNO, V1
Dataset description This dataset contains background data and supplementary material for Sönning (forthcoming), a study that looks at the behavior of dispersion measures when applied to text-level frequency data. For the literature survey reported in that study, which examines ho... |
Nov 26, 2024 -
Background data for: Advancing our understanding of dispersion measures in corpus research
Plain Text - 14.9 KB -
MD5: c23972673607ac093e7294d8d54c2286
File describing the dataset |
Nov 26, 2024 -
Background data for: Advancing our understanding of dispersion measures in corpus research
Tab-Separated Values - 47.6 KB -
MD5: 3364ac343f7cd4a8237e739aaf41f7b3
Tab-delimited data table containing the 730 research articles that entered our literature survey |
Nov 26, 2024 -
Background data for: Advancing our understanding of dispersion measures in corpus research
Tab-Separated Values - 4.9 KB -
MD5: 9bcf7eb3fc40c94c0649e39f8952fa4f
Tab-delimited data table containing annotations for the 38 studies in our survey that assessed dispersion |
Nov 26, 2024 -
Background data for: Advancing our understanding of dispersion measures in corpus research
Tab-Separated Values - 47.8 MB -
MD5: 9078aedb23cafe3679d0b0be2807cb59
Tab-delimited document-term matrix for the word forms in the Brown Corpus |
Nov 26, 2024 -
Background data for: Advancing our understanding of dispersion measures in corpus research
Tab-Separated Values - 47.8 MB -
MD5: a86da44e3100e3d6185c4929d8bc900b
Tab-delimited term-document matrix for the word forms in the Brown Corpus |
Nov 26, 2024 -
Background data for: Advancing our understanding of dispersion measures in corpus research
Unknown - 6.1 KB -
MD5: cc79a805eb71773729042bd7b77d5dd8
R quarto script documenting retrieval of the data from the Brown XML files |
Nov 25, 2024
Sönning, Lukas, 2024, "Background data for: Some obstacles to replication in corpus linguistics", https://doi.org/10.18710/7LNWJX, DataverseNO, V1
This dataset contains tabular files recording occurrences and frequencies of modal verbs in the Brown family corpora; nine modal verbs (can, could, may, might, must, shall, should, will, would) and six corpora are considered (Brown, LOB, Frown, FLOB, BE06, AmE06). Tokens were ret... |
Plain Text - 26.7 KB -
MD5: e22385f8fd2817c4fbd19dbf5cd5e488
File describing the dataset |
Tab-Separated Values - 12.7 MB -
MD5: a0b8a87b0d8b93b970794b46c1e30378
Tab-delimited data table containing the data in case form |