Statistics for linguist(ic)s blog
Modeling the interpretation of quantifiers using beta regression
This blog post shows how to use beta regression to model the proportional interpretation of the quantifiers few, some, many, and most. We consider variable-dispersion and mixed-effects structures as well as diagnostics for frequentist and Bayesian models.
Different parameterizations of the negative binomial distribution
This blog post discusses two different parameterizations of the negative binomial distribution and groups R packages (and functions) based on the version they implement.
The negative binomial distribution: A visual explanation
This blog post uses a visual approach to explain how the negative binomial distribution works.
The replication crisis: Implications for myself
In this blog post, I reflect on the ways in which learning about the replication crisis in science has affected my own work.
Structured down-sampling: Implementation in R
This blog post shows how to implement structured down-sampling in R.
Two types of down-sampling in corpus-based work
This short blog post contrasts the different ways in which the term down-sampling is used in corpus-based work.
‘Dispersion’ in corpus linguistics and statistics
This blog post clarifies the different ways in which the term dispersion is used in corpus linguistics and statistics.