Biometrical Letters

ISSN:1896-3811

Submit manuscript
Volume (55) Number 2 pp. 233-243

Ewa Skotarczak 1, Anita Dobek 1, Krzysztof Moliński 1

1Department of Mathematical and Statistical Methods, Poznań University of Life Sciences, Poznań, Poland

Entropy as a measure of dependency for categorized data

Summary

Data arranged in a two-way contingency table can be obtained as a result of many experiments in the life sciences. In some cases the categorized trait is in fact conditioned by an unobservable continuous variable, called liability. It may be interesting to know the relationship between the Pearson correlation coefficient of these two continuous variables and the entropy function measuring the corresponding relation for categorized data. After many simulation trials, a linear regression was estimated between the Pearson correlation coefficient and the normalized mutual information (both on a logarithmic scale). It was observed that the regression coefficients obtained do not depend either on the number of observations classified on a categorical scale or on the continuous random distribution used for the latent variable, but they are influenced by the number of columns in the contingency table. In this paper a known measure of dependency for such data, based on the entropy concept, is applied.

Keywords: contingency table, correlation, entropy, liability

DOI: 10.2478/bile-2018-0014

For citation:

MLA Skotarczak, Ewa, et al. "Entropy as a measure of dependency for categorized data." Biometrical Letters 55.2 (2018): 233-243. DOI: 10.2478/bile-2018-0014
APA Skotarczak, E., Dobek, A., & Moliński, K. (2018). Entropy as a measure of dependency for categorized data. Biometrical Letters 55(2), 233-243 DOI: 10.2478/bile-2018-0014
ISO 690 SKOTARCZAK, Ewa, DOBEK, Anita, MOLIńSKI, Krzysztof. Entropy as a measure of dependency for categorized data. Biometrical Letters, 2018, 55.2: 233-243. DOI: 10.2478/bile-2018-0014