J. J. Egozcue, V. Pawlowsky-Glahn, J. A. Martín-Fernández, M. I. Ortego

In compositional data analysis, usual correlation/association coefficients (Pearson, Spearman, Kendall) are spurious. Linear association (LA) between parts, defined as proportionality of two parts along a sample, is an alternative, which can be extended to LA between groups of parts. The group or balance LA is an adequate model in genetics, e.g. for the Hardy-Weinberg equilibrium. However, detection of LAs has computational difficulties. We propose some methods to approximate general and group LAs. The best general LA is efficiently detected using the SVD of the centered clr-transformed data. However, the (almost) constant log-contrast detected in this way is frequently difficult to interpret. Approximation of the last compositional principal component (minimum variance) provides two methods to detect balance-association. Finally, exhaustive search for almost constant balances is possible using algorithms designed for principal balances, although at a high computational cost.

Keywords: linear association, compositional data, balance, log-contrast, linear restriction, singular value decomposition

Scheduled

AM-1 Multivariate Analysis
September 4, 2019  12:00 PM
I3L8. Georgina Blanes building


Other papers in the same session


Cookie policy

We use cookies in order to be able to identify and authenticate you on the website. They are necessary for the correct functioning of it, and therefore they can not be disabled. If you continue browsing the website, you are agreeing with their acceptance, as well as our Privacy Policy.

Additionally, we use Google Analytics in order to analyze the website traffic. They also use cookies and you can accept or refuse them with the buttons below.

You can read more details about our Cookie Policy and our Privacy Policy.