Optimal-transport based consensus clustering with applications to flow cytometry analysis

E. del Barrio, H. Inouzhe Valdes, J. Loubes, C. Matrán, A. Mayo-Íscar

We present a strategy for classifying a test sample $X_T$ using a database $X_1,...,X_N$ of classified samples where the high intrinsic variability of the data makes part of the information in the database not suitable. We cluster the database in homogeneous groups, extract a representative template of each group and use it as an initialization for an unsupervised clustering procedure on $X_T$. The resulting partition of $X_T$ is assigned to the closest template and the information of the template or/and the corresponding group of the database is used to classify $X_T$. To implement this strategy we use optimal transport techniques and introduce novel ideas for consensus clustering and optimal relabelling of a cluster based on optimal transport. As an application of our ideas we develop a tool for automated flow cytometry analysis called floWasserTclust.

Palabras clave: Optimal transport consensus clustering flow cytometry transfer labelling

Programado

GT4-1 Análisis Multivariante y Clasificación

3 de septiembre de 2019 15:30

I3L10. Edificio Georgina Blanes

Otros trabajos en la misma sesión

Estimando el centro de un conjunto de Datos Composicionales en sus unidades originales

J. A. Martín-Fernández, V. Pawlowsky-Glahn, J. J. Egozcue

On aggregation of groups and categories in contingency tables

E. Carrizosa, V. Guerrero, D. Romero Morales

Últimas noticias

04/07/19
Programa científico completo disponible

31/05/19
Convocado Premio INE 2019

13/04/19
Inscripción ya abierta

Organizan

Colaboran