Contrastive multiple correspondence analysis (cMCA): Using contrastive learning to identify latent subgroups in political parties
Abstract
Scaling methods have long been utilized to simplify and cluster high-dimensional data. However, the general latent spaces across all predefined groups derived from these methods sometimes do not fall into researchers' interest regarding specific patterns within groups. To tackle this issue, we adopt an emerging analysis approach called contrastive learning. We contribute to this growing field by extending its ideas to multiple correspondence analysis (MCA) in order to enable an analysis of data often encountered by social scientists—containing binary, ordinal, and nominal variables. We demonstrate the utility of contrastive MCA (cMCA) by analyzing two different surveys of voters in the U.S. and U.K. Our results suggest that, first, cMCA can identify substantively important dimensions and divisions among subgroups that are overlooked by traditional methods; second, for other cases, cMCA can derive latent traits that emphasize subgroups seen moderately in those derived by traditional methods.
- Publication:
-
PLoS ONE
- Pub Date:
- July 2023
- DOI:
- 10.1371/journal.pone.0287180
- arXiv:
- arXiv:2007.04540
- Bibcode:
- 2023PLoSO..1887180F
- Keywords:
-
- Computer Science - Social and Information Networks;
- Computer Science - Machine Learning;
- Statistics - Machine Learning
- E-Print:
- Both authors contributed equally to this work and are listed alphabetically. This manuscript is accepted by PLOS ONE