Vision, Deduction and Alignment: An Empirical Study on Multi-modal Knowledge Graph Alignment
Abstract
Entity alignment (EA) for knowledge graphs (KGs) plays a critical role in knowledge engineering. Existing EA methods mostly focus on utilizing the graph structures and entity attributes (including literals), but ignore images that are common in modern multi-modal KGs. In this study we first constructed Multi-OpenEA -- eight large-scale, image-equipped EA benchmarks, and then evaluated some existing embedding-based methods for utilizing images. In view of the complementary nature of visual modal information and logical deduction, we further developed a new multi-modal EA method named LODEME using logical deduction and multi-modal KG embedding, with state-of-the-art performance achieved on Multi-OpenEA and other existing multi-modal EA benchmarks.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2023
- DOI:
- 10.48550/arXiv.2302.08774
- arXiv:
- arXiv:2302.08774
- Bibcode:
- 2023arXiv230208774L
- Keywords:
-
- Computer Science - Artificial Intelligence;
- Computer Science - Multimedia
- E-Print:
- Accepted by ICASSP2023