Special issue: ACDM 2025
Volume: 70 | Article ID: 010415
GMFDE: Gated Multi-Modal Fusion with Distillation Enhancement for Entity Alignment
DOI: 10.2352/J.ImagingSci.Technol.2026.70.1.010415 | Published Online: January 2026
Abstract

In recent years, multi-modal knowledge graphs (MMKGs) have emerged to enhance the representation of real-world entities through structural, textual, and visual features. However, the inherent heterogeneity among different modalities poses significant challenges for entity alignment across KGs. In this study, we introduce GMFDE, an innovative framework for multi-modal entity alignment that integrates a gated residual fusion mechanism with a knowledge distillation strategy. The fusion module adaptively balances and refines modality-specific features while the distillation component enables unimodal encoders to learn complementary information from the fused multi-modal representation, promoting consistency across modalities. Extensive experiments on both bilingual and cross-KG datasets demonstrate that GMFDE achieves superior performance compared with existing leading methods, particularly excelling in settings with limited alignment seeds.
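To make the two components described above concrete, the following is a minimal, hypothetical sketch of a gated residual fusion step and a distillation objective. It is not the authors' implementation: all names (`gated_residual_fusion`, `distill_loss`, `W_gate`), the choice of two modalities, and the use of a mean-squared-error distillation loss are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gated_residual_fusion(h_txt, h_img, W_gate, b_gate):
    """Fuse two modality embeddings with a learned gate and a residual path.

    The gate (a sigmoid over the concatenated inputs) decides, per dimension,
    how much each modality contributes; the residual term keeps the original
    textual signal accessible after fusion.
    """
    gate = sigmoid(np.concatenate([h_txt, h_img]) @ W_gate + b_gate)
    fused = gate * h_txt + (1.0 - gate) * h_img
    return fused + h_txt  # residual connection

def distill_loss(h_uni, h_fused):
    """Assumed MSE distillation target: pull a unimodal encoder's output
    toward the fused multi-modal representation."""
    return float(np.mean((h_uni - h_fused) ** 2))

# Toy usage with random embeddings of dimension d
rng = np.random.default_rng(0)
d = 8
h_txt = rng.standard_normal(d)
h_img = rng.standard_normal(d)
W_gate = rng.standard_normal((2 * d, d)) * 0.1
b_gate = np.zeros(d)

fused = gated_residual_fusion(h_txt, h_img, W_gate, b_gate)
loss = distill_loss(h_txt, fused)
```

In this sketch, minimizing `distill_loss` for each unimodal encoder would push its output toward the fused representation, which is one plausible reading of how the distillation component "promotes consistency across modalities."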

  Cite this article 

Miaomiao Li, Rongyan Yuan, Zhijiang Li, "GMFDE: Gated Multi-Modal Fusion with Distillation Enhancement for Entity Alignment," Journal of Imaging Science and Technology, 2026, pp. 1-11, https://doi.org/10.2352/J.ImagingSci.Technol.2026.70.1.010415

  Copyright statement 
Copyright © Society for Imaging Science and Technology 2026
  Article timeline 
  • received: July 2025
  • accepted: December 2025
  • published: January 2026
