Structured IB: Improving Information Bottleneck with Structured Feature Learning

doi:10.48550/arXiv.2412.08222

Structured IB: Improving Information Bottleneck with Structured Feature Learning

The Information Bottleneck (IB) principle has emerged as a promising approach for enhancing the generalization, robustness, and interpretability of deep neural networks, demonstrating efficacy across image segmentation, document clustering, and semantic communication. Among IB implementations, the IB Lagrangian method, employing Lagrangian multipliers, is widely adopted. While numerous methods for the optimizations of IB Lagrangian based on variational bounds and neural estimators are feasible, their performance is highly dependent on the quality of their design, which is inherently prone to errors. To address this limitation, we introduce Structured IB, a framework for investigating potential structured features. By incorporating auxiliary encoders to extract missing informative features, we generate more informative representations. Our experiments demonstrate superior prediction accuracy and task-relevant information preservation compared to the original IB Lagrangian method, even with reduced network size.

Publication:

arXiv e-prints

Pub Date:

December 2024

DOI:

10.48550/arXiv.2412.08222

arXiv:

arXiv:2412.08222

Bibcode:

2024arXiv241208222Y

Keywords:

Computer Science - Information Theory;
Computer Science - Machine Learning

ADS

Structured IB: Improving Information Bottleneck with Structured Feature Learning

Abstract