HAXMLNet: Hierarchical Attention Network for Extreme Multi-Label Text Classification
Abstract
Extreme multi-label text classification (XMTC) addresses the problem of tagging each text with the most relevant labels from an extreme-scale label set. Traditional methods use bag-of-words (BOW) representations without context information as their features. The state-ot-the-art deep learning-based method, AttentionXML, which uses a recurrent neural network (RNN) and the multi-label attention, can hardly deal with extreme-scale (hundreds of thousands labels) problem. To address this, we propose our HAXMLNet, which uses an efficient and effective hierarchical structure with the multi-label attention. Experimental results show that HAXMLNet reaches a competitive performance with other state-of-the-art methods.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2019
- DOI:
- 10.48550/arXiv.1904.12578
- arXiv:
- arXiv:1904.12578
- Bibcode:
- 2019arXiv190412578Y
- Keywords:
-
- Computer Science - Information Retrieval;
- Computer Science - Machine Learning;
- Statistics - Machine Learning