A Bottom-Up Capsule Network for Hierarchical Image Classification
Published in 2023 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2023
Hierarchical image classification is an arduous task in deep learning and computer vision. It requires classifying multiple image classes following a taxonomy or data hierarchy. This paper introduces a bottom-up hierarchical capsule network (BUH-CapsNet) designed to address hierarchical multi-label classification. The hierarchical structure of BUH-CapsNet allows it to build a tree-like structure for classification problems, making use of the data hierarchy. This structure enables the network to learn more complex relationships in the taxonomy by balancing the hierarchical levels and following the fine-to-coarse paradigm, leading to more accurate classification results. Furthermore, the bottom-up architecture of the BUH-CapsNet enforces hierarchical consistency, using the hierarchical structure of the datasets. We trained our BUH-CapsNet considering the hierarchical level weights that keep a balance between the levels. Experiments on six widely available datasets show that BUH-CapsNet achieves better results than existing multi-label classification methods and performs better when handling hierarchical labels.
Recommended citation: K. T. Noor, A. Robles-Kelly, L. Y. Zhang, and M. R. Bouadjenek, ‘A Bottom-Up Capsule Network for Hierarchical Image Classification’, in 2023 International Conference on Digital Image Computing: Techniques and Applications (DICTA), Nov. 2023, pp. 325–331. doi: 10.1109/DICTA60407.2023.00052.
Download Paper