Neural networks have achieved remarkable success across various fields. However, the lack of interpretability limits their practical use, particularly in critical decision-making scenarios. Posthoc interpretability, which provides explanations for pretrained models, is often at risk of fidelity and robustness. This has inspired a rising interest in self-interpretable neural networks (SINNs), which inherently reveal the prediction rationale through model structures. Despite this progress, existin
A Comprehensive Survey on Self-Interpretable Neural Networks
Yang Ji·Hui Xiong·Hengshu Zhu·Zhigaoyuan Wang·Dazhong Shen·Yuting Zhang·Chuan Qin·Yuanxin Zhuang·Zheng Gong·Ying Sun
