AHAMask: Reliable Task Specification for Large Audio Language Models without Instructions
Published in AAAI, 2026
This paper proposes to simply mask some attention heads in an LALM (large audio language model) to achieve reliable task specification. This is because selectively masking some attention heads in an LALM can trigger its specific task functionalities well.
Recommended citation: Yiwei Guo, Bohan Li, Hankun Wang, Zhihan Li, Shuai Wang, Xie Chen, Kai Yu (2026). "AHAMask: Reliable Task Specification for Large Audio Language Models without Instructions." In Proc. AAAI, 2026.
