Scaling Stick-Breaking Attention: An Efficient Implementation and In-Depth Study. Shawn Tan, Songlin Yang, et al. ICLR 2025.
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation. Iulian Vlad Serban, Tim Klinger, et al. AAAI 2017.