Sanskrit sandhi splitting using Seq2(Seq)22
Rahul Aralikatte, Neelamadhav Gantayat, et al.
EMNLP 2018
Attention-based models are successful when trained on large amounts of data. In this paper, we demonstrate that even in the low-resource scenario, attention can be learned effectively. To this end, we start with discrete human-annotated rationales and map them into continuous attention. Our central hypothesis is that this mapping is general across domains, and thus can be transferred from resource-rich domains to low-resource ones. Our model jointly learns a domain-invariant representation and induces the desired mapping between rationales and attention. Our empirical results validate this hypothesis and show that our approach delivers significant gains over state-of-the-art baselines, yielding over 15% average error reduction on benchmark datasets.
Rahul Aralikatte, Neelamadhav Gantayat, et al.
EMNLP 2018
Saiteja Utpala, Alex Gu, et al.
NAACL 2024
Gosia Lazuka, Andreea Simona Anghel, et al.
SC 2024
Takuma Udagawa, Aashka Trivedi, et al.
EMNLP 2023