The quest to teach LLMs how to countResearchKim Martineau05 Dec 2025AIComputer ScienceGenerative AIMathematical SciencesNatural Language Processing
Improving Hugging Face training efficiency through packing with flash attentionTechnical noteRhui Dih Lee, Arthur Zucker, Achintya Kundu, Laura Wynter, Raghu Ganti, and Mayank Mishra28 Aug 2024AI