QAttn: Efficient GPU Kernels for mixed-precision vision transformers. Piotr Sebastian Kluska, Adrián Castelló, et al. 2024. CVPR 2024.
Planning with Language Models Through The Lens of Efficiency. Michael Katz, Harsha Kokel, et al. 2024. ICAPS 2024.
Quantifying the Ethical Dilemma of Using Culturally Toxic Training Data in AI Tools for Indigenous Languages. Pedro Domingues, Claudio Santos Pinhanez, et al. 2024. LREC-COLING 2024.
Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions. Hyo Jin Do, Rachel Ostrand, et al. 2024. CHI 2024.
Towards Pareto Optimal Throughput in Small Language Model Serving. Pol G. Recasens, Yue Zhu, et al. 2024. EuroSys 2024.
Human Evaluation of the Usefulness of Fine-Tuned English Translators for the Guarani Mbya and Nheengatu Indigenous Languages. Claudio Santos Pinhanez, Paulo Rodrigo Cavalin, et al. 2024. PROPOR 2024.
Predicting Question-Answering Performance of Large Language Models through Semantic Consistency. Ella Rabinovich, Samuel Ackerman, et al. 2023. EMNLP 2023.
Matching Table Metadata with Business Glossaries Using Large Language Models. Elita Lobo, Oktie Hassanzadeh, et al. 2023. ISWC 2023.
ComplexWorld: A Large Language Model-based Interactive Fiction Learning Environment for Text-based Reinforcement Learning Agents. Shreyas Basavatia, Shivam Ratnakar, et al. 2023. IJCAI 2023.