A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios. Samuel Ackerman, Ella Rabinovich, et al. 2024. EMNLP 2024.
Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations. Arafat Sultan, Jatin Ganhotra, et al. 2024. EMNLP 2024.
DARE to Diversify: DAta Driven and Diverse LLM REd Teaming. Manish Nagireddy, Bernat Guillen Pegueroles, et al. 2024. KDD 2024.
Navigating the Modern Evaluation Landscape: Considerations in Benchmarks and Frameworks for Large Language Models (LLMs). Leshem Choshen, Ariel Gera, et al. 2024. LREC-COLING 2024.
Using Large Language Models to Understand Suicidality in a Social Media–Based Taxonomy of Mental Health Disorders: Linguistic Analysis of Reddit Posts. Brian W. Bauer, Kely Norel, et al. 2024. JMIR Mental Health.
Machine-Assisted Error Discovery in Conversational AI Systems. Maeda Hanafi, Frederick Reiss, et al. 2024. CHI 2024.
Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows. Jasmine Shih, Vishal Mohanty, et al. 2024. CHI 2024.
Language models can identify enzymatic binding sites in protein sequences. Yves Gaetan Nana Teukam, Loic Kwate Dassi, et al. 2024. Computational and Structural Biotechnology Journal.
Training Large Language Encoders with the Curated Carolina Corpus. Guilherme Lamartine Mello, Paulo Rodrigo Cavalin, et al. 2024. PROPOR 2024.