Cost-Aware Counterfactuals for Black Box Explanations. Natalia Martinez Gil, Kanthi Sarpatwar, et al. 2023. NeurIPS 2023.
Influence Based Approaches to Algorithmic Fairness: A Closer Look. Soumya Ghosh, Prasanna Sattigeri, et al. 2023. NeurIPS 2023.
Weakly Supervised Detection of Hallucinations in LLM Activations. Miriam Rateike, Celia Cintas, et al. 2023. NeurIPS 2023.
Workshop version: How hard are computer vision datasets? Calibrating dataset difficulty to viewing time. David Mayo, Jesse Cummings, et al. 2023. NeurIPS 2023.
Explaining knock-on effects of bias mitigation. Svetoslav Nizhnichenkov, Rahul Nair, et al. 2023. NeurIPS 2023.
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset. Anna Richter, Brooklyn Sheppard, et al. 2023. NeurIPS 2023.
Risk Assessment and Statistical Significance in the Age of Foundation Models. Apoorva Nitsure, Youssef Mroueh, et al. 2023. NeurIPS 2023.
Characterizing pre-trained and task-adapted molecular representations. Celia Cintas, Payel Das, et al. 2023. NeurIPS 2023.