Aleatoric and Epistemic Discrimination: Fundamental Limits of Fairness Interventions. Hao Wang, Luxi He, et al. 2023. NeurIPS 2023.
Cookie Consent Has Disparate Impact on Estimation Accuracy. Erik Miehling, Rahul Nair, et al. 2023. NeurIPS 2023.
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset. Anna Richter, Brooklyn Sheppard, et al. 2023. NeurIPS 2023.
FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs. Swanand Ravindra Kadhe, Anisa Halimi, et al. 2023. NeurIPS 2023.
Influence Based Approaches to Algorithmic Fairness: A Closer Look. Soumya Ghosh, Prasanna Sattigeri, et al. 2023. NeurIPS 2023.
Cost-Aware Counterfactuals for Black Box Explanations. Natalia Martinez Gil, Kanthi Sarpatwar, et al. 2023. NeurIPS 2023.
Weakly Supervised Detection of Hallucinations in LLM Activations. Miriam Rateike, Celia Cintas, et al. 2023. NeurIPS 2023.
Adversarial Auditing of Machine Learning Models under Compound Shift. Karan Bhanot, Dennis Wei, et al. 2023. ESANN 2023.