RL Tango: Reinforcing Generator and Verifier Together for Language ReasoningKaiwen ZhaZhengqi Gaoet al.2025NeurIPS 2025Conference paper
Thermometer: Towards Universal Calibration for Large Language ModelsMaohao ShenSubhro Daset al.2024ICML 2024Conference paper
Group Fairness with Uncertain Sensitive AttributesAbhin ShahMaohao Shenet al.2024ISIT 2024Conference paper