Publications

2 results for Idan Shenfeld

KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
- - Gholamali Aminian
  - Amir R Asadi
  - et al.
- 2025
- NeurIPS 2025
Conference paper
Curiosity-driven Red-teaming for Large Language Models
- - Zhang-wei Hong
  - Idan Shenfeld
  - et al.
- 2024
- ICLR 2024
Conference paper