TSR: Trajectory‑Search Rollouts for Multi‑Turn RL of LLM AgentsAladin DjuheraSwanand Ravindra Kadheet al.2026ICLR 2026Workshop paper
SafeCOMM: A Study on Safety Degradation in Fine-Tuned Telecom Large Language ModelsAladin DjuheraSwanand Ravindra Kadheet al.2026WCNC 2026Conference paper
When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization DatasetsAladin DjuheraFarhan Ahmedet al.2025NeurIPS 2025Workshop paper