From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise ProductionSegev ShlomovAlon Ovedet al.2026IAAI 2026
ST-WEBAGENTBENCH: A Benchmark for Evaluating Safety and Trustworthiness in Web AgentsIdo LevyBen Wieselet al.2025ICML 2025
Web Agent Revolution: Enhancing Trust and Enterprise-Grade Adoption Through InnovationSegev ShlomovXiang Denget al.2025AAAI 2025
The Second Resiliency of Intelligent Automation Systems ChallengeSegev ShlomovSami Marreedet al.2024IJCAI 2024
AUTOMATES: THE SECOND INTERNATIONAL WORKSHOP ON NO-CODE COPILOTSSegev ShlomovRonen Brafmanet al.2024IJCAI 2024
Enhancing Trust in LLM-Based AI Automation Agents: New Considerations and Future ChallengesSivan SchwartzAvi Yaeliet al.2023IJCAI 2023