Towards Automating the AI Operations Lifecycle
Matthew Arnold, Jeffrey Boston, et al.
MLSys 2020
Like traditional software, AI agents are prone to failure; specifically, they can enter ‘repetitive futile cycles’ — loops of unproductive behavior that are particularly difficult to detect. This paper introduces the concept of futile cycles and distinguishes them from productive cycles in agent execution trajectories. We propose unsupervised approaches for detecting futile cycles that leverage both structural and semantic representations of agent trajectories evaluated on a large dataset of trajectories for a LangGraph-based stock market multi-Agent application. Our hybrid approach achieves an F1 score of (precision: , recall: ), significantly outperforming individual structural (F1: ) and semantic (F1: ) methods.
Matthew Arnold, Jeffrey Boston, et al.
MLSys 2020
Hadar Mulian, Sergey Zeltyn, et al.
ICSE 2026
Toshiaki Yasue, Kohichi Ono, et al.
ICSE 2026
Genady Ya. Grabarnik, Filippo Poltronieri, et al.
CASCON 2023