Workshop paperEfficient Interactive LLM Serving with Proxy Model-based Sequence Length PredictionHaoran Qiu, Weichao Mao, et al.ASPLOS 2024
Conference paperCross-Domain Telemetry Architecture: Real-Time Metrics in the Computing ContinuumJose Manuel Bernabe' Murcia, Eduardo Canovas Martinez, et al.MobiSec 2024
PaperIncorporating Signal Awareness in Source Code Modeling: An Application to Vulnerability DetectionSahil Suneja, Yufan Zhuang, et al.ACM TOSEM
Conference paperSCAD: Scalability Advisor for Interactive Microservices on Hybrid CloudsKa-Ho Chow, Umesh Deshpande, et al.SIGMOD 2023