Publications

FlexAttention for Efficient High-Resolution Vision-Language Models