Matías Mazzanti, Esteban Mocskos, et al.
ISCA 2025
Complying with privacy regulations such as GDPR requires organizations to find and classify sensitive and personal data in their datastores. First, data discovery tools are applied to identify the data. Then, data classification tools are applied to the discovered data. Organizations must classify the data into concrete categories in order to manage it appropriately. In this paper we focus on multi-value classification, where the classifier assigns a single category to a set of values that all belong to the same category. Traditional classifiers usually apply single-value classification methods to a multi-value data set. However, in many cases this results in an incorrect classification, for example when domain categories overlap. We address this scenario and provide two methods to overcome the problem.
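The failure mode described above can be illustrated with a minimal sketch. It does not reproduce either of the paper's two methods, which the abstract does not detail. The category domains, the priority-order single-value classifier, and the coverage-based set classifier are all hypothetical choices made for illustration only.

```python
from collections import Counter

# Hypothetical category domains; the overlap between them is the point.
DOMAINS = {
    "first_name": {"Paris", "Jordan", "Austin", "Maria"},
    "city": {"Paris", "Jordan", "Austin", "London"},
}


def classify_single(value):
    """Single-value classification: return the first category (in dict
    order) whose domain contains the value, or None if none matches."""
    for category, domain in DOMAINS.items():
        if value in domain:
            return category
    return None


def single_value_vote(values):
    """Apply the single-value classifier to each value independently,
    then take a majority vote over the per-value labels."""
    labels = [classify_single(v) for v in values]
    labels = [label for label in labels if label is not None]
    if not labels:
        return None
    return Counter(labels).most_common(1)[0][0]


def classify_multi(values):
    """Assumed set-level alternative: pick the category whose domain
    covers the largest number of values in the whole set."""
    def coverage(category):
        return sum(1 for v in values if v in DOMAINS[category])

    best = max(DOMAINS, key=coverage)
    return best if coverage(best) > 0 else None


column = ["Paris", "Jordan", "Austin", "London"]
print(single_value_vote(column))  # first_name
print(classify_multi(column))     # city
```

Three of the four values are ambiguous, so the per-value classifier resolves each of them to `first_name` and the vote follows. Only `city` is consistent with every value in the column, which is what the set-level view picks up.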
Chen Xiong, Xiangyu Qi, et al.
ACL 2025
Zhiyuan He, Yijun Yang, et al.
ICML 2024
Teryl Taylor, Frederico Araujo, et al.
Big Data 2020