Inductive learning in less than one sequential data scan
Wei Fan, Haixun Wang, et al.
IJCAI 2003
Classification is an important data analysis tool that uses a model built from historical data to predict class labels for new observations. More and more applications are featuring data streams, rather than finite stored data sets, which are a challenge for traditional classification algorithms. Concept drifts and skewed distributions, two common properties of data stream applications, make the task of learning in streams difficult. The authors aim to develop a new approach to classify skewed data streams that uses an ensemble of models to match the distribution over under-samples of negatives and repeated samples of positives. © 2008 IEEE.
Wei Fan, Haixun Wang, et al.
IJCAI 2003
Mohammad M. Masud, Qing Chen, et al.
IEEE TKDE
Xiaoxiao Shi, Qi Liu, et al.
SDM 2010
Wei Fan, Yi-an Huang, et al.
SDM 2004