LIBSVM Data: Classification (Binary Class)

This page contains many classification, regression, and multi-label data sets used in our papers. Many are from UCI, Statlog, StatLib, and other collections. We thank them for their efforts. For most sets, we directly transform the file into LIBSVM format and linearly scale each attribute to [-1,1]. The testing data (if provided) is adjusted accordingly. Some training data are further split into "training" (tr) and "validation" (val) sets. Details can be found in the description of each data set.
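
As a rough illustration of the preprocessing described above, the following Python sketch parses lines in LIBSVM format (`label index:value ...`), computes per-attribute ranges from the training data, and linearly maps each attribute to [-1,1]. It is not the procedure actually used to build these files. The function names and sample values are hypothetical, and two things are assumed: every attribute is listed explicitly in every row (zeros omitted by the sparse format are not handled), and "adjusted accordingly" means the testing data reuses the ranges found on the training data.

```python
from typing import Dict, List, Tuple

Row = Dict[int, float]  # attribute index -> value


def parse_libsvm_line(line: str) -> Tuple[float, Row]:
    """Parse one 'label index:value ...' line into (label, attributes)."""
    parts = line.split()
    label = float(parts[0])
    attributes: Row = {}
    for item in parts[1:]:
        index, value = item.split(":")
        attributes[int(index)] = float(value)
    return label, attributes


def fit_ranges(rows: List[Row]) -> Dict[int, Tuple[float, float]]:
    """Record the (min, max) of each attribute over the given rows."""
    ranges: Dict[int, Tuple[float, float]] = {}
    for row in rows:
        for index, value in row.items():
            low, high = ranges.get(index, (value, value))
            ranges[index] = (min(low, value), max(high, value))
    return ranges


def scale_row(row: Row, ranges: Dict[int, Tuple[float, float]],
              lower: float = -1.0, upper: float = 1.0) -> Row:
    """Linearly map each attribute from its fitted range to [lower, upper]."""
    scaled: Row = {}
    for index, value in row.items():
        if index not in ranges:
            continue  # attribute never seen when fitting (assumption: drop it)
        low, high = ranges[index]
        if high == low:
            continue  # constant attribute: no range to scale (assumption: drop it)
        scaled[index] = lower + (upper - lower) * (value - low) / (high - low)
    return scaled


# Hypothetical sample data, not taken from any data set on this page.
train_lines = ["+1 1:2 2:10", "-1 1:4 2:30", "+1 1:6 2:20"]
test_lines = ["-1 1:8 2:15"]

train = [parse_libsvm_line(line) for line in train_lines]
test = [parse_libsvm_line(line) for line in test_lines]

# Ranges come from the training rows only and are reused for the test rows.
ranges = fit_ranges([attributes for _, attributes in train])
train_scaled = [(label, scale_row(attributes, ranges)) for label, attributes in train]
test_scaled = [(label, scale_row(attributes, ranges)) for label, attributes in test]
```

Under this assumption, a test value that lies outside the training range maps outside [-1,1]: attribute 1 in the sample test row exceeds the training maximum, so its scaled value is greater than 1.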


a1a

a2a

a3a

a4a

a5a

a6a

a7a

a8a

a9a

australian

breast-cancer

colon-cancer

covtype.binary

diabetes

duke breast-cancer

fourclass

german.numer

heart

ijcnn1

ionosphere

leukemia

liver-disorders

mushrooms

news20.binary

rcv1.binary

real-sim

splice

sonar

svmguide1

svmguide3

w1a

w2a

w3a

w4a

w5a

w6a

w7a

w8a