LIBSVM Data: Classification (Multi-class)

This page contains many classification, regression, multi-label, and string data sets stored in LIBSVM format. For some sets, raw materials (e.g., original texts) are also available. These data sets come from UCI, Statlog, StatLib, and other collections; we thank them for their efforts. For most sets, we linearly scale each attribute to [-1,1] or [0,1]. The testing data (if provided) are adjusted accordingly. Some training data are further split into "training" (tr) and "validation" (val) sets. Details can be found in the description of each data set. To read the data in MATLAB, you can use "libsvmread" from the LIBSVM package.
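For readers working outside MATLAB, the following is a minimal Python sketch of the linear scaling described above. It is not the code used to prepare these data sets. It assumes the usual LIBSVM text layout of one sample per line, `<label> <index>:<value> ...` with 1-based feature indices and omitted features equal to zero, and it assumes that "adjusted accordingly" means the testing data are mapped with the minimum and maximum found on the training data. The function names `parse_libsvm_line`, `fit_ranges`, and `scale_rows` are hypothetical and chosen only for illustration.

```python
def parse_libsvm_line(line, num_features):
    """Parse one '<label> <index>:<value> ...' line into (label, dense row).

    Assumes 1-based feature indices; features absent from the line are 0.
    """
    parts = line.split()
    label = float(parts[0])
    row = [0.0] * num_features
    for item in parts[1:]:
        index, value = item.split(":")
        row[int(index) - 1] = float(value)
    return label, row


def fit_ranges(rows):
    """Return the (min, max) of every attribute over the given rows."""
    return [(min(column), max(column)) for column in zip(*rows)]


def scale_rows(rows, ranges, lower=-1.0, upper=1.0):
    """Linearly map each attribute from [min, max] to [lower, upper].

    The ranges should come from the training data, so testing values
    can fall outside [lower, upper]. Constant attributes (min == max)
    are left unchanged, which is a choice made for this sketch.
    """
    scaled = []
    for row in rows:
        new_row = []
        for value, (lo, hi) in zip(row, ranges):
            if hi == lo:
                new_row.append(value)
            else:
                new_row.append(lower + (upper - lower) * (value - lo) / (hi - lo))
        scaled.append(new_row)
    return scaled
```

A typical use is to call `fit_ranges` once on the training rows and then pass the same ranges to `scale_rows` for both the training and the testing rows; pass `lower=0.0` for the [0,1] variant.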


aloi

cifar10

connect-4

covtype

dna

glass

imdb-rating

iris

LEDGAR (LexGLUE)

letter

mnist

mnist8m

news20

news20 (18,846)

pendigits

poker

protein

rcv1.multiclass

SCOTUS (LexGLUE)

satimage

sector

segment

Sensorless

shuttle

smallNORB

SVHN

svmguide2

svmguide4

usps

SensIT Vehicle (acoustic)

SensIT Vehicle (seismic)

SensIT Vehicle (combined)

vehicle

vowel

wine