LIBSVM Data: Regression
This page contains many classification, regression, and
multi-label data sets used in our papers. Many
are from UCI, Statlog, StatLib and other collections. We
really thank their efforts. For most sets, we directly transform the file
into LIBSVM format and linearly scale each attribute to [-1,1]. The testing data (if provided)
is adjusted accordingly. Some training data are further separated
to "training" (tr) and "validation" (val) sets. Details can be
found in the description of each data set.
abalone
- Source:
UCI
/ Abalone
- # of data:
4,177
- # of features:
8
- Files:
bodyfat
- Source:
StatLib
/ bodyfat
- # of data:
252
- # of features:
14
- Files:
cadata
- Source:
StatLib
/ houses.zip
- # of data:
20,640
- # of features:
8
- Files:
cpusmall
- Source:
Delve
/ comp-activ
- # of data:
8,192
- # of features:
12
- Files:
housing
- Source:
UCI
/ Housing (Boston)
- # of data:
506
- # of features:
13
- Files:
mg
- Source:
[GWF01a]
- # of data:
1,385
- # of features:
6
- Files:
mpg
- Source:
UCI
/ Auto-Mpg
- # of data:
392
- # of features:
7
- Files:
pyrim
- Source:
UCI
/ Qualitative Structure Activity Relationships
- # of data:
74
- # of features:
27
- Files:
space_ga
- Source:
StatLib
/ space_ga
- # of data:
3,107
- # of features:
6
- Files:
triazines
- Source:
UCI
/ Qualitative Structure Activity Relationships
- # of data:
186
- # of features:
60
- Files: