Table 3

Confusion matrix for Euclidean distance when 10 nearest neighbors were retrieved. The values in the matrix are the percentage of the signals retrieved from each category (rows) when the example was from the certain category (columns).

Inside a car

In a restaurant

Road

Jazz

Drums

Popular

Classical

Humming

Singing

Whistling

Speaker1

Speaker2

Speaker3

Speaker4

Speaker5

Speaker6

Speaker7

Inside a car

99.5

1.2

4.7

0

0

0

0

0

0

0

0

0

0

0

0

0

0

In a restaurant

0

98.8

2.6

0

0

0

0

0

0

0

0

0

0

0

0

0

0

Road

0.2

0

92.4

0

0

0

0

0

0

0

0

0

0

0

0

0

0


Jazz

0.1

0

0

90.2

0

11.6

5.9

0.2

0.2

0.4

0

0

0

0

0

0

0

Drums

0

0

0

0

93.6

0.1

0

0

0

0

0

0

0

0

0

0

0

Popular

0

0

0.3

8.4

0.2

87.8

13.5

0

0.2

0.9

0

0

0

0

0

0

0

Classical

0

0

0

0.5

0

0.5

78.0

0

0

0.4

0

0

0

0

0

0

0


Humming

0

0

0

0.4

0

0

0.6

90.8

4.0

0.5

0

0

0

0

0

0

0

Singing

0

0

0

0.2

1.1

0

0.4

3.7

93.5

0.4

0

0

0

0

0

0

0

Whistling

0

0

0

0.1

0.7

0

0.4

0

0.5

97.9

0

0

0

0

0

0

0


Speaker1

0

0

0

0

0

0

0.8

0.4

0

0

100

0

0

0

0

0

0

Speaker2

0

0

0

0

0

0

0

0

0

0

0

99.9

2.1

0

0

0

0

Speaker3

0

0

0

0

0

0

0

0

0

0

0

0.1

97.7

0

0

0.3

0

Speaker4

0.2

0

0

0

0

0

0

0

0

0

0

0

0

100

0

0

0

Speaker5

0

0

0

0

0

0

0

0.3

1.8

0

0

0

0

0

100

0

0

Speaker6

0

0

0

0

0

0

0

0

0

0

0

0

0.2

0

0

99.7

0

Speaker7

0

0

0

0

4.5

0

0.4

0

0

1.3

0

0

0

0

0

0

100


HelĂ©n and Virtanen EURASIP Journal on Audio, Speech, and Music Processing 2010 2010:179303   doi:10.1155/2010/179303

Open Data