Conference paper
The thirteen colors of timbre
Hiroko Terasawa, Malcolm Slaney, et al.
WASPAA 2005
This paper describes a system for connecting sounds and words in linked multi-dimensional vector spaces. The acoustic space is represented using anchor models and partitioned using agglomerative clustering. The semantic space is modeled by a hierarchical multinomial clustering model. Nodes in one space are linked by probabilistic models to the other space. With these linked models, users retrieve sounds with natural language, and the system describes new sounds with words.
Hiroko Terasawa, Malcolm Slaney, et al.
WASPAA 2005
Scott Axelrod
ICASSP 2002
Malcolm Slaney, Jayashree Subrahmonia, et al.
UM 2003
Malcolm Slaney, Michcle Covell
NeurIPS 2000