Using Compressed Audio-visual Words for Multi-modal Scene Classification
(2014)
Conference Proceeding
Kurcius, J., & Breckon, T. (2014). Using Compressed Audio-visual Words for Multi-modal Scene Classification. In Proc. International Workshop on Computational Intelligence for Multimedia Understanding (1-5). https://doi.org/10.1109/IWCIM.2014.7008808
We present a novel approach to scene classification using combined audio signal and video image features and compare this methodology to scene classification results using each modality in isolation. Each modality is represented using summary feature... Read More about Using Compressed Audio-visual Words for Multi-modal Scene Classification.