Learning to localize sound source in visual scenes A Senocak, TH Oh, J Kim, MH Yang, IS Kweon Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 381 | 2018 |
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications A Senocak, TH Oh, J Kim, MH Yang, IS Kweon IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (5), 1605-1619, 2021 | 58 | 2021 |
Part-based Player Identification using Deep Convolutional Representation and Multi-scale Pooling A Senocak, TH Oh, J Kim, IS Kweon Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 50 | 2018 |
Learning Sound Localization Better From Semantically Similar Samples A Senocak*, H Ryu*, J Kim*, IS Kweon ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2022 | 35 | 2022 |
Sound to visual scene generation by audio-to-visual latent alignment K Sung-Bin, A Senocak, H Ha, A Owens, TH Oh Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 31 | 2023 |
Less can be more: Sound source localization with a classification model A Senocak*, H Ryu*, J Kim*, IS Kweon Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022 | 26 | 2022 |
Sound source localization is all about cross-modal alignment A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 15 | 2023 |
MarginNCE: Robust Sound Localization with a Negative Margin S Park*, A Senocak*, JS Chung ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023 | 14 | 2023 |
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding A Senocak*, J Kim*, TH Oh, D Li, IS Kweon Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 11 | 2023 |
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning MH Erol, A Senocak, J Feng, JS Chung arXiv preprint arXiv:2406.03344, 2024 | 10 | 2024 |
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples H Ryu*, A Senocak*, IS Kweon, JS Chung ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023 | 7 | 2023 |
Can CLIP Help Sound Source Localization? S Park*, A Senocak*, JS Chung Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 6 | 2024 |
FlexiAST: Flexibility is What AST Needs J Feng, MH Erol, JS Chung, A Senocak Interspeech, 2023 | 3 | 2023 |
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers J Feng, MH Erol, JS Chung, A Senocak ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models K Sung-Bin, O Hyun-Bin, JM Lee, A Senocak, JS Chung, TH Oh arXiv preprint arXiv:2410.18325, 2024 | 1 | 2024 |
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung arXiv preprint arXiv:2407.13676, 2024 | 1 | 2024 |
Speech Guided Masked Image Modeling for Visually Grounded Speech J Woo, H Ryu, A Senocak, JS Chung ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment K Sung-Bin, A Senocak, H Ha, TH Oh arXiv preprint arXiv:2412.06209, 2024 | | 2024 |
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions J Feng, MH Erol, JS Chung, A Senocak arXiv preprint arXiv:2407.08691, 2024 | | 2024 |