Follow
Arda Senocak
Arda Senocak
Verified email at kaist.ac.kr - Homepage
Title
Cited by
Cited by
Year
Learning to localize sound source in visual scenes
A Senocak, TH Oh, J Kim, MH Yang, IS Kweon
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
3812018
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications
A Senocak, TH Oh, J Kim, MH Yang, IS Kweon
IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (5), 1605-1619, 2021
582021
Part-based Player Identification using Deep Convolutional Representation and Multi-scale Pooling
A Senocak, TH Oh, J Kim, IS Kweon
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
502018
Learning Sound Localization Better From Semantically Similar Samples
A Senocak*, H Ryu*, J Kim*, IS Kweon
ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2022
352022
Sound to visual scene generation by audio-to-visual latent alignment
K Sung-Bin, A Senocak, H Ha, A Owens, TH Oh
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
312023
Less can be more: Sound source localization with a classification model
A Senocak*, H Ryu*, J Kim*, IS Kweon
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022
262022
Sound source localization is all about cross-modal alignment
A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
152023
MarginNCE: Robust Sound Localization with a Negative Margin
S Park*, A Senocak*, JS Chung
ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023
142023
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding
A Senocak*, J Kim*, TH Oh, D Li, IS Kweon
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023
112023
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
MH Erol, A Senocak, J Feng, JS Chung
arXiv preprint arXiv:2406.03344, 2024
102024
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
H Ryu*, A Senocak*, IS Kweon, JS Chung
ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023
72023
Can CLIP Help Sound Source Localization?
S Park*, A Senocak*, JS Chung
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
62024
FlexiAST: Flexibility is What AST Needs
J Feng, MH Erol, JS Chung, A Senocak
Interspeech, 2023
32023
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers
J Feng, MH Erol, JS Chung, A Senocak
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
22024
AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models
K Sung-Bin, O Hyun-Bin, JM Lee, A Senocak, JS Chung, TH Oh
arXiv preprint arXiv:2410.18325, 2024
12024
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment
A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung
arXiv preprint arXiv:2407.13676, 2024
12024
Speech Guided Masked Image Modeling for Visually Grounded Speech
J Woo, H Ryu, A Senocak, JS Chung
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment
K Sung-Bin, A Senocak, H Ha, TH Oh
arXiv preprint arXiv:2412.06209, 2024
2024
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions
J Feng, MH Erol, JS Chung, A Senocak
arXiv preprint arXiv:2407.08691, 2024
2024
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
M Hamza Erol, A Senocak, J Feng, J Son Chung
arXiv e-prints, arXiv: 2406.03344, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20