Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE Task 3 format)
sound-detection
sound-localization
audio-visual-learning
seldnet
yolov5
seld
yolov8
dcase2023
detic
audio-visual-seld
Updated Jul 4, 2024 - Python