Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language
| Title: | Separating the “Chirp” from the “Chat”: Self-supervised Visual Grounding of Sound and Language |
|---|---|
| Authors: | Hamilton, Mark; Zisserman, Andrew; Hershey, John R.; Freeman, William T. |
| Source: | 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2024 IEEE/CVF Conference on. :13117-13127 Jun, 2024 |
| Relation: | 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
| Database: | IEEE Xplore Digital Library |