NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Sreyan Ghosh
|
Sonal Kumar
|
Ashish Seth
|
Chandra Kiran Reddy Evuru
|
Utkarsh Tyagi
|
S Sakshi
|
Oriol Nieto
|
Ramani Duraiswami
|
Dinesh Manocha
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://sreyan88.github.io/gamaaudio/
https://github.com/FThompson/BBCSoundDownloader
https://research.google.com/audioset/download.html
https://huggingface.co/datasets/cvssp/WavCaps
https://www.robots.ox.ac.uk/
https://research.google.com/audioset/download.html
https://zenodo.org/records/5114771
https://sound-effects.bbcrewind.co.uk/
https://research.google.com/audioset/download.html
https://zenodo.org/records/4783391
https://labs.freesound.org/datasets/
https://www.kaggle.com/datasets/soumendraprasad/musical-
https://soundbible.com/
https://github.com/microsoft/WavText5K
https://github.com/seungheondoh/music_caps_dl
https://www.kaggle.com/datasets/andradaolteanu/gtzan-
https://zenodo.org/records/1344103
https://zenodo.org/records/1344103
https://www.kaggle.com/datasets/modaresimr/sound-
https://zenodo.org/records/4060432
https://www.tensorflow.org/datasets/catalog/nsynth
https://zenodo.org/records/6473207
https://pytorch.org/
https://huggingface.co/
https://github.com/RetroCirce/HTS-Audio-Transformer
https://github.com/LAION-AI/CLAP/tree/main
https://github.com/Sreyan88/CompA
https://github.com/microsoft/CLAP
https://github.com/descriptinc/lyrebird-wav2clip
https://github.com/AndreyGuzhov/AudioCLIP
https://github.com/akoepke/audio-retrieval-benchmark
https://github.com/akoepke/audio-retrieval-benchmark
https://github.com/microsoft/pengi
https://github.com/YuanGongND/ltu
https://github.com/aigc-audio/audiogpt
https://github.com/bytedance/salmonn
https://github.com/QwenLM/Qwen-Audio
Field Of Study