NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
ALVR - 2024
Total Papers:- 19
Total Papers accross all years:- 35
Total Citations :- 0
1
2
»
How and where does CLIP process negation?
Vincent Quantmeyer |
Pablo Mosteiro |
Albert Gatt |
LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-Tailed Multi-Label Visual Recognition
Peng Xia |
Di Xu |
Ming Hu |
Lie Ju |
Zongyuan Ge |
Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
Malvina Nikandrou |
Georgios Pantazopoulos |
Ioannis Konstas |
Alessandro Suglia |
WISMIR3: A Multi-Modal Dataset to Challenge Text-Image Retrieval Approaches
Florian Schneider |
Chris Biemann |
Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities
Shiyu Xia |
Junyu Xiong |
Haoyu Dong |
Jianbo Zhao |
Yuzhang Tian |
Mengyu Zhou |
Yeye He |
Shi Han |
Dongmei Zhang |
Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR)
Jing Gu |
Tsu-Jui (Ray) Fu |
Drew Hudson |
Asli Celikyilmaz |
William Wang |
mBLIP: Efficient Bootstrapping of Multilingual Vision-LLMs
Gregor Geigle |
Abhay Jain |
Radu Timofte |
Goran Glavaš |
Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models
Holy Lovenia |
Wenliang Dai |
Samuel Cahyawijaya |
Ziwei Ji |
Pascale Fung |
VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool
Yan Wang |
Yawen Zeng |
Jingsheng Zheng |
Xiaofen Xing |
Jin Xu |
Xiangmin Xu |
Wiki-VEL: Visual Entity Linking for Structured Data on Wikimedia Commons
Philipp Bielefeld |
Jasmin Geppert |
Necdet Güven |
Melna John |
Adrian Ziupka |
Lucie-Aimée Kaffee |
Russa Biswas |
Gerard De Melo |
Improving Vision-Language Cross-Lingual Transfer with Scheduled Unfreezing
Max Reinhardt |
Gregor Geigle |
Radu Timofte |
Goran Glavaš |
Causal and Temporal Inference in Visual Question Generation by Utilizing Pre-trained Models
Zhanghao Hu |
Frank Keller |
VerbCLIP: Improving Verb Understanding in Vision-Language Models with Compositional Structures
Hadi Wazni |
Kin Lo |
Mehrnoosh Sadrzadeh |
English-to-Japanese Multimodal Machine Translation Based on Image-Text Matching of Lecture Videos
Ayu Teramen |
Takumi Ohtsuka |
Risa Kondo |
Tomoyuki Kajiwara |
Takashi Ninomiya |
Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models
Wanrong Zhu |
Ruiyi Zhang |
Jennifer Healey |
William Yang Wang |
Tong Sun |
Conference Topic Distribution
Linguistic
Task
Approach
Language
Dataset
Conference Citation Distribution
Conference Papers have no Citations yet
Topics