NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
TrustNLP - 2024
Total Papers:- 22
Total Papers accross all years:- 40
Total Citations :- 0
1
2
»
Holistic Evaluation of Large Language Models: Assessing Robustness, Accuracy, and Toxicity for Real-World Applications
David Cecchini |
Arshaan Nazir |
Kalyan Chakravarthy |
Veysel Kocaman |
Masking Latent Gender Knowledge for Debiasing Image Captioning
Fan Yang |
Shalini Ghosh |
Emre Barut |
Kechen Qin |
Prashan Wanigasekara |
Chengwei Su |
Weitong Ruan |
Rahul Gupta |
BELIEVE: Belief-Enhanced Instruction Generation and Augmentation for Zero-Shot Bias Mitigation
Lisa Bauer |
Ninareh Mehrabi |
Palash Goyal |
Kai-Wei Chang |
Aram Galstyan |
Rahul Gupta |
When XGBoost Outperforms GPT-4 on Text Classification: A Case Study
Matyas Bohacek |
Michal Bravansky |
Exploring Causal Mechanisms for Machine Text Detection Methods
Kiyoon Yoo |
Wonhyuk Ahn |
Yeji Song |
Nojun Kwak |
FairBelief - Assessing Harmful Beliefs in Language Models
Mattia Setzu |
Marta Marchiori Manerba |
Pasquale Minervini |
Debora Nozza |
Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models
Tobias Groot |
Matias Valdenegro - Toro |
FactAlign: Fact-Level Hallucination Detection and Classification Through Knowledge Graph Alignment
Mohamed Rashad |
Ahmed Zahran |
Abanoub Amin |
Amr Abdelaal |
Mohamed Altantawy |
Cross-Task Defense: Instruction-Tuning LLMs for Content Safety
Yu Fu |
Wen Xiao |
Jia Chen |
Jiachen Li |
Evangelos Papalexakis |
Aichi Chien |
Yue Dong |
Automated Adversarial Discovery for Safety Classifiers
Yash Kumar Lal |
Preethi Lahoti |
Aradhana Sinha |
Yao Qin |
Ananth Balashankar |
Introducing GenCeption for Multimodal LLM Benchmarking: You May Bypass Annotations
Lele Cao |
Valentin Buchner |
Zineb Senane |
Fangkai Yang |
HGOT: Hierarchical Graph of Thoughts for Retrieval-Augmented In-Context Learning in Factuality Evaluation
Yihao Fang |
Stephen Thomas |
Xiaodan Zhu |
On the Interplay between Fairness and Explainability
Stephanie Brandl |
Emanuele Bugliarello |
Ilias Chalkidis |
Tell Me Why: Explainable Public Health Fact-Checking with Large Language Models
Majid Zarharan |
Pascal Wullschleger |
Babak Behkam Kia |
Mohammad Taher Pilehvar |
Jennifer Foster |
Tweak to Trust: Assessing the Reliability of Summarization Metrics in Contact Centers via Perturbed Summaries
Kevin Patel |
Suraj Agrawal |
Ayush Kumar |
Conference Topic Distribution
Linguistic
Task
Approach
Language
Dataset
Conference Citation Distribution
Conference Papers have no Citations yet
Topics