NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
WS - 2025
Total Papers:- 2561
Total Papers accross all years:- 24858
Total Citations :- 0
«
117
118
119
120
121
122
123
124
125
126
127
»
Lost in Variation? Evaluating NLI Performance in Basque and Spanish Geographical Variants
Jaione Bengoetxea |
Itziar Gonzalez-Dios |
Rodrigo Agerri |
Overview of the SciHal25 Shared Task on Hallucination Detection for Scientific Content
Dan Li |
Bogdan Palfi |
Colin Zhang |
Jaiganesh Subramanian |
Adrian Raudaschl |
Yoshiko Kakita |
Anita De Waard |
Zubair Afzal |
Georgios Tsatsaronis |
StRuCom: A Novel Dataset of Structured Code Comments in Russian
Maria Dziuba |
Valentin Malykh |
Testing Spatial Intuitions of Humans and Large Language and Multimodal Models in Analogies
Ivo Bueno |
Anna Bavaresco |
João Miguel Cunha |
Philipp Wicke |
Towards a Principled Evaluation of Knowledge Editors
Sebastian Pohl |
Max Ploner |
Alan Akbik |
PROTECT: Policy-Related Organizational Value Taxonomy for Ethical Compliance and Trust
Avni Mittal |
Sree Hari Nagaralu |
Sandipan Dandapat |
Power(ful) Associations: Rethinking “Stereotype” for NLP
Hannah Devinney |
Leveraging Generative AI for Enhancing Automated Assessment in Programming Education Contests
Stefan Dascalescu |
Marius Dumitran |
Mihai Alexandru Vasiluta |
Building Japanese Creativity Benchmarks and Applying them to Enhance LLM Creativity
So Fukuda |
Hayato Ogawa |
Kaito Horio |
Daisuke Kawahara |
Tomohide Shibata |
Another Approach to Agreement Measurement and Prediction with Emotion Annotations
Quanqi Du |
Veronique Hoste |
Adapting LLMs for Minimal-edit Grammatical Error Correction
Ryszard Staruch |
Filip Gralinski |
Daniel Dzienisiewicz |
Large Language Models for Education: Understanding the Needs of Stakeholders, Current Capabilities and the Path Forward
Sankalan Pal Chowdhury |
Nico Daheim |
Ekaterina Kochmar |
Jakub Macina |
Donya Rooein |
Mrinmaya Sachan |
Shashank Sonkar |
A Framework for Large-Scale Parallel Corpus Evaluation: Ensemble Quality Estimation Models Versus Human Assessment
Dmytro Chaplynskyi |
Kyrylo Zakharov |
Temporalizing Confidence: Evaluation of Chain-of-Thought Reasoning with Signal Temporal Logic
Zhenjiang Mao |
Artem Bisliouk |
Rohith Nama |
Ivan Ruchkin |
Questioning Our Questions: How Well Do Medical QA Benchmarks Evaluate Clinical Capabilities of Language Models?
Siun Kim |
Hyung-Jin Yoon |
Conference Topic Distribution
Linguistic
Task
Approach
Language
Dataset
Conference Citation Distribution
Conference Papers have no Citations yet
Topics