NLPExplorer

BlackboxNLP - 2024

Total Papers:- 36

Total Papers accross all years:- 176

Total Citations :- 0

« 1 2 3

Mechanistic?

Naomi Saphra | Sarah Wiegreffe |

Copy Suppression: Comprehensively Understanding a Motif in Language Model Attention Heads

Callum Stuart McDougall | Arthur Conmy | Cody Rushing | Thomas McGrath | Neel Nanda |

Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models

Carina Kauf | Emmanuele Chersoni | Alessandro Lenci | Evelina Fedorenko | Anna A Ivanova |

Enhancing adversarial robustness in Natural Language Inference using explanations

Alexandros Koulakos | Maria Lymperaiou | Giorgos Filandrianos | Giorgos Stamou |

Uncovering Syllable Constituents in the Self-Attention-Based Speech Representations of Whisper

Erfan A Shams | Iona Gessinger | Julie Carson-Berndsen |

An Adversarial Example for Direct Logit Attribution: Memory Management in GELU-4L

Jett Janiak | Can Rager | James Dao | Yeu-Tong Lau |

Conference Topic Distribution

Conference Citation Distribution

Conference Papers have no Citations yet

Topics