Best Papers

Best Paper Awards

  • Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
    Lean Wang, Lei Li, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou and Xu Sun

  • Faster Minimum Bayes Risk Decoding with Confidence-based Pruning
    Julius Cheng and Andreas Vlachos

  • Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition
    Sander V Schulhoff, Jeremy Pinto, Anaum Khan, Louis-François Bouchard, Chenglei Si, Svetlina Anati, Valen Tagliabue, Anson Liu Kost, Christopher R Carnahan and Jordan Lee Boyd-Graber

  • PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents
    Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chee Chang, Russell Authur, Erin Bransom, Stefan Candra, Yoganand Chandrasekhar, Regan Huff, Bailey Kuehl, Amanpreet Singh, Chris Wilhelm, Angele Zamarron, Marti A. Hearst, Daniel Weld, Doug Downey and Luca Soldaini

  • Personalized Dense Retrieval on Global Index for Voice-enabled Conversational Systems
    Masha Belyi, Charlotte Dzialo, Chaitanya Dwivedi, Prajit Reddy Muppidi and Kanna Shimizu

Outstanding Papers

  • LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers
    Theo X. Olausson, Alex Gu, Ben Lipkin, Cedegao E. Zhang, Armando Solar-Lezama, Joshua B. Tenenbaum and Roger P. Levy
  • Toward a Critical Toponymy Framework for Named Entity Recognition: A Case Study of Airbnb in New York City
    Mikael Brunila, Jack LaViolette, Sky CH-Wang, Priyanka Verma, Clara Féré and Grant McKenzie
  • SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
    Hyunwoo Kim, Jack Hessel, Liwei Jiang, Peter West, Ximing Lu, Youngjae Yu, Pei Zhou, Ronan Le Bras, Malihe Alikhani, Gunhee Kim, Maarten Sap and Yejin Choi
  • Incorporating Worker Perspectives into MTurk Annotation Practices for NLP
    Olivia Huang, Eve Fleisig and Dan Klein
  • Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors
    Nikita Mehandru, Sweta Agrawal, Yimin Xiao, Ge Gao, Elaine C Khoong, Marine Carpuat and Niloufar Salehi
  • Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction
    Ji Qi, Chuchun Zhang, Xiaozhi Wang, Kaisheng Zeng, Jifan Yu, Jinxin Liu, Lei Hou, Juanzi Li and Xu Bin
  • Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
    Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin and Zhaochun Ren
  • Unraveling Feature Extraction Mechanisms in Neural Networks
    Xiaobing Sun, Jiaxi Li and Wei Lu
  • ViPE: Visualise Pretty-much Everything
    Hassan Shahmohammadi, Adhiraj Ghosh and Hendrik Lensch
  • Revisiting the Optimality of Word Lengths
    Tiago Pimentel, Clara Meister, Ethan Wilcox, Kyle Mahowald and Ryan Cotterell
  • Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages
    Ethan Wilcox, Clara Meister, Ryan Cotterell and Tiago Pimentel
  • FedID: Federated Interactive Distillation for Large-Scale Pretraining Language Models
    Xinge Ma, Jiangming Liu, Jin Wang and Xuejie Zhang
  • Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration
    Daniel Deutsch, George Foster and Markus Freitag
  • Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
    Jirui Qi, Raquel Fernández and Arianna Bisazza
  • Look-back Decoding for Open-Ended Text Generation
    Nan Xu, Chunting Zhou, Asli Celikyilmaz and Xuezhe Ma
  • Text Embeddings Reveal (Almost) As Much As Text
    John Xavier Morris, Volodymyr Kuleshov, Vitaly Shmatikov and Alexander M Rush
  • Understanding Compositional Data Augmentation in Typologically Diverse Morphological Inflection
    Farhan Samir and Miikka Silfverberg
  • IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions
    Wenhao Yu, Meng Jiang, Peter Clark and Ashish Sabharwal
  • Counter Turing Test (CT2): AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index (ADI)
    Megha Chakraborty, S.M Towhidul Islam Tonmoy, S M Mehedi Zaman, Shreya Gautam, Tanay Kumar, Krish Sharma, Niyar R Barman, Chandan Gupta, Vinija Jain, Aman Chadha, Amit P. Sheth and Amitava Das
  • Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations
    James Y. Huang, Wenlin Yao, Kaiqiang Song, Hongming Zhang, Muhao Chen and Dong Yu
  • The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis
    Pranav Narayanan Venkit, Mukund Srinath, Sanjana Gautam, Saranya Venkatraman, Vipul Gupta, Rebecca J. Passonneau and Shomir Wilson
  • Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers
    Hosein Mohebbi, Grzegorz Chrupała, Willem Zuidema and Afra Alishahi
  • Background Summarization of Event Timelines
    Adithya Pratapa, Kevin Small and Markus Dreyer
  • Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective
    Tianyu Liu, Afra Amini, Mrinmaya Sachan and Ryan Cotterell
  • Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations
    Zhengyuan Liu, Siti Umairah Md Salleh, Hong Choon Oh, Pavitra Krishnaswamy and Nancy Chen