Main Conference
Long Papers
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions
Zhebin Zhang, Xinyu Zhang, Yuanhang Ren, Saijiang Shi, Meng Han, Yongkang Wu, Ruofei Lai, Zhao Cao
Absolute Position Embedding Learns Sinusoid-like Waves for Attention Based on Relative Position
Yuji Yamamoto, Takuya Matsuzaki
Chinese Lexical Substitution: Dataset and Method
Jipeng Qiang, Kang Liu, Ying Li, Yun Li, Yi Zhu, Yun-Hao Yuan, Xiaocheng Hu, Xiaoye Ouyang
Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response Forecasting
Chenkai Sun, Jinning Li, Yi Fung, Hou Chan, Tarek Abdelzaher, ChengXiang Zhai, Heng Ji
Holistic Inter-Annotator Agreement and Corpus Coherence Estimation in a Large-scale Multilingual Annotation Campaign
Nicolas Stefanovitch, Jakub Piskorski
PHD: Pixel-Based Language Modeling of Historical Documents
Nadav Borenstein, Phillip Rust, Desmond Elliott, Isabelle Augenstein
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension
Akira Kawabata, Saku Sugawara
Evaluating and Modeling Attribution for Cross-Lingual Question Answering
Benjamin Muller, John Wieting, Jonathan Clark, Tom Kwiatkowski, Sebastian Ruder, Livio Soares, Roee Aharoni, Jonathan Herzig, Xinyi Wang
Sparse Universal Transformer
Shawn Tan, Yikang Shen, Zhenfang Chen, Aaron Courville, Chuang Gan
Theory of Mind for Multi-Agent Collaboration via Large Language Models
Huao Li, Yu Chong, Simon Stepputtis, Joseph Campbell, Dana Hughes, Charles Lewis, Katia Sycara
Let’s Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought
Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Wang
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP
Md Tawkat Islam Khondaker, Abdul Waheed, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed
Dual-Channel Span for Aspect Sentiment Triplet Extraction
Pan Li, Ping Li, Kai Zhang
Cultural Concept Adaptation on Multimodal Reasoning
Zhi Li, Yin Zhang
Understanding Compositional Data Augmentation in Typologically Diverse Morphological Inflection
Farhan Samir, Miikka Silfverberg
Evaluating Object Hallucination in Large Vision-Language Models
Yifan Li, Yifan Du, Kun Zhou, Jinpeng Wang, Xin Zhao, Ji-Rong Wen
Event Ontology Completion with Hierarchical Structure Evolution Networks
Pengfei Cao, Yupu Hao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao
Parameter-efficient Tuning for Large Language Model without Calculating Its Gradients
Feihu Jin, Jiajun Zhang, Chengqing Zong
Discourse Structures Guided Fine-grained Propaganda Identification
Yuanyuan Lei, Ruihong Huang
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models
Benjamin Minixhofer, Jonas Pfeiffer, Ivan Vulić
Improving Image Captioning via Predicting Structured Concepts
Ting Wang, Weidong Chen, Yuanhe Tian, Yan Song, Zhendong Mao
GATITOS: Using a New Multilingual Lexicon for Low-resource Machine Translation
Alexander Jones, Isaac Caswell, Orhan Firat, Ishank Saxena
Continually Improving Extractive QA via Human Feedback
Ge Gao, Hung-Ting Chen, Yoav Artzi, Eunsol Choi
Using Interpretation Methods for Model Enhancement
Zhuo Chen, Chengyue Jiang, Kewei Tu
An Expression Tree Decoding Strategy for Mathematical Equation Generation
Wenqi Zhang, Yongliang Shen, Qingpeng Nong, Zeqi Tan, Yanna Ma, Weiming Lu
Diversity Enhanced Narrative Question Generation for Storybooks
Hokeun Yoon, JinYeong Bak
Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification
Chengyu Dong, Zihan Wang, Jingbo Shang
How to Enhance Causal Discrimination of Utterances: A Case on Affective Reasoning
Hang Chen, Xinyu Yang, Jing Luo, Wenjing Zhu
Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering
Qingyi Si, Yuanxin Liu, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang
Selectively Answering Ambiguous Questions
Jeremy Cole, Michael Zhang, Daniel Gillick, Julian Eisenschlos, Bhuwan Dhingra, Jacob Eisenstein
Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning
Dong-Ho Lee, Kian Ahrabian, Woojeong Jin, Fred Morstatter, Jay Pujara
Knowledge Graph Compression Enhances Diverse Commonsense Generation
EunJeong Hwang, Veronika Thost, Vered Shwartz, Tengfei Ma
Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models
Yiyuan Li, Rakesh Menon, Sayan Ghosh, Shashank Srivastava
LLM-FP4: 4-Bit Floating-Point Quantized Transformers
Shih-yang Liu, Zechun Liu, Xijie Huang, Pingcheng Dong, Kwang-Ting Cheng
Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers
Chen Tang, Shun Wang, Tomas Goldsack, Chenghua Lin
Explanation Selection Using Unlabeled Data for Chain-of-Thought Prompting
Xi Ye, Greg Durrett
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation
David Dale, Elena Voita, Janice Lam, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Loic Barrault, Marta Costa-jussà
Gradient-based Gradual Pruning for Language-Specific Multilingual Neural Machine Translation
Dan He, Minh-Quang Pham, Thanh-Le Ha, Marco Turchi
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
Chenxi Whitehouse, Monojit Choudhury, Alham Aji
Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition
Chenxu Wang, Ping Jian, Mu Huang
VLIS: Unimodal Language Models Guide Multimodal Language Generation
Jiwan Chung, Youngjae Yu
Conceptual structure coheres in human cognition but not in large language models
Siddharth Suresh, Kushin Mukherjee, Xizheng Yu, Wei-Chun Huang, Lisa Padua, Timothy Rogers
Towards LLM-driven Dialogue State Tracking
Yujie Feng, Zexin Lu, Bo Liu, Liming Zhan, Xiao-Ming Wu
Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis
Haoyu Zhang, Yu Wang, Guanghao Yin, Kejun Liu, Yuanyuan Liu, Tianshu Yu
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion
Georgios Pantazopoulos, Malvina Nikandrou, Amit Parekh, Bhathiya Hemanthage, Arash Eshghi, Ioannis Konstas, Verena Rieser, Oliver Lemon, Alessandro Suglia
We’re Afraid Language Models Aren’t Modeling Ambiguity
Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah Smith, Yejin Choi
Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective
Tianyu Liu, Afra Amini, Mrinmaya Sachan, Ryan Cotterell
GEMINI: Controlling The Sentence-Level Summary Style in Abstractive Text Summarization
Guangsheng Bao, Zebin Ou, Yue Zhang
Analyzing Norm Violations in Live-Stream Chat
Jihyung Moon, Dong-Ho Lee, Hyundong Cho, Woojeong Jin, Chan Park, Minwoo Kim, Jonathan May, Jay Pujara, Sungjoon Park
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality
Harman Singh, Pengchuan Zhang, Qifan Wang, Mengjiao Wang, Wenhan Xiong, Jingfei Du, Yu Chen
Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms
Seungju Han, Junhyeok Kim, Jack Hessel, Liwei Jiang, Jiwan Chung, Yejin Son, Yejin Choi, Youngjae Yu
Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus
Tianhang Zhang, Lin Qiu, Qipeng Guo, Cheng Deng, Yue Zhang, Zheng Zhang, Chenghu Zhou, Xinbing Wang, Luoyi Fu
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
Shangbin Feng, Vidhisha Balachandran, Yuyang Bai, Yulia Tsvetkov
Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation
Xuanli He, Qiongkai Xu, Jun Wang, Benjamin Rubinstein, Trevor Cohn
Symbol tuning improves in-context learning in language models
Jerry Wei, Le Hou, Andrew Lampinen, Xiangning Chen, Da Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, Quoc Le
The neural dynamics of word recognition and integration
Jon Gauthier, Roger Levy
Incorporating Worker Perspectives into MTurk Annotation Practices for NLP
Olivia Huang, Eve Fleisig, Dan Klein
Predict the Future from the Past? On the Temporal Data Distribution Shift in Financial Sentiment Classifications
Yue Guo, Chenxi Hu, Yi Yang
Look-back Decoding for Open-Ended Text Generation
Nan Xu, Chunting Zhou, Asli Celikyilmaz, Xuezhe Ma
Large Language Models Can Self-Improve
Jiaxin Huang, Shixiang Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, Jiawei Han
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
Yue Wang, Hung Le, Akhilesh Gotmare, Nghi Bui, Junnan Li, Steven Hoi
Structural generalization in COGS: Supertagging is (almost) all you need
Alban Petit, Caio Corro, François Yvon
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations
Qizhi Pei, Wei Zhang, Jinhua Zhu, Kehan Wu, Kaiyuan Gao, Lijun Wu, Yingce Xia, Rui Yan
SeqXGPT: Sentence-Level AI-Generated Text Detection
Pengyu Wang, Linyang Li, Ke Ren, Botian Jiang, Dong Zhang, Xipeng Qiu
QTSumm: Query-Focused Summarization over Tabular Data
Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Ruizhe Chen, Xiangru Tang, Yumo Xu, Dragomir Radev, Arman Cohan
From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Jiaxin Ge, Sanjay Subramanian, Trevor Darrell, Boyi Li
‘Don’t Get Too Technical with Me’: A Discourse Structure-Based Framework for Automatic Science Journalism
Ronald Cardenas, Bingsheng Yao, Dakuo Wang, Yufang Hou
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following
Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Wang, Kai-Wei Chang
Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models
Jianwei Li, Qi Lei, Wei Cheng, Dongkuan Xu
Clinical Contradiction Detection
Dave Makhervaks, Plia Gillis, Kira Radinsky
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements
Jiacheng Liu, Wenya Wang, Dianzhuo Wang, Noah Smith, Yejin Choi, Hannaneh Hajishirzi
Text-Transport: Toward Learning Causal Effects of Natural Language
Victoria Lin, Louis-Philippe Morency, Eli Ben-Michael
How Does Generative Retrieval Scale to Millions of Passages?
Ronak Pradeep, Kai Hui, Jai Gupta, Adam Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Tran
Unveiling the Implicit Toxicity in Large Language Models
Jiaxin Wen, Pei Ke, Hao Sun, Zhexin Zhang, Chengfei Li, Jinfeng Bai, Minlie Huang
Is ChatGPT a General-Purpose Natural Language Processing Task Solver?
Chengwei Qin, Aston Zhang, Zhuosheng Zhang, Jiaao Chen, Michihiro Yasunaga, Diyi Yang
Length is a Curse and a Blessing for Document-level Semantics
Chenghao Xiao, Yizhi Li, G Hudson, Chenghua Lin, Noura Al Moubayed
ALCUNA: Large Language Models Meet New Knowledge
Xunjian Yin, Baizhou Huang, Xiaojun Wan
Location-Aware Visual Question Generation with Lightweight Models
Nicholas Suwono, Justin Chen, Tun Hung, Ting-Hao Huang, I-Bin Liao, Yung-Hui Li, Lun-Wei Ku, Shao-Hua Sun
MemeCap: A Dataset for Captioning and Interpreting Memes
EunJeong Hwang, Vered Shwartz
Where to start? Analyzing the potential value of intermediate models
Leshem Choshen, Elad Venezian, Shachar Don-Yehiya, Noam Slonim, Yoav Katz
Transcending Scaling Laws with 0.1% Extra Compute
Yi Tay, Jason Wei, Hyung Chung, Vinh Tran, David So, Siamak Shakeri, Xavier Garcia, Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc Le, Mostafa Dehghani
CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation
Minzhi Li, Taiwei Shi, Caleb Ziems, Min-Yen Kan, Nancy Chen, Zhengyuan Liu, Diyi Yang
Optimizing Retrieval-augmented Reader Models via Token Elimination
Moshe Berchansky, Peter Izsak, Avi Caciularu, Ido Dagan, Moshe Wasserblat
WSDMS: Debunk Fake News via Weakly Supervised Detection of Misinforming Sentences with Contextualized Social Wisdom
Ruichao Yang, Wei Gao, Jing Ma, Hongzhan Lin, Zhiwei Yang
Robust Prompt Optimization for Large Language Models Against Distribution Shifts
Moxin Li, Wenjie Wang, Fuli Feng, Yixin Cao, Jizhi Zhang, Tat-Seng Chua
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction
Martin Josifoski, Marija Sakota, Maxime Peyrard, Robert West
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules
Haoran Xu, Weiting Tan, Shuyue Li, Yunmo Chen, Benjamin Van Durme, Philipp Koehn, Kenton Murray
The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment
Jared Fernandez, Jacob Kahn, Clara Na, Yonatan Bisk, Emma Strubell
Evaluating Cross-Domain Text-to-SQL Models and Benchmarks
Mohammadreza Pourreza, Davood Rafiei
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs
Simone Conia, Min Li, Daniel Lee, Umar Minhas, Ihab Ilyas, Yunyao Li
Memory-Based Invariance Learning for Out-of-Domain Text Classification
Chen Jia, Yue Zhang
Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling
Xiuying Wei, Yunchen Zhang, Yuhang Li, Xiangguo Zhang, Ruihao Gong, Jinyang Guo, Xianglong Liu
Three Stream Based Multi-level Event Contrastive Learning for Text-Video Event Extraction
Jiaqi Li, Chuanyi Zhang, Miaozeng Du, Dehai Min, Yongrui Chen, Guilin Qi
Diversify Question Generation with Retrieval-Augmented Style Transfer
Qi Gou, Zehua Xia, Bowen Yu, Haiyang Yu, Fei Huang, Yongbin Li, Nguyen Cam-Tu
Fast and Accurate Factual Inconsistency Detection Over Long Documents
Barrett Lattimer, Patrick Chen, Xinyuan Zhang, Yi Yang
Interpreting Embedding Spaces by Conceptualization
Adi Simhi, Shaul Markovitch
Knowledge-Augmented Language Model Verification
Jinheon Baek, Soyeong Jeong, Minki Kang, Jong Park, Sung Hwang
A Generation-based Deductive Method for Math Word Problems
Yuxuan Hu, Jing Zhang, Haoyang Li, Cuiping Li, Hong Chen
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation
Zeyuan Yang, Peng Li, Yang Liu
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
Ryan Shea, Zhou Yu
Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories
Suyu Ge, Chenyan Xiong, Corby Rosset, Arnold Overwijk, Jiawei Han, Paul Bennett
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks
Po-Nien Kung, Fan Yin, Di Wu, Kai-Wei Chang, Nanyun Peng
Towards Example-Based NMT with Multi-Levenshtein Transformers
Maxime Bouthors, Josep Crego, François Yvon
DUnE: Dataset for Unified Editing
Afra Akyürek, Eric Pan, Garry Kuwanto, Derry Wijaya
“Fifty Shades of Bias”: Normative Ratings of Gender Bias in GPT Generated English Text
Rishav Hada, Agrima Seth, Harshita Diddee, Kalika Bali
Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval
Peitian Zhang, Zheng Liu, Shitao Xiao, Zhicheng Dou, Jing Yao
ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness
Jan Cegin, Jakub Simko, Peter Brusilovsky
Query-as-context Pre-training for Dense Passage Retrieval
Xing W, Guangyuan Ma, Wanhui Qian, Zijia Lin, Songlin Hu
A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding
Andrea Burns, Krishna Srinivasan, Joshua Ainslie, Geoff Brown, Bryan Plummer, Kate Saenko, Jianmo Ni, Mandy Guo
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Zhaoyang Wang, Shaohan Huang, Yuxuan Liu, Jiahai Wang, Minghui Song, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang
OpenAsp: A Benchmark for Multi-document Open Aspect-based Summarization
Shmuel Amar, Liat Schiff, Ori Ernst, Asi Shefer, Ori Shapira, Ido Dagan
Byte Pair Encoding for Symbolic Music
Nathan Fradet, Nicolas Gutowski, Fabien Chhel, Jean-Pierre Briot
Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer Models
Alejo Lopez-Avila, Víctor Suárez-Paniagua
Self-Influence Guided Data Reweighting for Language Model Pre-training
Megh Thakkar, Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, Sarath Chandar, Partha Talukdar
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models
Zorik Gekhman, Jonathan Herzig, Roee Aharoni, Chen Elkind, Idan Szpektor
Tagging-Assisted Generation Model with Encoder and Decoder Supervision for Aspect Sentiment Triplet Extraction
Luo Xianlong, Meng Yang, Yihao Wang
Norm of Word Embedding Encodes Information Gain
Momose Oyama, Sho Yokoi, Hidetoshi Shimodaira
CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular Data
Zhehao Zhang, Xitao Li, Yan Gao, Jian-Guang Lou
Promoting Topic Coherence and Inter-Document Consorts in Multi-Document Summarization via Simplicial Complex and Sheaf Graph
Yash Atri, Arun Iyer, Tanmoy Chakraborty, Vikram Goyal
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Arkil Patel, Satwik Bhattamishra, Siva Reddy, Dzmitry Bahdanau
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency
Eric Zelikman, Wanjing Ma, Jasmine Tran, Diyi Yang, Jason Yeatman, Nick Haber
Counter Turing Test (CT2): AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index (ADI)
Megha Chakraborty, S.M Towhidul Islam Tonmoy, S M Mehedi Zaman, Shreya Gautam, Tanay Kumar, Krish Sharma, Niyar Barman, Chandan Gupta, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das
Revisiting the Optimality of Word Lengths
Tiago Pimentel, Clara Meister, Ethan Wilcox, Kyle Mahowald, Ryan Cotterell
Document-level Relationship Extraction by Bidirectional Constraints of Beta Rules
Yichun Liu, Zizhong Zhu, Xiaowang Zhang, Zhiyong Feng, Daoqi Chen, Yaxin Li
Instructed Language Models with Retrievers Are Powerful Entity Linkers
Zilin Xiao, Ming Gong, Jie Wu, Xingyao Zhang, Linjun Shou, Daxin Jiang
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text
Xiang Li, Jinglu Wang, Xiaohao Xu, Muqiao Yang, Fan Yang, Yizhou Zhao, Rita Singh, Bhiksha Raj
PROSE: A Pronoun Omission Solution for Chinese-English Spoken Language Translation
Ke Wang, Xiutian Zhao, Yanghui Li, Wei Peng
A Diachronic Analysis of Paradigm Shifts in NLP Research: When, How, and Why?
Aniket Pramanick, Yufang Hou, Saif Mohammad, Iryna Gurevych
Does the Correctness of Factual Knowledge Matter for Factual Knowledge-Enhanced Pre-trained Language Models?
Boxi Cao, Qiaoyu Tang, Hongyu Lin, Xianpei Han, Le Sun
Syntactic Substitutability as Unsupervised Dependency Syntax
Jasper Jian, Siva Reddy
MProto: Multi-Prototype Network with Denoised Optimal Transport for Distantly Supervised Named Entity Recognition
Shuhui Wu, Yongliang Shen, Zeqi Tan, Wenqi Ren, Jietian Guo, Shiliang Pu, Weiming Lu
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions
Siru Ouyang, Shuohang Wang, Yang Liu, Ming Zhong, Yizhu Jiao, Dan Iter, Reid Pryzant, Chenguang Zhu, Heng Ji, Jiawei Han
Learning the Visualness of Text Using Large Vision-Language Models
Gaurav Verma, Ryan Rossi, Christopher Tensmeyer, Jiuxiang Gu, Ani Nenkova
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
Hannah Kirk, Andrew Bean, Bertie Vidgen, Paul Rottger, Scott Hale
TempTabQA: Temporal Question Answering for Semi-Structured Tables
Vivek Gupta, Pranshu Kandoi, Mahek Vora, Shuo Zhang, Yujie He, Ridho Reinanda, Vivek Srikumar
Task-Level Thinking Steps Help Large Language Models for Challenging Classification Task
Chunhui Du, Jidong Tian, Haoran Liao, Jindou Chen, Hao He, Yaohui Jin
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
Fengji Zhang, Bei Chen, Yue Zhang, Jacky Keung, Jin Liu, Daoguang Zan, Yi Mao, Jian-Guang Lou, Weizhu Chen
Influence Scores at Scale for Efficient Language Data Sampling
Nikhil Anand, Joshua Tan, Maria Minakova
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
Yang Liu, Dan Iter, Yichong Xu, Shuohang Wang, Ruochen Xu, Chenguang Zhu
Learning Retrieval Augmentation for Personalized Dialogue Generation
Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang
The Troubling Emergence of Hallucination in Large Language Models - An Extensive Definition, Quantification, and Prescriptive Remediations
Vipula Rawte, Swagata Chakraborty, Agnibh Pathak, Anubhav Sarkar, S.M Towhidul Islam Tonmoy, Aman Chadha, Amit Sheth, Amitava Das
NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders
Livio Soares, Daniel Gillick, Jeremy Cole, Tom Kwiatkowski
Analyzing Modular Approaches for Visual Question Decomposition
Apoorv Khandelwal, Ellie Pavlick, Chen Sun
Improving Summarization with Human Edits
Zonghai Yao, Benjamin Schloss, Sai Selvaraj
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Chiyu Zhang, Khai Doan, Qisheng Liao, Muhammad Abdul-Mageed
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology
Odhran O’Donoghue, Aleksandar Shtedritski, John Ginger, Ralph Abboud, Ali Ghareeb, Samuel Rodriques
Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages
Libo Qin, Qiguang Chen, Fuxuan Wei, Shijue Huang, Wanxiang Che
FinGPT: Large Generative Models for a Small Language
Risto Luukkonen, Ville Komulainen, Jouni Luoma, Anni Eskelinen, Jenna Kanerva, Hanna-Mari Kupari, Filip Ginter, Veronika Laippala, Niklas Muennighoff, Aleksandra Piktus, Thomas Wang, Nouamane Tazi, Teven Scao, Thomas Wolf, Osma Suominen, Samuli Sairanen, Mikko Merioksa, Jyrki Heinonen, Aija Vahtola, Samuel Antao, Sampo Pyysalo
Boosting Summarization with Normalizing Flows and Aggressive Training
Yu Yang, Xiaotong Shen
Indicative Summarization of Long Discussions
Shahbaz Syed, Dominik Schwabe, Khalid Khatib, Martin Potthast
A Framework for Vision-Language Warm-up Tasks in Multimodal Dialogue Models
Jaewook Lee, Seongsik Park, Seong-Heum Park, Hongjin Kim, Harksoo Kim
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
Tengxiao Liu, Qipeng Guo, Yuqing Yang, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang
GLEN: General-Purpose Event Detection for Thousands of Types
Sha Li, Qiusi Zhan, Kathryn Conger, Martha Palmer, Heng Ji, Jiawei Han
Hierarchical Pretraining on Multimodal Electronic Health Records
Xiaochen Wang, Junyu Luo, Jiaqi Wang, Ziyi Yin, Suhan Cui, Yuan Zhong, Yaqing Wang, Fenglong Ma
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
Wenyu Guo, Qingkai Fang, Dong Yu, Yang Feng
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models
Xinwei Wu, Junzhuo Li, Minghui Xu, Weilong Dong, Shuangzhi Wu, Chao Bian, Deyi Xiong
Can Language Models Laugh at YouTube Short-form Videos?
Dayoon Ko, Sangho Lee, Gunhee Kim
Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation
Jiaang Li, Quan Wang, Yi Liu, Licheng Zhang, Zhendong Mao
Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation
Zhongjian Miao, Wen Zhang, Jinsong Su, Xiang Li, Jian Luan, Yidong Chen, Bin Wang, Min Zhang
HistAlign: Improving Context Dependency in Language Generation by Aligning with History
David Wan, Shiyue Zhang, Mohit Bansal
CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models
Aitor Ormazabal, Mikel Artetxe, Eneko Agirre
Image Manipulation via Multi-Hop Instructions - A New Dataset and Weakly-Supervised Neuro-Symbolic Approach
Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Mondal, Dinesh Khandelwal, Dinesh Garg, Parag Singla
Generative Spoken Language Model based on continuous word-sized audio tokens
Robin Algayres, Yossi Adi, Tu Nguyen, Jade Copet, Gabriel Synnaeve, Benoît Sagot, Emmanuel Dupoux
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
Ning Ding, Yulin Chen, Bokai Xu, Yujia Qin, Shengding Hu, Zhiyuan Liu, Maosong Sun, Bowen Zhou
Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining
Emanuele Bugliarello, Aida Nematzadeh, Lisa Hendricks
Unsupervised Grammatical Error Correction Rivaling Supervised Methods
Hannan Cao, Liping Yuan, Yuchen Zhang, Hwee Ng
S2abEL: A Dataset for Entity Linking from Scientific Tables
Yuze Lou, Bailey Kuehl, Erin Bransom, Sergey Feldman, Aakanksha Naik, Doug Downey
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
Minghao Li, Yingxiu Zhao, Bowen Yu, Feifan Song, Hangyu Li, Haiyang Yu, Zhoujun Li, Fei Huang, Yongbin Li
Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers
Daniela Teodorescu, Tiffany Cheng, Alona Fyshe, Saif Mohammad
Lion: Adversarial Distillation of Proprietary Large Language Models
Yuxin Jiang, Chunkit Chan, Mingyang Chen, Wei Wang
Evaluating Large Language Models on Controlled Generation Tasks
Jiao Sun, Yufei Tian, Wangchunshu Zhou, Nan Xu, Qian Hu, Rahul Gupta, John Wieting, Nanyun Peng, Xuezhe Ma
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding
Xiao-Yu Guo, Yuan-Fang Li, Reza Haf
Why LLMs Hallucinate, and How to Get (Evidential) Closure: Perceptual, Intensional, and Extensional Learning for Faithful Natural Language Generation
Adam Bouyamourn
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents
Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo
SLOG: A Structural Generalization Benchmark for Semantic Parsing
Bingzhi Li, Lucia Donatelli, Alexander Koller, Tal Linzen, Yuekun Yao, Najoung Kim
Pushdown Layers: Encoding Recursive Structure in Transformer Language Models
Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher Manning
Can LLMs Facilitate Interpretation of Pre-trained Language Models?
Basel Mousi, Nadir Durrani, Fahim Dalvi
Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets
Su Lee, Seokjin Oh, Woohwan Jung
Non-Autoregressive Math Word Problem Solver with Unified Tree Structure
Yi Bin, Mengqun Han, Wenhao Shi, Lei Wang, Yang Yang, See-Kiong Ng, Heng Shen
Improving Chinese Pop Song and Hokkien Gezi Opera Singing Voice Synthesis by Enhancing Local Modeling
Peng Bai, Yue Zhou, Meizhen Zheng, Wujin Sun, Xiaodong Shi
What Else Do I Need to Know? The Effect of Background Information on Users’ Reliance on QA Systems
Navita Goyal, Eleftheria Briakou, Amanda Liu, Connor Baumler, Claire Bonial, Jeffrey Micher, Clare Voss, Marine Carpuat, Hal Daumé III
VIBE: Topic-Driven Temporal Adaptation for Twitter Classification
Yuji Zhang, Jing Li, Wenjie Li
TOD-Flow: Modeling the Structure of Task-Oriented Dialogues
Sungryull Sohn, Yiwei Lyu, Anthony Liu, Lajanugen Logeswaran, Dong-Ki Kim, Dongsub Shim, Honglak Lee
TopWORDS-Poetry: Simultaneous Text Segmentation and Word Discovery for Classical Chinese Poetry via Bayesian Inference
Changzai Pan, Feiyue Li, Ke Deng
Knowledge Rumination for Pre-trained Language Models
Yunzhi Yao, Peng Wang, Shengyu Mao, Chuanqi Tan, Fei Huang, Huajun Chen, Ningyu Zhang
Struct-XLM: A Structure Discovery Multilingual Language Model for Enhancing Cross-lingual Transfer through Reinforcement Learning
Linjuan Wu, Weiming Lu
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification
Yongxin Huang, Kexin Wang, Sourav Dutta, Raj Patel, Goran Glavaš, Iryna Gurevych
Interview Evaluation: A Novel Approach for Automatic Evaluation of Conversational Question Answering Models
Xibo Li, Bowei Zou, Yifan Fan, Yanling Li, Ai Ti Aw, Yu Hong
TCFLE-8: a Corpus of Learner Written Productions for French as a Foreign Language and its Application to Automated Essay Scoring
Rodrigo Wilkens, Alice Pintard, David Alfter, Vincent Folny, Thomas François
Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA
David Heineman, Yao Dou, Mounica Maddela, Wei Xu
Confidence-based Ensembling of Perspective-aware Models
Silvia Casola, Soda Lo, Valerio Basile, Simona Frenda, Alessandra Cignarella, Viviana Patti, Cristina Bosco
ToViLaG: Your Visual-Language Generative Model is Also An Evildoer
Xinpeng Wang, Xiaoyuan Yi, Han Jiang, Shanlin Zhou, Zhihua Wei, Xing Xie
GPT-RE: In-context Learning for Relation Extraction using Large Language Models
Zhen Wan, Fei Cheng, Zhuoyuan Mao, Qianying Liu, Haiyue Song, Jiwei Li, Sadao Kurohashi
Sociocultural Norm Similarities and Differences via Situational Alignment and Explainable Textual Entailment
Sky CH-Wang, Arkadiy Saakyan, Oliver Li, Zhou Yu, Smaranda Muresan
INFORM: Information eNtropy based multi-step reasoning FOR large language Models
Chuyue Zhou, Wangjie You, Juntao Li, Jing Ye, Kehai Chen, Min Zhang
Adaptive Gating in Mixture-of-Experts based Language Models
Jiamin Li, Qiang Su, Yitao Yang, Yimin Jiang, Cong Wang, Hong Xu
On the Automatic Generation and Simplification of Children’s Stories
Maria Valentini, Jennifer Weber, Jesus Salcido, Téa Wright, Eliana Colunga, Katharina von der Wense
The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models
Aviv Slobodkin, Omer Goldman, Avi Caciularu, Ido Dagan, Shauli Ravfogel
Identifying Informational Sources in News Articles
Alexander Spangher, Nanyun Peng, Emilio Ferrara, Jonathan May
Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning
Sapan Shah, Sreedhar Reddy, Pushpak Bhattacharyya
Longtriever: a Pre-trained Long Text Encoder for Dense Document Retrieval
Junhan Yang, Zheng Liu, Chaozhuo Li, Guangzhong Sun, Xing Xie
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Gurusha Juneja, Subhabrata Dutta, Soumen Chakrabarti, Sunny Manchanda, Tanmoy Chakraborty
Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models
James Michaelov, Catherine Arnett, Tyler Chang, Ben Bergen
ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph
Jinhao Jiang, Kun Zhou, Xin Zhao, Yaliang Li, Ji-Rong Wen
Deep Natural Language Feature Learning for Interpretable Prediction
Felipe Urrutia, Cristian Calderon, Valentin Barriere
ROBBIE: Robust Bias Evaluation of Large Generative Language Models
David Esiobu, Xiaoqing Tan, Saghar Hosseini, Megan Ung, Yuchen Zhang, Jude Fernandes, Jane Dwivedi-Yu, Eleonora Presani, Adina Williams, Eric Smith
Enhancing Task-oriented Dialogue Systems with Generative Post-processing Networks
Atsumoto Ohashi, Ryuichiro Higashinaka
Adapting Language Models to Compress Contexts
Alexis Chevalier, Alexander Wettig, Anirudh Ajith, Danqi Chen
Selective Labeling: How to Radically Lower Data-Labeling Costs for Document Extraction Models
Yichao Zhou, James Wendt, Navneet Potti, Jing Xie, Sandeep Tata
TRAVEL: Tag-Aware Conversational FAQ Retrieval via Reinforcement Learning
Yue Chen, Dingnan Jin, Chen Huang, Jia Liu, Wenqiang Lei
Continual Dialogue State Tracking via Example-Guided Question Answering
Hyundong Cho, Andrea Madotto, Zhaojiang Lin, Khyathi Chandu, Satwik Kottur, Jing Xu, Jonathan May, Chinnadhurai Sankar
Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media
Shubham Mittal, Megha Sundriyal, Preslav Nakov
COVID-19 Vaccine Misinformation in Middle Income Countries
Jongin Kim, Byeo Bak, Aditya Agrawal, Jiaxi Wu, Veronika Wirtz, Traci Hong, Derry Wijaya
Contrastive Learning of Sentence Embeddings from Scratch
Junlei Zhang, Zhenzhong Lan, Junxian He
A Rose by Any Other Name would not Smell as Sweet: Social Bias in Names Mistranslation
Sandra Sandoval, Jieyu Zhao, Marine Carpuat, Hal Daumé III
Investigating Efficiently Extending Transformers for Long Input Summarization
Jason Phang, Yao Zhao, Peter Liu
CS2W: A Chinese Spoken-to-Written Style Conversion Dataset with Multiple Conversion Types
Zishan Guo, Linhao Yu, Minghui Xu, Renren Jin, Deyi Xiong
Unifying Cross-Lingual Transfer across Scenarios of Resource Scarcity
Alan Ansell, Marinela Parović, Ivan Vulić, Anna Korhonen, Edoardo Ponti
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation
Giuseppe Attanasio, Flor Plaza del Arco, Debora Nozza, Anne Lauscher
DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining
Weifeng Jiang, Qianren Mao, Chenghua Lin, Jianxin Li, Ting Deng, Weiyi Yang, Zheng Wang
Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation
Da Yin, Xiao Liu, Fan Yin, Ming Zhong, Hritik Bansal, Jiawei Han, Kai-Wei Chang
Language Model is Suitable for Correction of Handwritten Mathematical Expressions Recognition
Zui Chen, Jiaqi Han, Chaofan Yang, Yi Zhou
Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection
Gretel De la Peña Sarracén, Paolo Rosso, Robert Litschko, Goran Glavaš, Simone Ponzetto
SuperDialseg: A Large-scale Dataset for Supervised Dialogue Segmentation
Junfeng Jiang, Chengzhang Dong, Sadao Kurohashi, Akiko Aizawa
ATFormer: A Learned Performance Model with Transfer Learning Across Devices for Deep Learning Tensor Programs
Yang Bai, Wenqian Zhao, Shuo Yin, Zixiao Wang, Bei Yu
mRedditSum: A Multimodal Abstractive Summarization Dataset of Reddit Threads with Images
Keighley Overbay, Jaewoo Ahn, Fatemeh Pesaran zadeh, Joonsuk Park, Gunhee Kim
Sparse Low-rank Adaptation of Pre-trained Language Models
Ning Ding, Xingtai Lv, Qiaosen Wang, Yulin Chen, Bowen Zhou, Zhiyuan Liu, Maosong Sun
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney
Shachar Don-Yehiya, Leshem Choshen, Omri Abend
The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models
Jingyuan Qi, Zhiyang Xu, Ying Shen, Minqian Liu, Di Jin, Qifan Wang, Lifu Huang
Ideology Takes Multiple Looks: A High-Quality Dataset for Multifaceted Ideology Detection
Songtao Liu, Ziling Luo, Minghua Xu, Lixiao Wei, Ziyao Wei, Han Yu, Wei Xiang, Bang Wang
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models
Pierre Colombo, Victor Pellegrain, Malik Boudiaf, Myriam Tami, Victor Storchan, Ismail Ayed, Pablo Piantanida
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja, Harshita Diddee, Rishav Hada, Millicent Ochieng, Krithika Ramesh, Prachi Jain, Akshay Nambi, Tanuja Ganu, Sameer Segal, Mohamed Ahmed, Kalika Bali, Sunayana Sitaram
Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation
Xin Yuan, Jie Guo, Weidong Qiu, Zheng Huang, Shujun Li
Video-Helpful Multimodal Machine Translation
Yihang Li, Shuichiro Shimizu, Chenhui Chu, Sadao Kurohashi, Wei Li
Large Language Models are Temporal and Causal Reasoners for Video Question Answering
Dohwan Ko, Ji Lee, Woo-Young Kang, Byungseok Roh, Hyunwoo Kim
Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation
Yuanyuan Liang, Jianing Wang, Hanlun Zhu, Lei Wang, Weining Qian, Yunshi Lan
TrojanSQL: SQL Injection against Natural Language Interface to Database
Jinchuan Zhang, Yan Zhou, Binyuan Hui, Yaxin Liu, Ziming Li, Songlin Hu
Preserving Privacy Through Dememorization: An Unlearning Technique For Mitigating Memorization Risks In Language Models
Aly Kassem, Omar Mahmoud, Sherif Saad
MingOfficial: A Ming Official Career Dataset and a Historical Context-Aware Representation Learning Framework
You-Jun Chen, Hsin-Yi Hsieh, Yu Lin, Yingtao Tian, Bert Chan, Yu-Sin Liu, Yi-Hsuan Lin, Richard Tsai
DPP-TTS: Diversifying prosodic features of speech via determinantal point processes
Seongho Joo, Hyukhun Koh, Kyomin Jung
Meta-Learning Online Adaptation of Language Models
Nathan Hu, Eric Mitchell, Christopher Manning, Chelsea Finn
Self-Detoxifying Language Models via Toxification Reversal
Chak Leong, Yi Cheng, Jiashuo Wang, Jian Wang, Wenjie Li
Interactive Text Generation
Felix Faltings, Michel Galley, Kianté Brantley, Baolin Peng, Weixin Cai, Yizhe Zhang, Jianfeng Gao, Bill Dolan
NeuSTIP: A Neuro-Symbolic Model for Link and Time Prediction in Temporal Knowledge Graphs
Ishaan Singh, Navdeep Kaur, Garima Gaur, Mausam
Standardizing Distress Analysis: Emotion-Driven Distress Identification and Cause Extraction (DICE) in Multimodal Online Posts
Gopendra Singh, Soumitra Ghosh, Atul Verma, Chetna Painkra, Asif Ekbal
Out-of-Distribution Generalization in Natural Language Processing: Past, Present, and Future
Linyi Yang, Yaoxian Song, Xuan Ren, Chenyang Lyu, Yidong Wang, Jingming Zhuo, Lingqiao Liu, Jindong Wang, Jennifer Foster, Yue Zhang
Can Large Language Models Capture Dissenting Human Voices?
Noah Lee, Na An, James Thorne
DecoMT: Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models
Ratish Puduppully, Anoop Kunchukuttan, Raj Dabre, Ai Ti Aw, Nancy Chen
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning
Hao Zhao, Jie Fu, Zhaofeng He
Towards Building More Robust NER datasets: An Empirical Study on NER Dataset Bias from a Dataset Difficulty View
Ruotian Ma, Xiaolei Wang, Xin Zhou, Qi Zhang, Xuanjing Huang
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training
Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schuetze
Discovering Universal Geometry in Embeddings with ICA
Hiroaki Yamagiwa, Momose Oyama, Hidetoshi Shimodaira
Toward a Critical Toponymy Framework for Named Entity Recognition: A Case Study of Airbnb in New York City
Mikael Brunila, Jack LaViolette, Sky CH-Wang, Priyanka Verma, Clara Féré, Grant McKenzie
Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue
Lang Qin, Yao Zhang, Hongru Liang, Jun Wang, Zhenglu Yang
Merging Generated and Retrieved Knowledge for Open-Domain QA
Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang
Text Fact Transfer
Nishant Balepur, Jie Huang, Kevin Chang
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen, Aston Zhang, Mu Li, Alex Smola, Diyi Yang
Mirages. On Anthropomorphism in Dialogue Systems
Gavin Abercrombie, Amanda Curry, Tanvi Dinkar, Verena Rieser, Zeerak Talat
KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing
Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim
Adaptive Policy with Wait-k Model for Simultaneous Translation
Libo Zhao, Kai Fan, Wei Luo, Wu Jing, Shushu Wang, Ziqian Zeng, Zhongqiang Huang
Cross-Document Event Coreference Resolution on Discourse Structure
Xinyu Chen, Sheng Xu, Peifeng Li, Qiaoming Zhu
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations
Yoonna Jang, Suhyune Son, Jeongwoo Lee, Junyoung Son, Yuna Hur, Jungwoo Lim, Hyeonseok Moon, Kisu Yang, Heuiseok Lim
Can We Edit Factual Knowledge by In-Context Learning?
Ce Zheng, Lei Li, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, Jingjing Xu, Baobao Chang
EDIS: Entity-Driven Image Search over Multimodal Web Content
Siqi Liu, Weixi Feng, Tsu-Jui Fu, Wenhu Chen, William Wang
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Yifan Hou, Jiaoda Li, Yu Fei, Alessandro Stolfo, Wangchunshu Zhou, Guangtao Zeng, Antoine Bosselut, Mrinmaya Sachan
Text encoders bottleneck compositionality in contrastive vision-language models
Amita Kamath, Jack Hessel, Kai-Wei Chang
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition
Sander Schulhoff, Jeremy Pinto, Anaum Khan, Louis-François Bouchard, Chenglei Si, Svetlina Anati, Valen Tagliabue, Anson Kost, Christopher Carnahan, Jordan Boyd-Graber
MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks
Shangjie Li, Xiangpeng Wei, Shaolin Zhu, Jun Xie, Baosong Yang, Deyi Xiong
Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge
Te-Lin Wu, Yu Zhou, Nanyun Peng
Introducing Rhetorical Parallelism Detection: A New Task with Datasets, Metrics, and Baselines
Stephen Bothwell, Justin DeBenedetto, Theresa Crnkovich, Hildegund Muller, David Chiang
Prompting is not a substitute for probability measurements in large language models
Jennifer Hu, Roger Levy
Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings
Josip Jukić, Jan Snajder
CoLT5: Faster Long-Range Transformers with Conditional Computation
Joshua Ainslie, Tao Lei, Michiel de Jong, Santiago Ontanon, Siddhartha Brahma, Yury Zemlyanskiy, David Uthus, Mandy Guo, James Lee-Thorp, Yi Tay, Yun-Hsuan Sung, Sumit Sanghai
DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning
Praveen Venkateswaran, Evelyn Duesterwald, Vatche Isahagian
Cross-Cultural Analysis of Human Values, Morals, and Biases in Folk Tales
Winston Wu, Lu Wang, Rada Mihalcea
Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL
Ruiqi Zhong, Charlie Snell, Dan Klein, Jason Eisner
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers
Theo Olausson, Alex Gu, Ben Lipkin, Cedegao Zhang, Armando Solar-Lezama, Joshua Tenenbaum, Roger Levy
Non-autoregressive Streaming Transformer for Simultaneous Translation
Zhengrui Ma, Shaolei Zhang, Shoutao Guo, Chenze Shao, Min Zhang, Yang Feng
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing
Nam Nguyen, Thang Phan, Duc-Vu Nguyen, Kiet Nguyen
RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction
Shiao Meng, Xuming Hu, Aiwei Liu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen
GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding
Zekun Li, Wenxuan Zhou, Yao-Yi Chiang, Muhao Chen
Cross-Modal Conceptualization in Bottleneck Models
Danis Alukaev, Semen Kiselev, Ilya Pershin, Bulat Ibragimov, Vladimir Ivanov, Alexey Kornaev, Ivan Titov
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
Zhiqiang Hu, Lei Wang, Yihuai Lan, Wanyu Xu, Ee-Peng Lim, Lidong Bing, Xing Xu, Soujanya Poria, Roy Lee
DREAM: Deployment of Recombination and Ensembles in Argument Mining
Florian Ruosch, Cristina Sarasua, Abraham Bernstein
Query Rewriting in Retrieval-Augmented Large Language Models
Xinbei Ma, Yeyun Gong, Pengcheng He, Hai Zhao, Nan Duan
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, Issam Laradji
COHESENTIA: A Novel Benchmark of Incremental versus Holistic Assessment of Coherence in Generated Texts
Aviya Maimon, Reut Tsarfaty
QUDeval: The Evaluation of Questions Under Discussion Discourse Parsing
Yating Wu, Ritika Mangla, Greg Durrett, Junyi Li
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao
Exploring Chain of Thought Style Prompting for Text-to-SQL
Chang-Yu Tai, Ziru Chen, Tianshu Zhang, Xiang Deng, Huan Sun
Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages
Alexandra Butoi, Tim Vieira, Ryan Cotterell, David Chiang
Harnessing Black-Box Control to Boost Commonsense in LM’s Generation
Yufei Tian, Felix Zhang, Nanyun Peng
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process
Zhao Yang, Yuanzhe Zhang, Dianbo Sui, Cao Liu, Jun Zhao, Kang Liu
The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models
Lovisa Hagström, Denitsa Saynova, Tobias Norlund, Moa Johansson, Richard Johansson
ViPE: Visualise Pretty-much Everything
Hassan Shahmohammadi, Adhiraj Ghosh, Hendrik Lensch
Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models
Kaitlyn Zhou, Dan Jurafsky, Tatsunori Hashimoto
Elaborative Simplification as Implicit Questions Under Discussion
Yating Wu, William Sheffield, Kyle Mahowald, Junyi Li
SciRepEval: A Multi-Format Benchmark for Scientific Document Representations
Amanpreet Singh, Mike D’Arcy, Arman Cohan, Doug Downey, Sergey Feldman
A Diachronic Perspective on User Trust in AI under Uncertainty
Shehzaad Dhuliawala, Vilém Zouhar, Mennatallah El-Assady, Mrinmaya Sachan
CT-GAT: Cross-Task Generative Adversarial Attack based on Transferability
Minxuan Lv, Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu
Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling
Hai Yu, Chong Deng, Qinglin Zhang, Jiaqing Liu, Qian Chen, Wen Wang
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents
Hyungjoo Chae, Yongho Song, Kai Ong, Taeyoon Kwon, Minjin Kim, Youngjae Yu, Dongha Lee, Dongyeop Kang, Jinyoung Yeo
Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives
Mario Giulianelli, Sarenne Wallbridge, Raquel Fernández
Generating Commonsense Counterfactuals for Stable Relation Extraction
Xin Miao, Yongqi Li, Tieyun Qian
C-STS: Conditional Semantic Textual Similarity
Ameet Deshpande, Carlos Jimenez, Howard Chen, Vishvak Murahari, Victoria Graf, Tanmay Rajpurohit, Ashwin Kalyan, Danqi Chen, Karthik Narasimhan
Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis
Seraphina Goldfarb-Tarrant, Björn Ross, Adam Lopez
Rumor Detection on Social Media with Crowd Intelligence and ChatGPT-Assisted Networks
Chang Yang, Peng Zhang, Wenbo Qiao, Hui Gao, Jiaming Zhao
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?
Yichi Zhang, Jiayi Pan, Yuchen Zhou, Rui Pan, Joyce Chai
Controllable Contrastive Generation for Multilingual Biomedical Entity Linking
Tiantian Zhu, Yang Qin, Qingcai Chen, Xin Mu, Changlong Yu, Yang Xiang
MediaHG: Rethinking Eye-catchy Features in Social Media Headline Generation
Boning Zhang, Yang Yang
Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata
Silei Xu, Shicheng Liu, Theo Culhane, Elizaveta Pertseva, Meng-Hsi Wu, Sina Semnani, Monica Lam
Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule
Andrey Bout, Alexander Podolskiy, Sergey Nikolenko, Irina Piontkovskaya
The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models
Xinyi Chen, Raquel Fernández, Sandro Pezzelle
RainProof: An Umbrella to Shield Text Generator from Out-Of-Distribution Data
Maxime Darrin, Pablo Piantanida, Pierre Colombo
KEPL: Knowledge Enhanced Prompt Learning for Chinese Hypernym-Hyponym Extraction
Ningchen Ma, Dong Wang, Hongyun Bao, Lei He, Suncong Zheng
Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction
Ji Qi, Chuchun Zhang, Xiaozhi Wang, Kaisheng Zeng, Jifan Yu, Jinxin Liu, Lei Hou, Juanzi Li, Xu Bin
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions
Lucie-Aimée Kaffee, Arnav Arora, Isabelle Augenstein
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding
Sangmin Bae, Jongwoo Ko, Hwanjun Song, Se-Young Yun
End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions
Libo Qin, Wenbo Pan, Qiguang Chen, Lizi Liao, Zhou Yu, Yue Zhang, Wanxiang Che, Min Li
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Ori Yoran, Tomer Wolfson, Ben Bogin, Uri Katz, Daniel Deutch, Jonathan Berant
INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback
Wenda Xu, Danqing Wang, Liangming Pan, Zhenqiao Song, Markus Freitag, William Wang, Lei Li
Multi-level Contrastive Learning for Script-based Character Understanding
Dawei Li, Hengyuan Zhang, Yanran Li, Shiping Yang
CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients
Jaehyung Seo, Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Heuiseok Lim
Automatic Debate Evaluation with Argumentation Semantics and Natural Language Argument Graph Networks
Ramon Ruiz-Dolz, Stella Heras, Ana Garcia
Transfer-Free Data-Efficient Multilingual Slot Labeling
Evgeniia Razumovskaia, Ivan Vulić, Anna Korhonen
Towards Interpretable Mental Health Analysis with Large Language Models
Kailai Yang, Shaoxiong Ji, Tianlin Zhang, Qianqian Xie, Ziyan Kuang, Sophia Ananiadou
Learning to Rank Generation with Pairwise Partial Rewards
Youngwon Lee, Jinu Lee, Seung-won Hwang
GreedyCAS: Unsupervised Scientific Abstract Segmentation with Normalized Mutual Information
Yingqiang Gao, Jessica Lam, Nianlong Gu, Richard Hahnloser
Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue
Aishwarya Padmakumar, Mert Inan, Spandana Gella, Patrick Lange, Dilek Hakkani-Tur
GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree
Zirui Shao, Feiyu Gao, Zhongda Qi, Hangdi Xing, Jiajun Bu, Zhi Yu, Qi Zheng, Xiaozhong Liu
Abstractive Open Information Extraction
Kevin Pei, Ishan Jindal, Kevin Chang
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
Sreyan Ghosh, Manan Suri, Purva Chiniya, Utkarsh Tyagi, Sonal Kumar, Dinesh Manocha
CLEME: Debiasing Multi-reference Evaluation for Grammatical Error Correction
Jingheng Ye, Yinghui Li, Qingyu Zhou, Yangning Li, Shirong Ma, Hai-Tao Zheng, Ying Shen
SentiStream: A Co-Training Framework for Adaptive Online Sentiment Analysis in Evolving Data Streams
Yuhao Wu, Karthick Sharma, Chun Seah, Shuhao Zhang
HyperNetwork-based Decoupling to Improve Model Generalization for Few-Shot Relation Extraction
Liang Zhang, Chulun Zhou, Fandong Meng, Jinsong Su, Yidong Chen, Jie Zhou
Solving Hard Analogy Questions with Relation Embedding Chains
Nitesh Kumar, Steven Schockaert
Modeling Empathic Similarity in Personal Narratives
Jocelyn Shen, Maarten Sap, Pedro Colon-Hernandez, Hae Park, Cynthia Breazeal
Tree Prompting: Efficient Task Adaptation without Fine-Tuning
Chandan Singh, John Morris, Alexander Rush, Jianfeng Gao, Yuntian Deng
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data
Canwen Xu, Daya Guo, Nan Duan, Julian McAuley
Empathy Intent Drives Empathy Detection
Liting Jiang, Di Wu, Bohui Mao, Yanbing Li, Wushour Slamu
Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot Filling
Yuanjun Shi, Linzhi Wu, Minglai Shao
ReTAG: Reasoning Aware Table to Analytic Text Generation
Deepanway Ghosal, Preksha Nema, Aravindan Raghuveer
Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators
Liang Chen, Yang Deng, Yatao Bian, Zeyu Qin, Bingzhe Wu, Tat-Seng Chua, Kam-Fai Wong
Compressing Context to Enhance Inference Efficiency of Large Language Models
Yucheng Li, Bo Dong, Frank Guerin, Chenghua Lin
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
Xiaonan Li, Xipeng Qiu
Can You Follow Me? Testing Situational Understanding for ChatGPT
Chenghao Yang, Allyson Ettinger
Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4
Kellin Pelrine, Anne Imouza, Camille Thibault, Meilina Reksoprodjo, Caleb Gupta, Joel Christoph, Jean-François Godbout, Reihaneh Rabbany
Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation
Bashar Alhafni, Go Inoue, Christian Khairallah, Nizar Habash
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models
Junyi Li, Xiaoxue Cheng, Xin Zhao, Jian-Yun Nie, Ji-Rong Wen
Enabling Large Language Models to Generate Text with Citations
Tianyu Gao, Howard Yen, Jiatong Yu, Danqi Chen
Revisiting Machine Translation for Cross-lingual Classification
Mikel Artetxe, Vedanuj Goswami, Shruti Bhosale, Angela Fan, Luke Zettlemoyer
Counting the Bugs in ChatGPT’s Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model
Leonie Weissweiler, Valentin Hofmann, Anjali Kantharuban, Anna Cai, Ritam Dutt, Amey Hengle, Anubha Kabra, Atharva Kulkarni, Abhishek Vijayakumar, Haofei Yu, Hinrich Schuetze, Kemal Oflazer, David Mortensen
Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning
Quanyu Long, Wenya Wang, Sinno Pan
Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems
Tianyuan Shi, Liangzhi Li, Zijian Lin, Tao Yang, Xiaojun Quan, Qifan Wang
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models
Deepak Nathani, David Wang, Liangming Pan, William Wang
Granularity Matters: Pathological Graph-driven Cross-modal Alignment for Brain CT Report Generation
Yanzhao Shi, Junzhong Ji, Xiaodan Zhang, Liangqiong Qu, Ying Liu
Enhancing Structured Evidence Extraction for Fact Verification
Zirui Wu, Nan Hu, Yansong Feng
Rethinking Model Selection and Decoding for Keyphrase Generation with Pre-trained Sequence-to-Sequence Models
Di Wu, Wasi Ahmad, Kai-Wei Chang
A Fair and In-Depth Evaluation of Existing End-to-End Entity Linking Systems
Hannah Bast, Matthias Hertel, Natalie Prange
A Multi-Task Dataset for Assessing Discourse Coherence in Chinese Essays: Structure, Theme, and Logic Analysis
Hongyi Wu, Xinshu Shen, Man Lan, Shaoguang Mao, Xiaopeng Bai, Yuanbin Wu
SKD-NER: Continual Named Entity Recognition via Span-based Knowledge Distillation with Reinforcement Learning
Yi Chen, Liang He
Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation
Chengwei Qin, Chen Chen, Shafiq Joty
When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks
Eve Fleisig, Rediet Abebe, Dan Klein
Lazy-k Decoding: Constrained Decoding for Information Extraction
Arthur Hemmer, Mickael Coustaty, Nicola Bartolo, Jerome Brachat, Jean-Marc Ogier
Personalized Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
Hailin Chen, Amrita Saha, Steven Hoi, Shafiq Joty
Do Language Models Have a Common Sense regarding Time? Revisiting Temporal Commonsense Reasoning in the Era of Large Language Models
Raghav Jain, Daivik Sojitra, Arkadeep Acharya, Sriparna Saha, Adam Jatowt, Sandipan Dandapat
Comparing Styles across Languages
Shreya Havaldar, Matthew Pressimone, Eric Wong, Lyle Ungar
Event Causality Extraction via Implicit Cause-Effect Interactions
Jintao Liu, Zequn Zhang, Kaiwen Wei, Zhi Guo, Xian Sun, Li Jin, Xiaoyu Li
Evaluation of African American Language Bias in Natural Language Generation
Nicholas Deas, Jessica Grieser, Shana Kleiner, Desmond Patton, Elsbeth Turcan, Kathleen McKeown
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems
Songbo Hu, Han Zhou, Moy Yuan, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Anna Korhonen, Ivan Vulić
Cognate Transformer for Automated Phonological Reconstruction and Cognate Reflex Prediction
V.S.D.S. Mahesh Akavarapu, Arnab Bhattacharya
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
Ximing Lu, Faeze Brahman, Peter West, Jaehun Jung, Khyathi Chandu, Abhilasha Ravichander, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath, Nouha Dziri, Jillian Fisher, Bill Lin, Skyler Hallinan, Lianhui Qin, Xiang Ren, Sean Welleck, Yejin Choi
Weakly Supervised Semantic Parsing with Execution-based Spurious Program Filtering
Kang-il Lee, Segwang Kim, Kyomin Jung
Taxonomy Expansion for Named Entity Recognition
Karthikeyan K, Yogarshi Vyas, Jie Ma, Giovanni Paolini, Neha John, Shuai Wang, Yassine Benajiba, Vittorio Castelli, Dan Roth, Miguel Ballesteros
Rather a Nurse than a Physician - Contrastive Explanations under Investigation
Oliver Eberle, Ilias Chalkidis, Laura Cabello, Stephanie Brandl
An Investigation of LLMs’ Inefficacy in Understanding Converse Relations
Chengwen Qi, Bowen Li, Binyuan Hui, Bailin Wang, Jinyang Li, Jinwang Wu, Yuanjun Laili
Towards Low-Resource Automatic Program Repair with Meta-Learning and Pretrained Language Models
Weishi Wang, Yue Wang, Steven Hoi, Shafiq Joty
ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters
Vipul Rathore, Rajdeep Dhingra, Parag Singla, Mausam
Log-FGAER: Logic-Guided Fine-Grained Address Entity Recognition from Multi-Turn Spoken Dialogue
Xue Han, Yitong Wang, Qian Hu, Pengwei Hu, Chao Deng, Junlan Feng
Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning
Sarkar Snigdha Sarathi Das, Haoran Zhang, Peng Shi, Wenpeng Yin, Rui Zhang
On the Representational Capacity of Recurrent Neural Language Models
Franz Nowak, Anej Svete, Li Du, Ryan Cotterell
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan
Benchmarking and Improving Text-to-SQL Generation under Ambiguity
Adithya Bhaskar, Tushar Tomar, Ashutosh Sathe, Sunita Sarawagi
Non-autoregressive Text Editing with Copy-aware Latent Alignments
Yu Zhang, Yue Zhang, Leyang Cui, Guohong Fu
Translating away Translationese without Parallel Data
Rricha Jalota, Koel Chowdhury, Cristina España-Bonet, Josef van Genabith
CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding
Yixiao Ma, Yueyue Wu, Weihang Su, Qingyao Ai, Yiqun Liu
HiddenTables and PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies
William Watson, Nicole Cho, Tucker Balch, Manuela Veloso
Causal Document-Grounded Dialogue Pre-training
Yingxiu Zhao, Bowen Yu, Bowen Li, Haiyang Yu, Jinyang Li, Chao Wang, Fei Huang, Yongbin Li, Nevin Zhang
Accented Speech Recognition With Accent-specific Codebooks
Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni
Linking Surface Facts to Large-Scale Knowledge Graphs
Gorjan Radevski, Kiril Gashteovski, Chia-Chien Hung, Carolin Lawrence, Goran Glavaš
Sentiment Analysis on Streaming User Reviews via Dual-Channel Dynamic Graph Neural Network
Xin Zhang, Linhai Zhang, Deyu Zhou
DUMB: A Dutch Model Benchmark
Wietse de Vries, Martijn Wieling, Malvina Nissim
OssCSE: Overcoming Surface Structure Bias in Contrastive Learning for Unsupervised Sentence Embedding
Zhan Shi, Guoyin Wang, Ke Bai, Jiwei Li, Xiang Li, Qingjun Cui, Belinda Zeng, Trishul Chilimbi, Xiaodan Zhu
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Juan Pablo Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico
A Fine-Grained Taxonomy of Replies to Hate Speech
Xinchen Yu, Ashley Zhao, Eduardo Blanco, Lingzi Hong
JointMatch: A Unified Approach for Diverse and Collaborative Pseudo-Labeling to Semi-Supervised Text Classification
Henry Zou, Cornelia Caragea
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Kent Chang, Mackenzie Cramer, Sandeep Soni, David Bamman
CiteBench: A Benchmark for Scientific Citation Text Generation
Martin Funkquist, Ilia Kuznetsov, Yufang Hou, Iryna Gurevych
From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning
Zheyuan Zhang, Shane Storks, Fengyuan Hu, Sungryull Sohn, Moontae Lee, Honglak Lee, Joyce Chai
A Challenging Multimodal Video Summary: Simultaneously Extracting and Generating Keyframe-Caption Pairs from Video
Keito Kudo, Haruki Nagasawa, Jun Suzuki, Nobuyuki Shimizu
Effects of sub-word segmentation on performance of transformer language models
Jue Hou, Anisia Katinskaia, Anh-Duc Vu, Roman Yangarber
Symbolic Planning and Code Generation for Grounded Dialogue
Justin Chiu, Wenting Zhao, Derek Chen, Saujas Vaduguru, Alexander Rush, Daniel Fried
Universal Self-Adaptive Prompting
Xingchen Wan, Ruoxi Sun, Hootan Nakhost, Hanjun Dai, Julian Eisenschlos, Sercan Arik, Tomas Pfister
Content- and Topology-Aware Representation Learning for Scientific Multi-Literature
Kai Zhang, Kaisong Song, Yangyang Kang, Xiaozhong Liu
Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks
Zhaohui Yan, Songlin Yang, Wei Liu, Kewei Tu
Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models
Daman Arora, Himanshu Singh, Mausam
StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure
Mattia Opper, Victor Prokhorov, Siddharth N
WiCE: Real-World Entailment for Claims in Wikipedia
Ryo Kamoi, Tanya Goyal, Juan Rodriguez, Greg Durrett
Natural Disaster Tweets Classification Using Multimodal Data
Mohammad Basit, Bashir Alam, Zubaida Fatima, Salman Shaikh
On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research
Luiza Pozzobon, Beyza Ermis, Patrick Lewis, Sara Hooker
RoBoCoP: A Comprehensive ROmance BOrrowing COgnate Package and Benchmark for Multilingual Cognate Identification
Liviu Dinu, Ana Uban, Alina Cristea, Anca Dinu, Ioan-Bogdan Iordache, Simona Georgescu, Laurentiu Zoicas
Instructive Dialogue Summarization with Query Aggregations
Bin Wang, Zhengyuan Liu, Nancy Chen
Semantic matching for text classification with complex class descriptions
Brian De Silva, Kuan-Wen Huang, Gwang Lee, Karen Hovsepian, Yan Xu, Mingwei Shen
MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation
Jia-Chen Gu, Chao-Hong Tan, Caiyuan Chu, Zhen-Hua Ling, Chongyang Tao, Quan Liu, Cong Liu
GLEN: Generative Retrieval via Lexical Index Learning
Sunkyung Lee, Minjin Choi, Jongwuk Lee
Turn-Level Active Learning for Dialogue State Tracking
Zihan Zhang, Meng Fang, Fanghua Ye, Ling Chen, Mohammad-Reza Namazi-Rad
ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue
Haoqin Tu, Yitong Li, Fei Mi, Zhongliang Yang
Modeling Conceptual Attribute Likeness and Domain Inconsistency for Metaphor Detection
Yuan Tian, Nan Xu, Wenji Mao, Daniel Zeng
Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network
Ziling Huang, Shin’ichi Satoh
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables
Xinyuan Lu, Liangming Pan, Qian Liu, Preslav Nakov, Min-Yen Kan
Training Simultaneous Speech Translation with Robust and Random Wait-k-Tokens Strategy
Linlin Zhang, Kai Fan, Jiajun Bu, Zhongqiang Huang
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples
Deqing Fu, Ameya Godbole, Robin Jia
Task-Agnostic Low-Rank Adapters for Unseen English Dialects
Zedian Xiao, William Held, Yanchen Liu, Diyi Yang
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization
Tianshi Che, Ji Liu, Yang Zhou, Jiaxiang Ren, Jiwen Zhou, Victor Sheng, Huaiyu Dai, Dejing Dou
TheoremQA: A Theorem-driven Question Answering Dataset
Wenhu Chen, Ming Yin, Max Ku, Pan Lu, Yixin Wan, Xueguang Ma, Jianyu Xu, Xinyi Wang, Tony Xia
Scalable-DSC: A Structural Template Prompt Approach to Scalable Dialogue State Correction
Haoxiang Su, Hongyan Xie, Hao Huang, Shuangyong Song, Ruiyu Fang, Xiaomeng Huang, Sijie Feng
Don’t Trust ChatGPT when your Question is not in English: A Study of Multilingual Abilities and Types of LLMs
Xiang Zhang, Senyu Li, Bradley Hauer, Ning Shi, Grzegorz Kondrak
Empirical Study of Zero-Shot NER with ChatGPT
Tingyu Xie, Qi Li, Jian Zhang, Yan Zhang, Zuozhu Liu, Hongwei Wang
Automatic Prompt Optimization with “Gradient Descent” and Beam Search
Reid Pryzant, Dan Iter, Jerry Li, Yin Lee, Chenguang Zhu, Michael Zeng
Active Retrieval Augmented Generation
Zhengbao Jiang, Frank Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig
Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation
Chenxu Yang, Zheng Lin, Lanrui Wang, Chong Tian, Liang Pang, Jiangnan Li, Qirong Ho, Yanan Cao, Weiping Wang
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs
Tomas Goldsack, Zhihao Zhang, Chen Tang, Carolina Scarton, Chenghua Lin
A Diffusion Weighted Graph Framework for New Intent Discovery
Wenkai Shi, Wenbin An, Feng Tian, Qinghua Zheng, QianYing Wang, Ping Chen
A Self-enhancement Multitask Framework for Unsupervised Aspect Category Detection
Thi-Nhung Nguyen, Hoang Ngo, Kiem-Hieu Nguyen, Tuan-Dung Cao
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, Ming Gao, Baoyuan Wang
Recurrent Neural Language Models as Probabilistic Finite-state Automata
Anej Svete, Ryan Cotterell
Revisiting Source Context in Nearest Neighbor Machine Translation
Xuanhong Li, Peng Li, Po Hu
Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization
Cennet Oguz, Pascal Denis, Emmanuel Vincent, Simon Ostermann, Josef van Genabith
Background Summarization of Event Timelines
Adithya Pratapa, Kevin Small, Markus Dreyer
Superlim: A Swedish Language Understanding Evaluation Benchmark
Aleksandrs Berdicevskis, Gerlof Bouma, Robin Kurtz, Felix Morger, Joey Öhman, Yvonne Adesam, Lars Borin, Dana Dannélls, Markus Forsberg, Tim Isbister, Anna Lindahl, Martin Malmsten, Faton Rekathati, Magnus Sahlgren, Elena Volodina, Love Börjeson, Simon Hengchen, Nina Tahmasebi
Reasoning with Language Model is Planning with World Model
Shibo Hao, Yi Gu, Haodi Ma, Joshua Hong, Zhen Wang, Daisy Wang, Zhiting Hu
LLM-enhanced Self-training for Cross-domain Constituency Parsing
Jianling Li, Meishan Zhang, Peiming Guo, Min Zhang, Yue Zhang
Continual Named Entity Recognition without Catastrophic Forgetting
Duzhen Zhang, Wei Cong, Jiahua Dong, Yahan Yu, Xiuyi Chen, Yonggang Zhang, Zhen Fang
DSI++: Updating Transformer Memory with New Documents
Sanket Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Tran, Jinfeng Rao, Marc Najork, Emma Strubell, Donald Metzler
Editing Common Sense in Transformers
Anshita Gupta, Debanjan Mondal, Akshay Sheshadri, Wenlong Zhao, Xiang Li, Sarah Wiegreffe, Niket Tandon
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation
Tianqi Zhong, Quan Wang, Jingxuan Han, Yongdong Zhang, Zhendong Mao
Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers
Hosein Mohebbi, Grzegorz Chrupała, Willem Zuidema, Afra Alishahi
Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System
Weizhou Shen, Yingqi Gao, Canbin Huang, Fanqi Wan, Xiaojun Quan, Wei Bi
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions
Wenhao Yu, Meng Jiang, Peter Clark, Ashish Sabharwal
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances
Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad, Jun Wang
Memorisation Cartography: Mapping out the Memorisation-Generalisation Continuum in Neural Machine Translation
Verna Dankers, Ivan Titov, Dieuwke Hupkes
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4
Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh, Fei Liu
Gender Biases in Automatic Evaluation Metrics for Image Captioning
Haoyi Qiu, Zi-Yi Dou, Tianlu Wang, Asli Celikyilmaz, Nanyun Peng
QA-NatVer: Question Answering for Natural Logic-based Fact Verification
Rami Aly, Marek Strong, Andreas Vlachos
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy
Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal
Generating Data for Symbolic Language with Large Language Models
Jiacheng Ye, Chengzu Li, Lingpeng Kong, Tao Yu
IDTraffickers: An Authorship Attribution Dataset to link and connect Potential Human-Trafficking Operations on Text Escort Advertisements
Vageesh Saxena, Benjamin Ashpole, Gijs van Dijck, Gerasimos Spanakis
Evaluating Bias and Fairness in Gender-Neutral Pretrained Vision-and-Language Models
Laura Cabello, Emanuele Bugliarello, Stephanie Brandl, Desmond Elliott
Improving Dialogue Discourse Parsing via Reply-to Structures of Addressee Recognition
Yaxin Fan, Feng Jiang, Peifeng Li, Fang Kong, Qiaoming Zhu
Improving Language Models’ Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary
Myeongjun Jang, Thomas Lukasiewicz
DALE: Generative Data Augmentation for Low-Resource Legal NLP
Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, S Ramaneswaran, S Sakshi, Utkarsh Tyagi, Dinesh Manocha
FedID: Federated Interactive Distillation for Large-Scale Pretraining Language Models
Xinge Ma, Jiangming Liu, Jin Wang, Xuejie Zhang
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
Alexander Havrilla, Maksym Zhuravinskyi, Duy Phung, Aman Tiwari, Jonathan Tow, Stella Biderman, Quentin Anthony, Louis Castricato
This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models
Iker García-Ferrero, Begoña Altuna, Javier Alvez, Itziar Gonzalez-Dios, German Rigau
MT2: Towards a Multi-Task Machine Translation Model with Translation-Specific In-Context Learning
Chunyou Li, Mingtong Liu, Hongxiao Zhang, Yufeng Chen, Jinan Xu, Ming Zhou
CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset
Susanna Rücker, Alan Akbik
Disentangling Transformer Language Models as Superposed Topic Models
Jia Lim, Hady Lauw
Conversational Semantic Parsing using Dynamic Context Graphs
Parag Jain, Mirella Lapata
Not all quantifiers are equal: Probing Transformer-based language models’ understanding of generalised quantifiers
Tharindu Madusanka, Iqra Zahid, Hao Li, Ian Pratt-Hartmann, Riza Batista-Navarro
Structure-aware Knowledge Graph-to-text Generation with Planning Selection and Similarity Distinction
Feng Zhao, Hongzhi Zou, Cheng Yan
Regulation and NLP (RegNLP): Taming Large Language Models
Catalina Goanta, Nikolaos Aletras, Ilias Chalkidis, Sofia Ranchordás, Gerasimos Spanakis
MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation
Zexue He, Yu Wang, An Yan, Yao Liu, Eric Chang, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu
Seeing through the mess: evolutionary dynamics of lexical polysemy
Andreas Baumann, Andreas Stephan, Benjamin Roth
Are Embedded Potatoes Still Vegetables? On the Limitations of WordNet Embeddings for Lexical Semantics
Xuyou Cheng, Michael Schlichtkrull, Guy Emerson
Event-Location Tracking in Narratives: A Case Study on Holocaust Testimonies
Eitan Wagner, Renana Keydar, Omri Abend
Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources
Yerin Hwang, Yongil Kim, Hyunkyung Bae, Hwanhee Lee, Jeesoo Bang, Kyomin Jung
Learning to Predict Task Transferability via Soft Prompt
Lingyun Feng
Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering
Wang Zhu, Jesse Thomason, Robin Jia
Mirror: A Universal Framework for Various Information Extraction Tasks
Tong Zhu, Junfei Ren, Zijian Yu, Mengsong Wu, Guoliang Zhang, Xiaoye Qu, Wenliang Chen, Zhefeng Wang, Baoxing Huai, Min Zhang
“Mistakes Help Us Grow”: Facilitating and Evaluating Growth Mindset Supportive Language in Classrooms
Kunal Handa, Margarett Clapper, Jessica Boyle, Rose Wang, Diyi Yang, David Yeager, Dorottya Demszky
Detecting and Mitigating Hallucinations in Multilingual Summarisation
Yifu Qiu, Yftah Ziser, Anna Korhonen, Edoardo Ponti, Shay Cohen
AMR Parsing with Causal Hierarchical Attention and Pointers
Chao Lou, Kewei Tu
Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot Text Classification Tasks
Haoqi Zheng, Qihuang Zhong, Liang Ding, Zhiliang Tian, Xin Niu, Changjian Wang, Dongsheng Li, Dacheng Tao
IC3: Image Captioning by Committee Consensus
David Chan, Austin Myers, Sudheendra Vijayanarasimhan, David Ross, John Canny
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul, Adian Liusie, Mark Gales
Fair Without Leveling Down: A New Intersectional Fairness Definition
Gaurav Maheshwari, Aurélien Bellet, Pascal Denis, Mikaela Keller
M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis
Fei Zhao, Chunhui Li, Zhen Wu, Yawen Ouyang, Jianbing Zhang, Xinyu Dai
Detection of Multiple Mental Disorders from Social Media with Two-Stream Psychiatric Experts
Siyuan Chen, Zhiling Zhang, Mengyue Wu, Kenny Zhu
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami, Katerina Margatina, Nikolaos Aletras
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
Hanlin Tang, Yifu Sun, Decheng Wu, Kai Liu, Jianchen Zhu, Zhanhui Kang
Polar Ducks and Where to Find Them: Enhancing Entity Linking with Duck Typing and Polar Box Embeddings
Mattia Atzeni, Mikhail Plekhanov, Frederic Dreyer, Nora Kassner, Simone Merello, Louis Martin, Nicola Cancedda
APrompt: Attention Prompt Tuning for Efficient Adaptation of Pre-trained Language Models
Qifan Wang, Yuning Mao, Jingang Wang, Hanchao Yu, Shaoliang Nie, Sinong Wang, Fuli Feng, Lifu Huang, Xiaojun Quan, Zenglin Xu, Dongfang Liu
What’s “up” with vision-language models? Investigating their struggle with spatial reasoning
Amita Kamath, Jack Hessel, Kai-Wei Chang
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models
Xiaoyue Wang, Xin Liu, Lijie Wang, Yaoxiang Wang, Jinsong Su, Hua Wu
Learning Preference Model for LLMs via Automatic Preference Data Generation
Shijia Huang, Jianqiao Zhao, Yanyang Li, Liwei Wang
Causal Reasoning through Two Cognition Layers for Improving Generalization in Visual Question Answering
Trang Nguyen, Naoaki Okazaki
StructGPT: A General Framework for Large Language Model to Reason over Structured Data
Jinhao Jiang, Kun Zhou, Zican Dong, Keming Ye, Xin Zhao, Ji-Rong Wen
Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement
Rosamond Thalken, Edward Stiglitz, David Mimno, Matthew Wilkens
Model-tuning Via Prompts Makes NLP Models Adversarially Robust
Mrigank Raman, Pratyush Maini, J Kolter, Zachary Lipton, Danish Pruthi
Learning Co-Speech Gesture for Multimodal Aphasia Type Detection
Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, Jinyoung Han
STINMatch: Semi-Supervised Semantic-Topological Iteration Network for Financial Risk Detection via News Label Diffusion
Xurui Li, Yue Qin, Rui Zhu, Tianqianjin Lin, Yongming Fan, Yangyang Kang, Kaisong Song, Fubang Zhao, Changlong Sun, Haixu Tang, Xiaozhong Liu
Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection
Vyoma Raman, Eve Fleisig, Dan Klein
Describe Me an Auklet: Generating Grounded Perceptual Category Descriptions
Bill Noble, Nikolai Ilinykh
ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization
Xiutian Zhao, Ke Wang, Wei Peng
On the Benefits of Learning to Route in Mixture-of-Experts Models
Nishanth Dikkala, Nikhil Ghosh, Raghu Meka, Rina Panigrahy, Nikhil Vyas, Xin Wang
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Elizabeth Clark, Shruti Rijhwani, Sebastian Gehrmann, Joshua Maynez, Roee Aharoni, Vitaly Nikolaev, Thibault Sellam, Aditya Siddhant, Dipanjan Das, Ankur Parikh
We Need to Talk About Reproducibility in NLP Model Comparison
Yan Xue, Xuefei Cao, Xingli Yang, Yu Wang, Ruibo Wang, Jihong Li
Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
Fanqi Wan, Xinting Huang, Tao Yang, Xiaojun Quan, Wei Bi, Shuming Shi
Just Adjust One Prompt: Enhancing In-Context Dialogue Scoring via Constructing the Optimal Subgraph of Demonstrations and Prompts
Jiashu Pu, Ling Cheng, Lu Fan, Tangjie Lv, Rongsheng Zhang
Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers
Dmitry Nikolaev, Tanise Ceron, Sebastian Padó
ART: rule bAsed futuRe-inference deducTion
Mengze Li, Tianqi Zhao, Bai Jionghao, Baoyi He, Jiaxu Miao, Wei Ji, Zheqi Lv, Zhou Zhao, Shengyu Zhang, Wenqiao Zhang, Fei Wu
EpiK-Eval: Evaluation for Language Models as Epistemic Models
Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar
From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification
Shanshan Xu, Santosh T.Y.S.S, Oana Ichim, Isabella Risini, Barbara Plank, Matthias Grabmair
On Bilingual Lexicon Induction with Large Language Models
Yaoyiran Li, Anna Korhonen, Ivan Vulić
Statistical Depth for Ranking and Characterizing Transformer-Based Text Embeddings
Parker Seegmiller, Sarah Preum
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
Kaiyan Zhang, Ning Ding, Biqing Qi, Xuekai Zhu, Xinwei Long, Bowen Zhou
From Multilingual Complexity to Emotional Clarity: Leveraging Commonsense to Unveil Emotions in Code-Mixed Dialogues
Shivani Kumar, Ramaneswaran S, Md Akhtar, Tanmoy Chakraborty
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization
Philippe Laban, Wojciech Kryscinski, Divyansh Agarwal, Alexander Fabbri, Caiming Xiong, Shafiq Joty, Chien-Sheng Wu
DIVE: Towards Descriptive and Diverse Visual Commonsense Generation
Jun-Hyung Park, Hyuntae Park, Youjin Kang, Eojin Jeon, SangKeun Lee
Towards Conceptualization of “Fair Explanation”: Disparate Impacts of anti-Asian Hate Speech Explanations on Content Moderators
Tin Nguyen, Jiannan Xu, Aayushi Roy, Hal Daumé III, Marine Carpuat
Bridging Background Knowledge Gaps in Translation with Automatic Explicitation
HyoJung Han, Jordan Boyd-Graber, Marine Carpuat
A Quality-based Syntactic Template Retriever for Syntactically-Controlled Paraphrase Generation
Xue Zhang, Songming Zhang, Yunlong Liang, Yufeng Chen, Jian Liu, Wenjuan Han, Jinan Xu
Beyond Shared Vocabulary: Increasing Representational Word Similarities across Languages for Multilingual Machine Translation
Di Wu, Christof Monz
Quantifying the redundancy between prosody and text
Lukas Wolf, Tiago Pimentel, Evelina Fedorenko, Ryan Cotterell, Alex Warstadt, Ethan Wilcox, Tamar Regev
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
Mete Ismayilzada, Debjit Paul, Syrielle Montariol, Mor Geva, Antoine Bosselut
A Video Is Worth 4096 Tokens: Verbalize Story Videos To Understand Them In Zero Shot
Aanisha Bhattacharyya, Yaman Singla, Balaji Krishnamurthy, Rajiv Shah, Changyou Chen
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Lean Wang, Lei Li, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun
Active Learning for Natural Language Generation
Yotam Perlitz, Ariel Gera, Michal Shmueli-Scheuer, Dafna Sheinwald, Noam Slonim, Liat Ein-Dor
Re³Dial: Retrieve, Reorganize and Rescale Conversations for Long-Turn Open-Domain Dialogue Pre-training
Jiaxin Wen, Hao Zhou, Jian Guan, Jie Zhou, Minlie Huang
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models
Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Jungo Kasai, David Mortensen, Noah Smith, Yulia Tsvetkov
Characterizing Mechanisms for Factual Recall in Language Models
Qinan Yu, Jack Merullo, Ellie Pavlick
MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark
Dominik Macko, Robert Moro, Adaku Uchendu, Jason Lucas, Michiharu Yamashita, Matúš Pikuliak, Ivan Srba, Thai Le, Dongwon Lee, Jakub Simko, Maria Bielikova
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Cheng Zhang, Jianyi Cheng, Ilia Shumailov, George Constantinides, Yiren Zhao
Reducing Sequence Length by Predicting Edit Spans with Large Language Models
Masahiro Kaneko, Naoaki Okazaki
Instruct and Extract: Instruction Tuning for On-Demand Information Extraction
Yizhu Jiao, Ming Zhong, Sha Li, Ruining Zhao, Siru Ouyang, Heng Ji, Jiawei Han
Rethinking the Evaluation for Conversational Recommendation in the Era of Large Language Models
Xiaolei Wang, Xinyu Tang, Xin Zhao, Jingyuan Wang, Ji-Rong Wen
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad, Swarnadeep Saha, Xiang Zhou, Mohit Bansal
Expand, Highlight, Generate: RL-driven Document Generation for Passage Reranking
Arian Askari, Mohammad Aliannejadi, Chuan Meng, Evangelos Kanoulas, Suzan Verberne
Make Every Example Count: On the Stability and Utility of Self-Influence for Learning from Noisy NLP Datasets
Irina Bejan, Artem Sokolov, Katja Filippova
Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews
Hye Yun, Iain Marshall, Thomas Trikalinos, Byron Wallace
PromptST: Abstract Prompt Learning for End-to-End Speech Translation
Tengfei Yu, Liang Ding, Xuebo Liu, Kehai Chen, Meishan Zhang, Dacheng Tao, Min Zhang
Text Rendering Strategies for Pixel Language Models
Jonas Lotz, Elizabeth Salesky, Phillip Rust, Desmond Elliott
APoLLo: Unified Adapter and Prompt Learning for Vision Language Models
Sanjoy Chowdhury, Sayan Nag, Dinesh Manocha
SAMRank: Unsupervised Keyphrase Extraction using Self-Attention Map in BERT and GPT-2
Byungha Kang, Youhyun Shin
Contrastive Learning for Inference in Dialogue
Etsuko Ishii, Yan Xu, Bryan Wilie, Ziwei Ji, Holy Lovenia, Willy Chung, Pascale Fung
Editing Large Language Models: Problems, Methods, and Opportunities
Yunzhi Yao, Peng Wang, Bozhong Tian, Siyuan Cheng, Zhoubo Li, Shumin Deng, Huajun Chen, Ningyu Zhang
MarkQA: A large scale KBQA dataset with numerical reasoning
Xiang Huang, Sitao Cheng, Yuheng Bao, Shanshan Huang, Yuzhong Qu
Comparing Biases and the Impact of Multilingual Training across Multiple Languages
Sharon Levy, Neha John, Ling Liu, Yogarshi Vyas, Jie Ma, Yoshinari Fujinuma, Miguel Ballesteros, Vittorio Castelli, Dan Roth
HutCRS: Hierarchical User-Interest Tracking for Conversational Recommender System
Mingjie Qian, Yongsen Zheng, Jinghui Qin, Liang Lin
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT
Xiaoshuai Song, Keqing He, Pei Wang, Guanting Dong, Yutao Mou, Jingang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining
Ting-Rui Chiang, Dani Yogatama
Simple and Effective Input Reformulations for Translation
Brian Yu, Hansen Lillemark, Kurt Keutzer
Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs
Yatin Nandwani, Vineet Kumar, Dinesh Raghu, Sachindra Joshi, Luis Lastras
The ACL OCL Corpus: Advancing Open Science in Computational Linguistics
Shaurya Rohatgi, Yanxia Qin, Benjamin Aw, Niranjana Unnithan, Min-Yen Kan
Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset
Arthur Amalvy, Vincent Labatut, Richard Dufour
Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting
Preethi Lahoti, Nicholas Blumm, Xiao Ma, Raghavendra Kotikalapudi, Sahitya Potluri, Qijun Tan, Hansa Srinivasan, Ben Packer, Ahmad Beirami, Alex Beutel, Jilin Chen
Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection
Xinlin Peng, Ying Zhou, Ben He, Le Sun, Yingfei Sun
Contextual Interaction for Argument Post Quality Assessment
Yiran Wang, Xuanang Chen, Ben He, Le Sun
Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification
Mujeen Sung, James Gung, Elman Mansimov, Nikolaos Pappas, Raphael Shu, Salvatore Romeo, Yi Zhang, Vittorio Castelli
Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations
Zhuoyan Li, Hangxiao Zhu, Zhuoran Lu, Ming Yin
GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations
Muhammet Ilaslan, Chenan Song, Joya Chen, Difei Gao, Weixian Lei, Qianli Xu, Joo Lim, Mike Shou
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection
Indira Sen, Dennis Assenmacher, Mattia Samory, Isabelle Augenstein, Wil Aalst, Claudia Wagner
Unraveling Feature Extraction Mechanisms in Neural Networks
Xiaobing Sun, Jiaxi Li, Wei Lu
CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion
Xingwei He, Yeyun Gong, A-Long Jin, Hang Zhang, Anlei Dong, Jian Jiao, Siu Yiu, Nan Duan
Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks
Yimu Wang, Xiangru Jian, Bo Xue
E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation
Fengyi Fu, Lei Zhang, Quan Wang, Zhendong Mao
ALDi: Quantifying the Arabic Level of Dialectness of Text
Amr Keleg, Sharon Goldwater, Walid Magdy
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao
Goal-Driven Explainable Clustering via Language Descriptions
Zihan Wang, Jingbo Shang, Ruiqi Zhong
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi, Raquel Fernández, Arianna Bisazza
Learning from Mistakes via Cooperative Study Assistant for Large Language Models
Danqing Wang, Lei Li
Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language Models
Joan Nwatu, Oana Ignat, Rada Mihalcea
Conceptor-Aided Debiasing of Large Language Models
Li Yifei, Lyle Ungar, João Sedoc
AMR Parsing is Far from Solved: GrAPES, the Granular AMR Parsing Evaluation Suite
Jonas Groschwitz, Shay Cohen, Lucia Donatelli, Meaghan Fowlie
Rethinking and Improving Multi-task Learning for End-to-end Speech Translation
Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu
AD-NLP: A Benchmark for Anomaly Detection in Natural Language Processing
Matei Bejan, Andrei Manolache, Marius Popescu
Enhancing the Ranking Context of Dense Retrieval through Reciprocal Nearest Neighbors
George Zerveas, Navid Rekabsaz, Carsten Eickhoff
Cross-Lingual Cross-Target Stance Detection with Dual Knowledge Distillation Framework
Ruike Zhang, Hanxuan Yang, Wenji Mao
PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
Rahul Goel, Waleed Ammar, Aditya Gupta, Siddharth Vashishtha, Motoki Sano, Faiz Surani, Max Chang, HyunJeong Choe, David Greene, Chuan He, Rattima Nitisaroj, Anna Trukhina, Shachi Paul, Pararth Shah, Rushin Shah, Zhou Yu
An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extraction
Guanhua Huang, Runxin Xu, Ying Zeng, Jiaze Chen, Zhouwang Yang, Weinan E
CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
Myra Cheng, Tiziano Piccardi, Diyi Yang
Reduce Human Labor On Evaluating Conversational Information Retrieval System: A Human-Machine Collaboration Approach
Chen Huang, Peixin Qin, Wenqiang Lei, Jiancheng Lv
BERTie Bott’s Every Flavor Labels: A Tasty Introduction to Semantic Role Labeling for Galician
Micaella Bruton, Meriem Beloucif
Program Translation via Code Distillation
Yufan Huang, Mengnan Qi, Yongqiang Yao, Maoquan Wang, Bin Gu, Colin Clement, Neel Sundaresan
FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization
Nan Zhang, Yusen Zhang, Wu Guo, Prasenjit Mitra, Rui Zhang
Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning
Saibo Geng, Martin Josifoski, Maxime Peyrard, Robert West
Systematic word meta-sense extension
Lei Yu
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory
Ziang Xiao, Susu Zhang, Vivian Lai, Q. Vera Liao
Revisiting the Knowledge Injection Frameworks
Peng Fu, Yiming Zhang, Haobo Wang, Weikang Qiu, Junbo Zhao
We Are What We Repeatedly Do: Inducing and Deploying Habitual Schemas in Persona-Based Responses
Benjamin Kane, Lenhart Schubert
Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model
Qi Jia, Siyu Ren, Yizhu Liu, Kenny Zhu
TaskWeb: Selecting Better Source Tasks for Multi-task NLP
Joongwon Kim, Akari Asai, Gabriel Ilharco, Hannaneh Hajishirzi
Improving Bias Mitigation through Bias Experts in Natural Language Understanding
Eojin Jeon, Mingyu Lee, Juhyeong Park, Yeachan Kim, Wing-Lam Mok, SangKeun Lee
Semi-supervised multimodal coreference resolution in image narrations
Arushi Goel, Basura Fernando, Frank Keller, Hakan Bilen
A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models
Yi Zhou, Jose Camacho-Collados, Danushka Bollegala
Argument-based Detection and Classification of Fallacies in Political Debates
Pierpaolo Goffredo, Mariana Espinoza, Serena Villata, Elena Cabrio
SpEL: Structured Prediction for Entity Linking
Hassan Shavarani, Anoop Sarkar
Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It’s Best to Relate Perspectives!
Philipp Heinisch, Matthias Orlikowski, Julia Romberg, Philipp Cimiano
Explicit Planning Helps Language Models in Logical Reasoning
Hongyu Zhao, Kangrui Wang, Mo Yu, Hongyuan Mei
clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents
Kranti Chalamalasetti, Jana Götze, Sherzod Hakimov, Brielen Madureira, Philipp Sadler, David Schlangen
Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences
Eleftheria Briakou, Navita Goyal, Marine Carpuat
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Jon Saad-Falcon, Omar Khattab, Keshav Santhanam, Radu Florian, Martin Franz, Salim Roukos, Avirup Sil, Md Sultan, Christopher Potts
TATA: Stance Detection via Topic-Agnostic and Topic-Aware Embeddings
Hans Hanley, Zakir Durumeric
Zero-shot Sharpness-Aware Quantization for Pre-trained Language Models
Miaoxi Zhu, Qihuang Zhong, Li Shen, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao
Deciphering Stereotypes in Pre-Trained Language Models
Weicheng Ma, Henry Scheible, Brian Wang, Goutham Veeramachaneni, Pratim Chowdhary, Alan Sun, Andrew Koulogeorge, Lili Wang, Diyi Yang, Soroush Vosoughi
An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives
Young Cho, Sunny Rai, Lyle Ungar, João Sedoc, Sharath Guntuku
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Minje Choi, Jiaxin Pei, Sagar Kumar, Chang Shu, David Jurgens
Interventional Rationalization
Linan Yue, Qi Liu, Li Wang, Yanqing An, Yichao Du, Zhenya Huang
Don’t Take This Out of Context!: On the Need for Contextual Models and Evaluations for Stylistic Rewriting
Akhila Yerukola, Xuhui Zhou, Elizabeth Clark, Maarten Sap
Axiomatic Preference Modeling for Longform Question Answering
Corby Rosset, Guoqing Zheng, Victor Dibia, Ahmed Awadallah, Paul Bennett
Countering Misinformation via Emotional Response Generation
Daniel Russo, Shane Kaszefski-Yaschuk, Jacopo Staiano, Marco Guerini
Seq2seq is All You Need for Coreference Resolution
Wenzheng Zhang, Sam Wiseman, Karl Stratos
StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding
Cheng Jiayang, Lin Qiu, Tsz Chan, Tianqing Fang, Weiqi Wang, Chunkit Chan, Dongyu Ru, Qipeng Guo, Hongming Zhang, Yangqiu Song, Yue Zhang, Zheng Zhang
Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable Rumor Analysis on Social Media
Yi-Ting Chang, Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai
Crystal: Introspective Reasoners Reinforced with Self-Feedback
Jiacheng Liu, Ramakanth Pasunuru, Hannaneh Hajishirzi, Yejin Choi, Asli Celikyilmaz
DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation
Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Ye Zhongyi, Linli Xu
BioFEG: Generate Latent Features for Biomedical Entity Linking
Xuhui Sui, Ying Zhang, Xiangrui Cai, Kehui Song, Baohang Zhou, Xiaojie Yuan, Wensheng Zhang
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu
Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors
Nikita Mehandru, Sweta Agrawal, Yimin Xiao, Ge Gao, Elaine Khoong, Marine Carpuat, Niloufar Salehi
Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive
Tharindu Weerasooriya, Sujan Dutta, Tharindu Ranasinghe, Marcos Zampieri, Christopher Homan, Ashiqur KhudaBukhsh
Generating Summaries with Controllable Readability Levels
Leonardo Ribeiro, Mohit Bansal, Markus Dreyer
CESAR: Automatic Induction of Compositional Instructions for Multi-turn Dialogs
Taha Aksu, Devamanyu Hazarika, Shikib Mehri, Seokhwan Kim, Dilek Hakkani-Tur, Yang Liu, Mahdi Namazifar
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos
Te-Lin Wu, Zi-Yi Dou, Qingyuan Hu, Yu Hou, Nischal Chandra, Marjorie Freedman, Ralph Weischedel, Nanyun Peng
From Parse-Execute to Parse-Execute-Refine: Improving Semantic Parser for Complex Question Answering over Knowledge Base
Wangzhen Guo, Linyin Luo, Hanjiang Lai, Jian Yin
CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation
Philipp Borchert, Jochen De Weerdt, Kristof Coussement, Arno De Caigny, Marie-Francine Moens
Models See Hallucinations: Evaluating the Factuality in Video Captioning
Hui Liu, Xiaojun Wan
Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors
Marek Kubis, Paweł Skórzewski, Marcin Sowański, Tomasz Ziętkiewicz
Can Language Models Understand Physical Concepts?
Lei Li, Jingjing Xu, Qingxiu Dong, Ce Zheng, Xu Sun, Lingpeng Kong, Qi Liu
SPT: Learning to Selectively Insert Prompts for Better Prompt Tuning
Wei Zhu, Ming Tan
Once Upon a Time in Graph: Relative-Time Pretraining for Complex Temporal Reasoning
Sen Yang, Xin Li, Lidong Bing, Wai Lam
Expository Text Generation: Imitate, Retrieve, Paraphrase
Nishant Balepur, Jie Huang, Kevin Chang
Enhancing Textbooks with Visuals from the Web for Improved Learning
Janvijay Singh, Vilém Zouhar, Mrinmaya Sachan
Continual Event Extraction with Semantic Confusion Rectification
Zitao Wang, Xinyi Wang, Wei Hu
An Empirical Study of Translation Hypothesis Ensembling with Large Language Models
António Farinhas, José de Souza, Andre Martins
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
Geewook Kim, Hodong Lee, Daehee Kim, Haeji Jung, Sanghee Park, Yoonsik Kim, Sangdoo Yun, Taeho Kil, Bado Lee, Seunghyun Park
Continual Learning for Multilingual Neural Machine Translation via Dual Importance-based Model Division
Junpeng Liu, Kaiyu Huang, Hao Yu, Jiuyi Li, Jinsong Su, Degen Huang
SimCSE++: Improving Contrastive Learning for Sentence Embeddings from Two Perspectives
Jiahao Xu, Wei Shao, Lihui Chen, Lemao Liu
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Jiaao Chen, Diyi Yang
Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration
Yiquan Wu, Siying Zhou, Yifei Liu, Weiming Lu, Xiaozhong Liu, Yating Zhang, Changlong Sun, Fei Wu, Kun Kuang
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Sewon Min, Kalpesh Krishna, Xinxi Lyu, Mike Lewis, Wen-tau Yih, Pang Koh, Mohit Iyyer, Luke Zettlemoyer, Hannaneh Hajishirzi
When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Michael Hanna, Yonatan Belinkov, Sandro Pezzelle
Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs
Qing Wang, Kang Zhou, Qiao Qiao, Yuepei Li, Qi Li
Paraphrase Types for Generation and Detection
Jan Wahle, Bela Gipp, Terry Ruas
Target-to-Source Augmentation for Aspect Sentiment Triplet Extraction
Yice Zhang, Yifan Yang, Meng Li, Bin Liang, Shiwei Chen, Ruifeng Xu
PAC-tuning: Fine-tuning Pre-trained Language Models with PAC-driven Perturbed Gradient Descent
Guangliang Liu, Zhiyu Xue, Xitong Zhang, Kristen Johnson, Rongrong Wang
Emergence of Abstract State Representations in Embodied Sequence Modeling
Tian Yun, Zilai Zeng, Kunal Handa, Ashish Thapliyal, Bo Pang, Ellie Pavlick, Chen Sun
Accelerating Toeplitz Neural Network with Constant-time Inference Complexity
Zhen Qin, Yiran Zhong
Dissecting Recall of Factual Associations in Auto-Regressive Language Models
Mor Geva, Jasmijn Bastings, Katja Filippova, Amir Globerson
StereoMap: Quantifying the Awareness of Human-like Stereotypes in Large Language Models
Sullam Jeoung, Yubin Ge, Jana Diesner
Impressions: Visual Semiotics and Aesthetic Impact Understanding
Julia Kruk, Caleb Ziems, Diyi Yang
DNA: Denoised Neighborhood Aggregation for Fine-grained Category Discovery
Wenbin An, Feng Tian, Wenkai Shi, Yan Chen, Qinghua Zheng, QianYing Wang, Ping Chen
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Shuai Zhao, Jinming Wen, Anh Luu, Junbo Zhao, Jie Fu
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
Daixuan Cheng, Shaohan Huang, Junyu Bi, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Furu Wei, Weiwei Deng, Qi Zhang
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning
Xiao Yu, Qingyang Wu, Kun Qian, Zhou Yu
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
Fajri Koto, Nurul Aisyah, Haonan Li, Timothy Baldwin
Let’s Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
Pranjal Aggarwal, Aman Madaan, Yiming Yang, Mausam
Bridging Information-Theoretic and Geometric Compression in Language Models
Emily Cheng, Corentin Kervadec, Marco Baroni
Pre-training Language Models for Comparative Reasoning
Mengxia Yu, Zhihan Zhang, Wenhao Yu, Meng Jiang
Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search
Xiang Geng, Yu Zhang, Zhejian Lai, Shuaijie She, Wei Zou, Shimin Tao, Hao Yang, Jiajun Chen, Shujian Huang
Text Embeddings Reveal (Almost) As Much As Text
John Morris, Volodymyr Kuleshov, Vitaly Shmatikov, Alexander Rush
AutoTrial: Prompting Language Models for Clinical Trial Design
Zifeng Wang, Cao Xiao, Jimeng Sun
Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback
Yujia Zhou, Zhicheng Dou, Ji-Rong Wen
Multi-Source Probing for Open-Domain Conversational Understanding
Yuanxi Li, Hao Zhou, Jie Zhou, Minlie Huang
Hallucination Mitigation in Natural Language Generation from Large-Scale Open-Domain Knowledge Graphs
Xiao Shi, Zhengyuan Zhu, Zeyu Zhang, Chengkai Li
Multi-Source Multi-Type Knowledge Exploration and Exploitation for Dialogue Generation
Xuanfan Ni, Hongliang Dai, Zhaochun Ren, Piji Li
Focus Your Attention (with Adaptive IIR Filters)
Shahar Lutati, Itamar Zimerman, Lior Wolf
Identifying Statements Crucial for Awareness of Interpretive Nonsense to Prevent Communication Breakdowns
Tomoyuki Maekawa, Michita Imai
Multilingual Large Language Models Are Not (Yet) Code-Switchers
Ruochen Zhang, Samuel Cahyawijaya, Jan Christian Blaise Cruz, Genta Winata, Alham Aji
Reinforced Target-driven Conversational Promotion
Huy Dao, Lizi Liao, Dung Le, Yuxiang Nie
Identification of Multimodal Stance Towards Frames of Communication
Maxwell Weinzierl, Sanda Harabagiu
Unsupervised Sounding Pixel Learning
Yining Zhang, Yanli Ji, Yang Yang
LM vs LM: Detecting Factual Errors via Cross Examination
Roi Cohen, May Hamri, Mor Geva, Amir Globerson
Large Language Models: The Need for Nuance in Current Debates and a Pragmatic Perspective on Understanding
Bram van Dijk, Tom Kouwenhoven, Marco Spruit, Max Johannes van Duijn
PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training
Yunyi Zhang, Minhao Jiang, Yu Meng, Yu Zhang, Jiawei Han
MeaeQ: Mount Model Extraction Attacks with Efficient Queries
Chengwei Dai, Minxuan Lv, Kun Li, Wei Zhou
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
Seungone Kim, Se Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
Explaining Interactions Between Text Spans
Sagnik Choudhury, Pepa Atanasova, Isabelle Augenstein
Predictive Chemistry Augmented with Text Retrieval
Yujie Qian, Zhening Li, Zhengkai Tu, Connor Coley, Regina Barzilay
System Combination via Quality Estimation for Grammatical Error Correction
Muhammad Qorib, Hwee Ng
Rethinking Negative Pairs in Code Search
Haochen Li, Xin Zhou, Anh Luu, Chunyan Miao
Question Answering as Programming for Solving Time-Sensitive Questions
Xinyu Zhu, Cheng Yang, Bei Chen, Siheng Li, Jian-Guang Lou, Yujiu Yang
Joint Geometrical and Statistical Domain Adaptation for Cross-domain Code Vulnerability Detection
Qianjin Du, Shiji Zhou, Xiaohui Kuang, Gang Zhao, Jidong Zhai
Controlling Pre-trained Language Models for Grade-Specific Text Simplification
Sweta Agrawal, Marine Carpuat
CLEVR-Implicit: A Diagnostic Dataset for Implicit Reasoning in Referring Expression Comprehension
Jingwei Zhang, Xin Wu, Yi Cai
“Are Your Explanations Reliable?” Investigating the Stability of LIME in Explaining Text Classifiers by Marrying XAI and Adversarial Attack
Christopher Burger, Lingwei Chen, Thai Le
CQE: A Comprehensive Quantity Extractor
Satya Almasian, Vivian Kazakova, Philipp Göldner, Michael Gertz
A Unified View of Evaluation Metrics for Structured Prediction
Yunmo Chen, William Gantt, Tongfei Chen, Aaron White, Benjamin Van Durme
A Deeper (Autoregressive) Approach to Non-Convergent Discourse Parsing
Oren Tsur, Yoav Tulpan
We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields
Jan Wahle, Terry Ruas, Mohamed Abdalla, Bela Gipp, Saif Mohammad
Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration
Daniel Deutsch, George Foster, Markus Freitag
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
Hyunwoo Kim, Jack Hessel, Liwei Jiang, Peter West, Ximing Lu, Youngjae Yu, Pei Zhou, Ronan Bras, Malihe Alikhani, Gunhee Kim, Maarten Sap, Yejin Choi
Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs
Zhiwei Hu, Victor Basulto, Zhiliang Xiang, Ru Li, Jeff Pan
MailEx: Email Event and Argument Extraction
Saurabh Srivastava, Gaurav Singh, Shou Matsumoto, Ali Raz, Paulo Costa, Joshua Poore, Ziyu Yao
Optimized Tokenization for Transcribed Error Correction
Tomer Wullach, Shlomo Chazan
Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering
Yi Su, Yixin Ji, Juntao Li, Hai Ye, Min Zhang
Generative Adversarial Training with Perturbed Token Detection for Model Robustness
Jiahao Zhao, Wenji Mao
Multi-Task Knowledge Distillation with Embedding Constraints for Scholarly Keyphrase Boundary Classification
Seo Park, Cornelia Caragea
Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation
Anastasia Kritharoula, Maria Lymperaiou, Giorgos Stamou
Be Selfish, But Wisely: Investigating the Impact of Agent Personality in Mixed-Motive Human-Agent Interactions
Kushal Chawla, Ian Wu, Yu Rong, Gale Lucas, Jonathan Gratch
Doolittle: Benchmarks and Corpora for Academic Writing Formalization
Shizhe Diao, Yongyu Lei, Liangming Pan, Tianqing Fang, Wangchunshu Zhou, Sedrick Keh, Min-Yen Kan, Tong Zhang
Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts
Haochen Tan, Han Wu, Wei Shao, Xinyun Zhang, Mingjie Zhan, Zhaohui Hou, Ding Liang, Linqi Song
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
Davis Liang, Hila Gonen, Yuning Mao, Rui Hou, Naman Goyal, Marjan Ghazvininejad, Luke Zettlemoyer, Madian Khabsa
Character-LLM: A Trainable Agent for Role-Playing
Yunfan Shao, Linyang Li, Junqi Dai, Xipeng Qiu
Natural Language Decompositions of Implicit Content Enable Better Text Representations
Alexander Hoyle, Rupak Sarkar, Pranav Goel, Philip Resnik
A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports
Xinyu Wang, Lin Gui, Yulan He
Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation
Zhiling Zhang, Mengyue Wu, Kenny Zhu
How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning
Rochelle Choenni, Dan Garrette, Ekaterina Shutova
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
Nan Wang, Qifan Wang, Yi-Chia Wang, Maziar Sanjabi, Jingzhou Liu, Hamed Firooz, Hongning Wang, Shaoliang Nie
NameGuess: Column Name Expansion for Tabular Data
Jiani Zhang, Zhengyuan Shen, Balasubramaniam Srinivasan, Shen Wang, Huzefa Rangwala, George Karypis
BLESS: Benchmarking Large Language Models on Sentence Simplification
Tannon Kew, Alison Chi, Laura Vásquez-Rodríguez, Sweta Agrawal, Dennis Aumiller, Fernando Alva-Manchego, Matthew Shardlow
To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing
Sireesh Gururaja, Amanda Bertsch, Clara Na, David Widder, Emma Strubell
PALS: Personalized Active Learning for Subjective Tasks in NLP
Kamil Kanclerz, Konrad Karanowski, Julita Bielaniewicz, Marcin Gruza, Piotr Miłkowski, Jan Kocon, Przemyslaw Kazienko
ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
Yangyi Chen, Xingyao Wang, Manling Li, Derek Hoiem, Heng Ji
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Huiqiang Jiang, Qianhui Wu, Chin-Yew Lin, Yuqing Yang, Lili Qiu
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact Verification
Yingjie Zhu, Jiasheng Si, Yibo Zhao, Haiyang Zhu, Deyu Zhou, Yulan He
An Exploration of Left-Corner Transformations
Andreas Opedal, Eleftheria Tsipidi, Tiago Pimentel, Ryan Cotterell, Tim Vieira
Characterizing and Verifying Scientific Claims: Qualitative Causal Structure is All You Need
Jinxuan Wu, Wenhan Chao, Xian Zhou, Zhunchen Luo
FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models
Konstantin Dobler, Gerard de Melo
ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Ruoyao Wang, Graham Todd, Xingdi Yuan, Ziang Xiao, Marc-Alexandre Côté, Peter Jansen
Skill-Based Few-Shot Selection for In-Context Learning
Shengnan An, Bo Zhou, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Weizhu Chen, Jian-Guang Lou
MaNtLE: Model-agnostic Natural Language Explainer
Rakesh Menon, Kerem Zaman, Shashank Srivastava
PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer
Lichang Chen, Jiuhai Chen, Heng Huang, Minhao Cheng
Ling-CL: Understanding NLP Models through Linguistic Curricula
Mohamed Elgaar, Hadi Amiri
Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance
Shaomu Tan, Christof Monz
SEER: A Knapsack approach to Exemplar Selection for In-Context HybridQA
Jonathan Tonglet, Manon Reusens, Philipp Borchert, Bart Baesens
Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations
Jihyoung Jang, Minseong Boo, Hyounghun Kim
DueT: Image-Text Contrastive Transfer Learning with Dual-adapter Tuning
Taku Hasegawa, Kyosuke Nishida, Koki Maeda, Kuniko Saito
Towards a Unified Conversational Recommendation System: Multi-task Learning via Contextualized Knowledge Distillation
Yeongseo Jung, Eunseo Jung, Lei Chen
MoPe: Model Perturbation based Privacy Attacks on Language Models
Marvin Li, Jason Wang, Jeffrey Wang, Seth Neel
q2d: Turning Questions into Dialogs to Teach Models How to Search
Yonatan Bitton, Shlomi Cohen-Ganor, Ido Hakimi, Yoad Lewenberg, Roee Aharoni, Enav Weinreb
Aligning Large Language Models through Synthetic Feedback
Sungdong Kim, Sanghwan Bae, Jamin Shin, Soyoung Kang, Donghyun Kwak, Kang Yoo, Minjoon Seo
You Told Me That Joke Twice: A Systematic Investigation of Transferability and Robustness of Humor Detection Models
Alexander Baranov, Vladimir Kniazhevsky, Pavel Braslavski
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction
Chong Zhang, Ya Guo, Yi Tu, Huan Chen, Jinyang Tang, Huijia Zhu, Qi Zhang, Tao Gui
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
Hongqiu Wu, Linfeng Liu, Hai Zhao, Min Zhang
The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis
Pranav Venkit, Mukund Srinath, Sanjana Gautam, Saranya Venkatraman, Vipul Gupta, Rebecca Passonneau, Shomir Wilson
DADA: Dialect Adaptation via Dynamic Aggregation of Linguistic Rules
Yanchen Liu, William Held, Diyi Yang
Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation
Mingfeng Xue, Dayiheng Liu, Wenqiang Lei, Jie Fu, Jian Lan, Mei Li, Baosong Yang, Jun Xie, Yidan Zhang, Dezhong Peng, Jiancheng Lv
The Benefits of Label-Description Training for Zero-Shot Text Classification
Lingyu Gao, Debanjan Ghosh, Kevin Gimpel
Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer
Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post
Finding Authentic Counterhate Arguments: A Case Study with Public Figures
Abdullah Albanyan, Ahmed Hassan, Eduardo Blanco
Can We Edit Multimodal Large Language Models?
Siyuan Cheng, Bozhong Tian, Qingbin Liu, Xi Chen, Yongheng Wang, Huajun Chen, Ningyu Zhang
Exploring Discourse Structure in Document-level Machine Translation
Xinyu Hu, Xiaojun Wan
ClusterLLM: Large Language Models as a Guide for Text Clustering
Yuwei Zhang, Zihan Wang, Jingbo Shang
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
Shuyan Zhou, Uri Alon, Sumit Agarwal, Graham Neubig
Learn and Consolidate: Continual Adaptation for Zero-Shot and Multilingual Neural Machine Translation
Kaiyu Huang, Peng Li, Junpeng Liu, Maosong Sun, Yang Liu
e-THERAPIST: I suggest you to cultivate a mindset of positivity and nurture uplifting thoughts
Kshitij Mishra, Priyanshu Priya, Manisha Burja, Asif Ekbal
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages
Shamsuddeen Muhammad, Idris Abdulmumin, Abinew Ayele, Nedjma Ousidhoum, David Adelani, Seid Yimam, Ibrahim Ahmad, Meriem Beloucif, Saif Mohammad, Sebastian Ruder, Oumaima Hourrane, Alipio Jorge, Pavel Brazdil, Felermino Ali, Davis David, Salomey Osei, Bello Shehu-Bello, Falalu Lawan, Tajuddeen Gwadabe, Samuel Rutunda, Tadesse Belay, Wendimu Messelle, Hailu Balcha, Sisay Chala, Hagos Gebremichael, Bernard Opoku, Stephen Arthur
Quantifying Character Similarity with Vision Transformers
Xinmei Yang, Abhishek Arora, Shao-Yu Jheng, Melissa Dell
Syllogistic Reasoning for Legal Judgment Analysis
Wentao Deng, Jiahuan Pei, Keyi Kong, Zhe Chen, Furu Wei, Yujun Li, Zhaochun Ren, Zhumin Chen, Pengjie Ren
Improving Transformer-based Program Repair Model through False Behavior Diagnosis
Youngkyoung Kim, Misoo Kim, Eunseok Lee
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection
Sehyun Choi, Tianqing Fang, Zhaowei Wang, Yangqiu Song
CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL
Mayank Kothyari, Dhruva Dhingra, Sunita Sarawagi, Soumen Chakrabarti
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs
Roei Herzig, Alon Mendelson, Leonid Karlinsky, Assaf Arbelle, Rogerio Feris, Trevor Darrell, Amir Globerson
TLM: Token-Level Masking for Transformers
Yangjun Wu, Kebin Fang, Dongxiang Zhang, Han Wang, Hao Zhang, Gang Chen
Addressing NER Annotation Noises with Uncertainty-Guided Tree-Structured CRFs
Jian Liu, Weichang Liu, Yufeng Chen, Jinan Xu, Zhe Zhao
Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus
Andrea Piergentili, Beatrice Savoldi, Dennis Fucci, Matteo Negri, Luisa Bentivogli
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale
Marta Costa-jussà, Pierre Andrews, Eric Smith, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Daniel Licht, Carleigh Wood
GlobalBench: A Benchmark for Global Progress in Natural Language Processing
Yueqi Song, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Winata, Alham Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig
DetGPT: Detect What You Need via Reasoning
Renjie Pi, Jiahui Gao, Shizhe Diao, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang
Language Models with Rationality
Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schuetze, Peter Clark
Self-Improvement of Non-autoregressive Model via Sequence-Level Distillation
Yusheng Liao, Shuyang Jiang, Yiqi Li, Yu Wang, Yanfeng Wang
Mitigating Temporal Misalignment by Discarding Outdated Facts
Michael Zhang, Eunsol Choi
Open-world Semi-supervised Generalized Relation Discovery Aligned in a Real-world Setting
William Hogan, Jiacheng Li, Jingbo Shang
IEKG: A Commonsense Knowledge Graph for Idiomatic Expressions
Ziheng Zeng, Kellen Cheng, Srihari Nanniyur, Jianing Zhou, Suma Bhat
Bias Neutralization in Non-Parallel Texts: A Cyclic Approach with Auxiliary Guidance
Karthic Madanagopal, James Caverlee
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation
Jason Lucas, Adaku Uchendu, Michiharu Yamashita, Jooyoung Lee, Shaurya Rohatgi, Dongwon Lee
BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
Yifan Jiang, Filip Ilievski, Kaixin Ma, Zhivar Sourati
When are Lemons Purple? The Concept Association Bias of Vision-Language Models
Yingtian Tang, Yutaro Yamada, Yoyo Zhang, Ilker Yildirim
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, Barbara Plank
Text Representation Distillation via Information Bottleneck Principle
Yanzhao Zhang, Dingkun Long, Zehan Li, Pengjun Xie
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
Zhenwen Liang, Wenhao Yu, Tanmay Rajpurohit, Peter Clark, Xiangliang Zhang, Ashwin Kalyan
FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Hyunwoo Kim, Melanie Sclar, Xuhui Zhou, Ronan Bras, Gunhee Kim, Yejin Choi, Maarten Sap
Exploring the Boundaries of GPT-4 in Radiology
Qianchu Liu, Stephanie Hyland, Shruthi Bannur, Kenza Bouzid, Daniel Castro, Maria Wetscherek, Robert Tinn, Harshita Sharma, Fernando Pérez-García, Anton Schwaighofer, Pranav Rajpurkar, Sameer Khanna, Hoifung Poon, Naoto Usuyama, Anja Thieme, Aditya Nori, Matthew Lungren, Ozan Oktay, Javier Alvarez-Valle
A Frustratingly Easy Post-Training Quantization Scheme for LLMs
Yongkweon Jeon, Chungman Lee, Kyungphil Park, Ho-young Kim
A Comprehensive Evaluation of Biomedical Entity Linking Models
David Kartchner, Jennifer Deng, Shubham Lohiya, Tejasri Kopparthi, Prasanth Bathala, Daniel Domingo-Fernández, Cassie Mitchell
Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals
Sukannya Purkayastha, Anne Lauscher, Iryna Gurevych
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages
Milind Agarwal, Md Mahfuz Ibn Alam, Antonios Anastasopoulos
FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models
Ruixuan Xiao, Yiwen Dong, Junbo Zhao, Runze Wu, Minmin Lin, Gang Chen, Haobo Wang
API-Assisted Code Generation for Question Answering on Varied Table Structures
Yihan Cao, Shuyi Chen, Ryan Liu, Zhiruo Wang, Daniel Fried
Data Factors for Better Compositional Generalization
Xiang Zhou, Yichen Jiang, Mohit Bansal
ChatEdit: Towards Multi-turn Interactive Facial Image Editing via Dialogue
Xing Cui, Zekun Li, Pei Li, Yibo Hu, Hailin Shi, Chunshui Cao, Zhaofeng He
Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations
James Huang, Wenlin Yao, Kaiqiang Song, Hongming Zhang, Muhao Chen, Dong Yu
Hi-ArG: Exploring the Integration of Hierarchical Argumentation Graphs in Language Pretraining
Jingcong Liang, Rong Ye, Meng Han, Qi Zhang, Ruofei Lai, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Zhongyu Wei
Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization
Zihao Fu, Yixuan Su, Zaiqiao Meng, Nigel Collier
GNAT: A General Narrative Alignment Tool
Tanzir Pial, Steven Skiena
UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Ahmed Masry, Parsa Kavehzadeh, Do Long, Enamul Hoque, Shafiq Joty
Distance-Based Propagation for Efficient Knowledge Graph Reasoning
Harry Shomer, Yao Ma, Juanhui Li, Bo Wu, Charu Aggarwal, Jiliang Tang
What to Read in a Contract? Party-Specific Summarization of Legal Obligations, Entitlements, and Prohibitions
Abhilasha Sancheti, Aparna Garimella, Balaji Srinivasan, Rachel Rudinger
Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization
Janghwan Lee, Minsoo Kim, Seungcheol Baek, Seok Hwang, Wonyong Sung, Jungwook Choi
CP-BCS: Binary Code Summarization Guided by Control Flow Graph and Pseudo Code
Tong Ye, Lingfei Wu, Tengfei Ma, Xuhong Zhang, Yangkai Du, Peiyu Liu, Shouling Ji, Wenhai Wang
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
Caoyun Fan, Jidong Tian, Yitian Li, Wenqing Chen, Hao He, Yaohui Jin
Large Language Models are Complex Table Parsers
Bowen Zhao, Changkai Ji, Yuejie Zhang, Wen He, Yingwen Wang, Qing Wang, Rui Feng, Xiaobo Zhang
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests
Yue Fan, Jing Gu, Kaizhi Zheng, Xin Wang
Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries
Ashish Mittal, Sunita Sarawagi, Preethi Jyothi, George Saon, Gakuto Kurata
Generative Table Pre-training Empowers Models for Tabular Prediction
Tianping Zhang, Shaowen Wang, Shuicheng Yan, Li Jian, Qian Liu
Learning to Describe for Predicting Zero-shot Drug-Drug Interactions
Fangqi Zhu, Yongqi Zhang, Lei Chen, Bing Qin, Ruifeng Xu
Privacy Implications of Retrieval-Based Language Models
Yangsibo Huang, Samyak Gupta, Zexuan Zhong, Kai Li, Danqi Chen
IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems
Xu Huang, Zhirui Zhang, Ruize Gao, Yichao Du, Lemao Liu, Guoping Huang, Shuming Shi, Jiajun Chen, Shujian Huang
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
DiNeR: A Large Realistic Dataset for Evaluating Compositional Generalization
Chengang Hu, Xiao Liu, Yansong Feng
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen, Hexiang Hu, Yi Luan, Haitian Sun, Soravit Changpinyo, Alan Ritter, Ming-Wei Chang
EDeR: Towards Understanding Dependency Relations Between Events
Ruiqi Li, Patrik Haslum, Leyang Cui
It Ain’t Over: A Multi-aspect Diverse Math Word Problem Dataset
Jiwoo Kim, Youngbin Kim, Ilwoong Baek, JinYeong Bak, Jongwuk Lee
Dr ChatGPT tell me what I want to hear: How different prompts impact health answer correctness
Bevan Koopman, Guido Zuccon
kNN-LM Does Not Improve Open-ended Text Generation
Shufan Wang, Yixiao Song, Andrew Drozdov, Aparna Garimella, Varun Manjunatha, Mohit Iyyer
Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model
Zeyu Liu, Tim Dettmers, Xi Victoria Lin, Veselin Stoyanov, Xian Li
Exploring the Impact of Model Scaling on Parameter-Efficient Tuning
Yusheng Su, Chi-Min Chan, Jiali Cheng, Yujia Qin, Yankai Lin, Shengding Hu, Zonghan Yang, Ning Ding, Xingzhi Sun, Guotong Xie, Zhiyuan Liu, Maosong Sun
STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
Chen Chen, Bowen Zhang, Liangliang Cao, Jiguang Shen, Tom Gunter, Albin Jose, Alexander Toshev, Yantao Zheng, Jonathon Shlens, Ruoming Pang, Yinfei Yang
Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting
Emmy Liu, Aditi Chaudhary, Graham Neubig
A linear time approximation of Wasserstein distance with word embedding selection
Sho Otao, Makoto Yamada
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
Zhangyue Yin, Qiushi Sun, Cheng Chang, Qipeng Guo, Junqi Dai, Xuanjing Huang, Xipeng Qiu
Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction
Cam Van Thi Nguyen, Tuan Mai, Son The, Dang Kieu, Duc-Trong Le
Connecting degree and polarity: An artificial language learning study
Lisa Bylinina, Alexey Tikhonov, Ekaterina Garmash
Prompting with Pseudo-Code Instructions
Mayank Mishra, Prince Kumar, Riyaz Bhat, Rudra Murthy, Danish Contractor, Srikanth Tamilselvam
CRAB: Assessing the Strength of Causal Relationships Between Real-world Events
Angelika Romanou, Syrielle Montariol, Debjit Paul, Leo Laugier, Karl Aberer, Antoine Bosselut
NORMSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly
Yi Fung, Tuhin Chakrabarty, Hao Guo, Owen Rambow, Smaranda Muresan, Heng Ji
A State-Vector Framework for Dataset Effects
Esmat Sahak, Zining Zhu, Frank Rudzicz
Challenges in Context-Aware Neural Machine Translation
Linghao Jin, Jacqueline He, Jonathan May, Xuezhe Ma
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond
Siyang Liu, Naihao Deng, Sahand Sabour, Yilin Jia, Minlie Huang, Rada Mihalcea
FACTIFY3M: A benchmark for multimodal fact verification with explainability through 5W Question-Answering
Megha Chakraborty, Khushbu Pahwa, Anku Rani, Shreyas Chatterjee, Dwip Dalal, Harshit Dave, Ritvik G, Preethi Gurumurthy, Adarsh Mahor, Samahriti Mukherjee, Aditya Pakala, Ishan Paul, Janvita Reddy, Arghya Sarkar, Kinjal Sensharma, Aman Chadha, Amit Sheth, Amitava Das
Building Multi-domain Dialog State Trackers from Single-domain Dialogs
Qi Zhu, Zheng Zhang, Xiaoyan Zhu, Minlie Huang
Specialist or Generalist? Instruction Tuning for Specific NLP Tasks
Chufan Shi, Yixuan Su, Cheng Yang, Yujiu Yang, Deng Cai
Making Large Language Models Better Data Creators
Dong-Ho Lee, Jay Pujara, Mohit Sewak, Ryen White, Sujay Jauhar
Hallucination Detection for Generative Large Language Models by Bayesian Sequential Estimation
Xiaohua Wang, Yuliang Yan, Longtao Huang, Xiaoqing Zheng, Xuanjing Huang
Guideline Learning for In-Context Information Extraction
Chaoxu Pang, Yixuan Cao, Qiang Ding, Ping Luo
Open Information Extraction via Chunks
Kuicai Dong, Aixin Sun, Jung-jae Kim, Xiaoli Li
Rethinking Word-Level Auto-Completion in Computer-Aided Translation
Xingyu Chen, Lemao Liu, Guoping Huang, Zhirui Zhang, Mingming Yang, Shuming Shi, Rui Wang
Automatic Transcription of Handwritten Old Occitan Language
Esteban Arias, Vallari Pai, Matthias Schöffel, Christian Heumann, Matthias Aßenmacher
CorefPrompt: Prompt-based Event Coreference Resolution by Measuring Event Type and Argument Compatibilities
Sheng Xu, Peifeng Li, Qiaoming Zhu
Anaphor Assisted Document-Level Relation Extraction
Chonggang Lu, Richong Zhang, Kai Sun, Jaein Kim, Cunwang Zhang, Yongyi Mao
All Things Considered: Detecting Partisan Events from News Media with Cross-Article Comparison
Yujian Liu, Xinliang Zhang, Kaijian Zou, Ruihong Huang, Nicholas Beauchamp, Lu Wang
BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification
Mithun Das, Animesh Mukherjee
ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts
Lena Bolliger, David Reich, Patrick Haller, Deborah Jakobi, Paul Prasse, Lena Jäger
From Values to Opinions: Predicting Human Behaviors and Stances Using Value-Injected Large Language Models
Dongjun Kang, Joonsuk Park, Yohan Jo, JinYeong Bak
Analyzing Film Adaptation through Narrative Alignment
Tanzir Pial, Shahreen Aunti, Charuta Pethe, Allen Kim, Steven Skiena
Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer
Ruize Gao, Zhirui Zhang, Yichao Du, Lemao Liu, Rui Wang
Variance Matters: Detecting Semantic Differences without Corpus/Word Alignment
Ryo Nagata, Hiroya Takamura, Naoki Otani, Yoshifumi Kawasaki
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter
Zhiyuan Liu, Sihang Li, Yanchen Luo, Hao Fei, Yixin Cao, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua
A Training-Free Debiasing Framework with Counterfactual Reasoning for Conversational Emotion Detection
Geng Tu, Ran Jing, Bin Liang, Min Yang, Kam-Fai Wong, Ruifeng Xu
Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations
Wei-Lin Chen, Cheng-Kuang Wu, Yun-Nung Chen, Hsin-Hsi Chen
Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding
Taolin Zhang, Ruyao Xu, Chengyu Wang, Zhongjie Duan, Cen Chen, Minghui Qiu, Dawei Cheng, Xiaofeng He, Weining Qian
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
Zexuan Zhong, Zhengxuan Wu, Christopher Manning, Christopher Potts, Danqi Chen
Stance Detection on Social Media with Background Knowledge
Ang Li, Bin Liang, Jingqian Zhao, Bowen Zhang, Min Yang, Ruifeng Xu
Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning
Hao Wang, Xiahua Chen, Rui Wang, Chenhui Chu
Leap-of-Thought: Accelerating Transformers via Dynamic Token Routing
Yeachan Kim, Junho Kim, Jun-Hyung Park, Mingyu Lee, SangKeun Lee
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
Swaroop Nath, Pushpak Bhattacharyya, Harshad Khadilkar
Fair Text Classification with Wasserstein Independence
Thibaud Leteno, Antoine Gourru, Charlotte Laclau, Rémi Emonet, Christophe Gravier
TacoPrompt: A Collaborative Multi-Task Prompt Learning Method for Self-Supervised Taxonomy Completion
Hongyuan Xu, Ciyi Liu, Yuhang Niu, Yunong Chen, Xiangrui Cai, Yanlong Wen, Xiaojie Yuan
Global Voices, Local Biases: Socio-Cultural Prejudices across Languages
Anjishnu Mukherjee, Chahat Raj, Ziwei Zhu, Antonios Anastasopoulos
Graph vs. Sequence: An Empirical Study on Knowledge Forms for Knowledge-Grounded Dialogue
Yizhe Yang, Heyan Huang, Yuhang Liu, Yang Gao
NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models
Yongchao Chen, Rujul Gandhi, Yang Zhang, Chuchu Fan
Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia
Dimitris Gkoumas, Matthew Purver, Maria Liakata
Elevating Code-mixed Text Handling through Auditory Information of Words
Mamta Mamta, Zishan Ahmad, Asif Ekbal
Predict and Use: Harnessing Predicted Gaze to Improve Multimodal Sarcasm Detection
Divyank Tiwari, Diptesh Kanojia, Anupama Ray, Apoorva Nunna, Pushpak Bhattacharyya
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao
Consistency Analysis of ChatGPT
Myeongjun Jang, Thomas Lukasiewicz
Do Differences in Values Influence Disagreements in Online Discussions?
Michiel van der Meer, Piek Vossen, Catholijn Jonker, Pradeep Murukannaiah
A Digital Language Coherence Marker for Monitoring Dementia
Dimitris Gkoumas, Adam Tsakalidis, Maria Liakata
Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks
Heng Wang, Wenqian Zhang, Yuyang Bai, Zhaoxuan Tan, Shangbin Feng, Qinghua Zheng, Minnan Luo
Joyful: Joint Modality Fusion and Graph Contrastive Learning for Multimodal Emotion Recognition
Dongyuan Li, Yusong Wang, Kotaro Funakoshi, Manabu Okumura
HyperRank: Hyperbolic Ranking Model for Unsupervised Keyphrase Extraction
Mingyang Song, Huafeng Liu, Liping Jing
Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification
Apoorva Singh, Siddarth Chandrasekar, Sriparna Saha, Tanmay Sen
Semantic Similarity Models for Depression Severity Estimation
Anxo Pérez, Neha Warikoo, Kexin Wang, Javier Parapar, Iryna Gurevych
Hop, Union, Generate: Explainable Multi-hop Reasoning without Rationale Supervision
Wenting Zhao, Justin Chiu, Claire Cardie, Alexander Rush
ToolWriter: Question Specific Tool Synthesis for Tabular Data
Carlos Gemmell, Jeff Dalton
Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations
Yuan Tian, Zheng Zhang, Zheng Ning, Toby Li, Jonathan Kummerfeld, Tianyi Zhang
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Low Resource With Contrastive Learning
Xiaoming Liu, Zhaohan Zhang, Yichen Wang, Hang Pu, Yu Lan, Chao Shen
AnyTOD: A Programmable Task-Oriented Dialog System
Jeffrey Zhao, Yuan Cao, Raghav Gupta, Harrison Lee, Abhinav Rastogi, Mingqiu Wang, Hagen Soltau, Izhak Shafran, Yonghui Wu
Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization
Chi Cheang, Hou Chan, Derek Wong, Xuebo Liu, Zhaocong Li, Yanming Sun, Shudong Liu, Lidia Chao
Zero-Shot Multi-Label Topic Inference with Sentence Encoders and LLMs
Souvika Sarkar, Dongji Feng, Shubhra Kanti Karmaker Santu
Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines
Yoo Sung, Jordan Boyd-Graber, Naeemul Hassan
Learning From Free-Text Human Feedback – Collect New Datasets Or Extend Existing Ones?
Dominic Petrak, Nafise Moosavi, Ye Tian, Nikolai Rozanov, Iryna Gurevych
Euphemistic Abuse – A New Dataset and Classification Experiments for Implicitly Abusive Language
Michael Wiegand, Jana Kampfmeier, Elisabeth Eder, Josef Ruppenhofer
Exploring Distributional Shifts in Large Language Models for Code Analysis
Shushan Arakelyan, Rocktim Das, Yi Mao, Xiang Ren
ATHENA: Mathematical Reasoning with Thought Expansion
JB. Kim, Hazel Kim, Joonghyuk Hahn, Yo-Sub Han
TIMELINE: Exhaustive Annotation of Temporal Relations Supporting the Automatic Ordering of Events in News Articles
Sarah Alsayyahi, Riza Batista-Navarro
Mitigating Over-Generation for Unsupervised Keyphrase Extraction with Heterogeneous Centrality Detection
Mingyang Song, Pengyu Xu, Yi Feng, Huafeng Liu, Liping Jing
More Than Spoken Words: Nonverbal Message Extraction and Generation
Dian Yu, Xiaoyang Wang, Wanshun Chen, Nan Du, Longyue Wang, Haitao Mi, Dong Yu
Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance
Molly Petersen, Lonneke van der Plas
FAME: Flexible, Scalable Analogy Mappings Engine
Shahar Jacob, Chen Shani, Dafna Shahaf
Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
Chong Li, Shaonan Wang, Yunhao Zhang, Jiajun Zhang, Chengqing Zong
Multilingual Previously Fact-Checked Claim Retrieval
Matúš Pikuliak, Ivan Srba, Robert Moro, Timo Hromadka, Timotej Smoleň, Martin Melišek, Ivan Vykopal, Jakub Simko, Juraj Podroužek, Maria Bielikova
ALCAP: Alignment-Augmented Music Captioner
Zihao He, Weituo Hao, Wei-Tsung Lu, Changyou Chen, Kristina Lerman, Xuchen Song
Do Transformers Parse while Predicting the Masked Word?
Haoyu Zhao, Abhishek Panigrahi, Rong Ge, Sanjeev Arora
Composable Text Controls in Latent Space with ODEs
Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu
P5: Plug-and-Play Persona Prompting for Personalized Response Selection
Joosung Lee, Minsik Oh, Donghun Lee
Reader: Model-based language-instructed reinforcement learning
Nicola Dainese, Pekka Marttinen, Alexander Ilin
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Biao Fu, Minpeng Liao, Kai Fan, Zhongqiang Huang, Boxing Chen, Yidong Chen, Xiaodong Shi
GenEx: A Commonsense-aware Unified Generative Framework for Explainable Cyberbullying Detection
Krishanu Maity, Raghav Jain, Prince Jha, Sriparna Saha, Pushpak Bhattacharyya
Document-Level Machine Translation with Large Language Models
Longyue Wang, Chenyang Lyu, Tianbo Ji, Zhirui Zhang, Dian Yu, Shuming Shi, Zhaopeng Tu
Multilingual Simplification of Medical Texts
Sebastian Joseph, Kathryn Kazanas, Keziah Reina, Vishnesh Ramanathan, Wei Xu, Byron Wallace, Junyi Li
Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation
Jiayu Lin, Rong Ye, Meng Han, Qi Zhang, Ruofei Lai, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Zhongyu Wei
JASMINE: Arabic GPT Models for Few-Shot Learning
El Moatez Billah Nagoudi, Muhammad Abdul-Mageed, AbdelRahim Elmadany, Alcides Inciarte, Md Tawkat Islam Khondaker
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports
Mael Jullien, Marco Valentino, Hannah Frost, Paul O’Regan, Dónal Landers, Andre Freitas
Addressing Linguistic Bias through a Contrastive Analysis of Academic Writing in the NLP Domain
Robert Ridley, Zhen Wu, Jianbing Zhang, Shujian Huang, Xinyu Dai
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
Yue Zhang, Leyang Cui, Enbo Zhao, Wei Bi, Shuming Shi
Detecting Propaganda Techniques in Code-Switched Social Media Text
Muhammad Salman, Asif Hanif, Shady Shehata, Preslav Nakov
Speech Recognition and Meaning Interpretation: Towards Disambiguation of Structurally Ambiguous Spoken Utterances in Indonesian
Ruhiyah Widiaputri, Ayu Purwarianti, Dessi Lestari, Kurniawati Azizah, Dipta Tanaya, Sakriani Sakti
Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation
Minwoo Lee, Hyukhun Koh, Kang-il Lee, Dongdong Zhang, Minsung Kim, Kyomin Jung
Code-Switching Metrics Using Intonation Units
Rebecca Pattichis, Dora LaCasse, Sonya Trawick, Rena Cacoullos
Short Papers
Fine-grained Conversational Decoding via Isotropic and Proximal Search
Yuxuan Yao, Han Wu, Qiling Xu, Linqi Song
Primacy Effect of ChatGPT
Yiwei Wang, Yujun Cai, Muhao Chen, Yuxuan Liang, Bryan Hooi
Better Quality Pre-training Data and T5 Models for African Languages
Akintunde Oladipo, Mofetoluwa Adeyemi, Orevaoghene Ahia, Abraham Owodunni, Odunayo Ogundepo, David Adelani, Jimmy Lin
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Robert Litschko, Max Müller-Eberstein, Rob van der Goot, Leon Weber-Genzel, Barbara Plank
Bootstrapping Small & High Performance Language Models with Unmasking-Removal Training Policy
Yahan Yang, Elior Sulem, Insup Lee, Dan Roth
Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation
Wei-Lin Chen, Cheng-Kuang Wu, Hsin-Hsi Chen, Chung-Chi Chen
Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models
Gangwoo Kim, Sungdong Kim, Byeongguk Jeon, Joonsuk Park, Jaewoo Kang
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Andrea Wen-Yi, David Mimno
Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation
Jian Wang, Yi Cheng, Dongding Lin, Chak Leong, Wenjie Li
Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation
Wenhong Zhu, Hongkun Hao, Rui Wang
PEFTDebias: Capturing debiasing information using PEFTs
Sumit Agarwal, Aditya Veerubhotla, Srijan Bansal
ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation
Xinpeng Wang, Barbara Plank
VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining
Ramon Ruiz-Dolz, Javier Sanchez
Larger Probes Tell a Different Story: Extending Psycholinguistic Datasets Via In-Context Learning
Namrata Shivagunde, Vladislav Lialin, Anna Rumshisky
Did You Mean…? Confidence-based Trade-offs in Semantic Parsing
Elias Stengel-Eskin, Benjamin Van Durme
Understanding the Effect of Model Compression on Social Bias in Large Language Models
Gustavo Gonçalves, Emma Strubell
Once is Enough: A Light-Weight Cross-Attention for Fast Sentence Pair Modeling
Yuanhang Yang, Shiyi Qi, Chuanyi Liu, Qifan Wang, Cuiyun Gao, Zenglin Xu
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation
Mateusz Lango, Ondrej Dusek
Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques
Manon Reusens, Philipp Borchert, Margot Mieskes, Jochen De Weerdt, Bart Baesens
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies
Zhengxuan Wu, Alex Tamkin, Isabel Papadimitriou
GROOViST: A Metric for Grounding Objects in Visual Storytelling
Aditya Surikuchi, Sandro Pezzelle, Raquel Fernández
When Do Decompositions Help for Machine Reading?
Kangda Wei, Dawn Lawrie, Benjamin Van Durme, Yunmo Chen, Orion Weller
Revisiting De-Identification of Electronic Medical Records: Evaluation of Within- and Cross-Hospital Generalization
Yiyang Liu, Jinpeng Li, Enwei Zhu
Language Representation Projection: Can We Transfer Factual Knowledge across Languages in Multilingual Language Models?
Shaoyang Xu, Junzhuo Li, Deyi Xiong
Are All Steps Equally Important? Benchmarking Essentiality Detection in Event Processes
Haoyu Wang, Hongming Zhang, Yueguan Wang, Yuqian Deng, Muhao Chen, Dan Roth
ULF: Unsupervised Labeling Function Correction using Cross-Validation for Weak Supervision
Anastasiia Sedova, Benjamin Roth
Uncertainty Guided Global Memory Improves Multi-Hop Question Answering
Alsu Sagirova, Mikhail Burtsev
Knowledge Distillation $\approx$ Label Smoothing: Fact or Fallacy?
Md Sultan
Analyzing Cognitive Plausibility of Subword Tokenization
Lisa Beinborn, Yuval Pinter
POE: Process of Elimination for Multiple Choice Reasoning
Chenkai Ma, Xinya Du
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis
Hongyi Zheng, Abulhair Saparov
Best of Both Worlds: Towards Improving Temporal Knowledge Base Question Answering via Targeted Fact Extraction
Nithish Kannen, Udit Sharma, Sumit Neelam, Dinesh Khandelwal, Shajith Ikbal, Hima Karanam, L Subramaniam
Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness?
Kevin Liu, Stephen Casper, Dylan Hadfield-Menell, Jacob Andreas
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
Joshua Ainslie, James Lee-Thorp, Michiel de Jong, Yury Zemlyanskiy, Federico Lebron, Sumit Sanghai
BiasX: “Thinking Slow” in Toxic Content Moderation with Explanations of Implied Social Biases
Yiming Zhang, Sravani Nanduri, Liwei Jiang, Tongshuang Wu, Maarten Sap
Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks
Alon Jacovi, Avi Caciularu, Omer Goldman, Yoav Goldberg
MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments
Debtanu Datta, Shubham Soni, Rajdeep Mukherjee, Saptarshi Ghosh
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback
Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher Manning
Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models
Junpeng Li, Zixia Jia, Zilong Zheng
EntSUMv2: Dataset, Models and Evaluation for More Abstractive Entity-Centric Summarization
Dhruv Mehra, Lingjue Xie, Ella Hofmann-Coyle, Mayank Kulkarni, Daniel Preotiuc-Pietro
Analysing State-Backed Propaganda Websites: a New Dataset and Linguistic Study
Freddy Heppell, Kalina Bontcheva, Carolina Scarton
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
Truong Do, Le Khiem, Quang Pham, TrungTin Nguyen, Thanh-Nam Doan, Binh Nguyen, Chenghao Liu, Savitha Ramasamy, Xiaoli Li, Steven Hoi
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
Dheeraj Mekala, Jason Wolfe, Subhro Roy
Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings
Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang
Spoiler Detection as Semantic Text Matching
Ryan Tran, Canwen Xu, Julian McAuley
Dynamic Top-k Estimation Consolidates Disagreement between Feature Attribution Methods
Jonathan Kamp, Lisa Beinborn, Antske Fokkens
BasahaCorpus: An Expanded Linguistic Resource for Readability Assessment in Central Philippine Languages
Joseph Imperial, Ekaterina Kochmar
4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees
Carlos Gómez-Rodríguez, Diego Roca, David Vilares
Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding
Shuwen Deng, Paul Prasse, David Reich, Tobias Scheffer, Lena Jäger
Understanding the Inner-workings of Language Models Through Representation Dissimilarity
Davis Brown, Charles Godfrey, Nicholas Konz, Jonathan Tu, Henry Kvinge
Efficient Classification of Long Documents via State-Space Models
Peng Lu, Suyuchen Wang, Mehdi Rezagholizadeh, Bang Liu, Ivan Kobyzev
Construction Artifacts in Metaphor Identification Datasets
Joanne Boisson, Luis Espinosa-Anke, Jose Camacho-Collados
EtiCor: Corpus for Analyzing LLMs for Etiquettes
Ashutosh Dwivedi, Pradhyumna Lavania, Ashutosh Modi
Prompt-Based Monte-Carlo Tree Search for Goal-oriented Dialogue Policy Planning
Xiao Yu, Maximillian Chen, Zhou Yu
UniMath: A Foundational and Multimodal Mathematical Reasoner
Zhenwen Liang, Tianyu Yang, Jipeng Zhang, Xiangliang Zhang
Simple Temporal Adaptation to Changing Label Sets: Hashtag Prediction via Dense KNN
Niloofar Mireshghallah, Nikolai Vogler, Junxian He, Omar Florez, Ahmed El-Kishky, Taylor Berg-Kirkpatrick
A Study on Accessing Linguistic Information in Pre-Trained Language Models by Using Prompts
Marion Di Marco, Katharina Hämmerl, Alexander Fraser
Copyright Violations and Large Language Models
Antonia Karamolegkou, Jiaang Li, Li Zhou, Anders Søgaard
Somali Information Retrieval Corpus: Bridging the Gap between Query Translation and Dedicated Language Resources
Abdisalam Badel, Ting Zhong, Wenxin Tai, Fan Zhou
Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT
Biru Zhu, Lifan Yuan, Ganqu Cui, Yangyi Chen, Chong Fu, Bingxiang He, Yangdong Deng, Zhiyuan Liu, Maosong Sun, Ming Gu
Faithful Model Evaluation for Model-Based Metrics
Qian Hu, Palash Goyal, Rahul Gupta
Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages
Ethan Wilcox, Clara Meister, Ryan Cotterell, Tiago Pimentel
Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence
Zhihong Zhu, Xuxin Cheng, Zhiqi Huang, Dongsheng Chen, Yuexian Zou
M$^3$Seg: A Maximum-Minimum Mutual Information Paradigm for Unsupervised Topic Segmentation in ASR Transcripts
Ke Wang, Xiutian Zhao, Yanghui Li, Wei Peng
GD-COMET: A Geo-Diverse Commonsense Inference Model
Mehar Bhatia, Vered Shwartz
PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering
Wookje Han, Jinsol Park, Kyungjae Lee
SOUL: Towards Sentiment and Opinion Understanding of Language
Yue Deng, Wenxuan Zhang, Sinno Pan, Lidong Bing
Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks
Andrea Sottana, Bin Liang, Kai Zou, Zheng Yuan
Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text
Qi Cao, Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa
Exploring Linguistic Probes for Morphological Inflection
Jordan Kodner, Salam Khalifa, Sarah Ruth Brogden Payne
FLatS: Principled Out-of-Distribution Detection with Feature-Based Likelihood Ratio Score
Haowei Lin, Yuntian Gu
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications
Manuel Faysse, Gautier Viaud, Céline Hudelot, Pierre Colombo
CLAD-ST: Contrastive Learning with Adversarial Data for Robust Speech Translation
Sathish Indurthi, Shamil Chollampatt, Ravi Agrawal, Marco Turchi
Improved Unsupervised Chinese Word Segmentation Using Pre-trained Knowledge and Pseudo-labeling Transfer
Hsiu-Wen Li, Ying-Jia Lin, Yi-Ting Li, Chun Lin, Hung-Yu Kao
Multilingual $k$-Nearest-Neighbor Machine Translation
David Stap, Christof Monz
Understanding Computational Models of Semantic Change: New Insights from the Speech Community
Filip Miletić, Anne Przewozny-Desriaux, Ludovic Tanguy
Revisiting Automated Topic Model Evaluation with Large Language Models
Dominik Stammbach, Vilém Zouhar, Alexander Hoyle, Mrinmaya Sachan, Elliott Ash
Query2doc: Query Expansion with Large Language Models
Liang Wang, Nan Yang, Furu Wei
Practical Computational Power of Linear Transformers and Their Recurrent and Self-Referential Extensions
Kazuki Irie, Róbert Csordás, Jürgen Schmidhuber
InterFair: Debiasing with Natural Language Feedback for Fair Interpretable Predictions
Bodhisattwa Majumder, Zexue He, Julian McAuley
Large Language Models are biased to overestimate profoundness
Eugenio Herrera-Berg, Tomás Browne, Pablo León-Villagrá, Marc-Lluís Vives, Cristian Calderon
Prompting Scientific Names for Zero-Shot Species Recognition
Shubham Parashar, Zhiqiu Lin, Yanan Li, Shu Kong
MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup
Hua Shen, Vicky Zayats, Johann Rocholl, Daniel Walker, Dirk Padfield
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Srijith Radhakrishnan, Chao-Han Yang, Sumeer Khan, Rohit Kumar, Narsis Kiani, David Gomez-Cabrero, Jesper Tegnér
Transformer-based Live Update Generation for Soccer Matches from Microblog Posts
Masashi Oshika, Kosuke Yamada, Ryohei Sasano, Koichi Takeda
Using Artificial French Data to Understand the Emergence of Gender Bias in Transformer Language Models
Lina Conti, Guillaume Wisniewski
What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies
Amit Gajbhiye, Zied Bouraoui, Na Li, Usashi Chatterjee, Luis Espinosa-Anke, Steven Schockaert
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Wang, Miguel Eckstein, William Wang
Polyglot or Not? Measuring Multilingual Encyclopedic Knowledge in Foundation Models
Tim Schott, Daniel Furman, Shreshta Bhat
Anchoring Fine-tuning of Sentence Transformer with Semantic Label Information for Efficient Truly Few-shot Classification
Amalie Pauli, Leon Derczynski, Ira Assent
Data Similarity is Not Enough to Explain Language Model Performance
Gregory Yauney, Emily Reif, David Mimno
Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection
Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli
mAggretriever: A Simple yet Effective Approach to Zero-Shot Multilingual Dense Retrieval
Sheng-Chieh Lin, Amin Ahmad, Jimmy Lin
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen
VECHR: A Dataset for Explainable and Robust Classification of Vulnerability Type in the European Court of Human Rights
Shanshan Xu, Leon Staufer, Santosh T.Y.S.S, Oana Ichim, Corina Heri, Matthias Grabmair
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
Haikang Deng, Colin Raffel
Cabbage Sweeter than Cake? Analysing the Potential of Large Language Models for Learning Conceptual Spaces
Usashi Chatterjee, Amit Gajbhiye, Steven Schockaert
Large-scale similarity search with Optimal Transport
Cléa Laouar, Yuki Takezawa, Makoto Yamada
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning
Jaemin Shin, Hyungjun Yoon, Seungjoo Lee, Sungjoon Park, Yunxin Liu, Jinho Choi, Sung-Ju Lee
Simplicity Level Estimate (SLE): A Learned Reference-Less Metric for Sentence Simplification
Liam Cripwell, Joël Legrand, Claire Gardent
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems
Marek Kadlčík, Michal Štefánik, Ondrej Sotolar, Vlastimil Martinek
CoF-CoT: Enhancing Large Language Models with Coarse-to-Fine Chain-of-Thought Prompting for Multi-domain NLU Tasks
Hoang Nguyen, Ye Liu, Chenwei Zhang, Tao Zhang, Philip Yu
Select, Prompt, Filter: Distilling Large Language Models for Summarizing Conversations
Minh-Quang Pham, Sathish Indurthi, Shamil Chollampatt, Marco Turchi
Human Raters Cannot Distinguish English Translations from Original English Texts
Shira Wein
Faster Minimum Bayes Risk Decoding with Confidence-based Pruning
Julius Cheng, Andreas Vlachos
Revisiting Sparse Retrieval for Few-shot Entity Linking
Yulin Chen, Zhenran Xu, Baotian Hu, Min Zhang
Context Compression for Auto-regressive Transformers with Sentinel Tokens
Siyu Ren, Qi Jia, Kenny Zhu
Set Learning for Generative Information Extraction
Jiangnan Li, Yice Zhang, Bin Liang, Kam-Fai Wong, Ruifeng Xu
Token Prediction as Implicit Classification to Identify LLM-Generated Text
Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj
On Evaluation of Bangla Word Analogies
Mousumi Akter, Souvika Sarkar, Shubhra Kanti Karmaker Santu
Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents
Jannis Vamvas, Rico Sennrich
CLAIR: Evaluating Image Captions with Large Language Models
David Chan, Suzanne Petryk, Joseph Gonzalez, Trevor Darrell, John Canny
Poisoning Retrieval Corpora by Injecting Adversarial Passages
Zexuan Zhong, Ziqing Huang, Alexander Wettig, Danqi Chen
Clustering Pseudo Language Family in Multilingual Translation Models with Fisher Information Matrix
Xinyu Ma, Xuebo Liu, Min Zhang
SUT: Active Defects Probing for Transcompiler Models
Mengnan Qi, Yufan Huang, Maoquan Wang, Yongqiang Yao, Zihan Liu, Bin Gu, Colin Clement, Neel Sundaresan
This Reads Like That: Deep Learning for Interpretable Natural Language Processing
Claudio Fanconi, Moritz Vandenhirtz, Severin Husmann, Julia Vogt
SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts
Joon-Young Choi, Junho Kim, Jun-Hyung Park, Wing-Lam Mok, SangKeun Lee
Outlier Dimensions Encode Task Specific Knowledge
William Rudman, Catherine Chen, Carsten Eickhoff
Self-Ensemble of $N$-best Generation Hypotheses by Lexically Constrained Decoding
Ryota Miyano, Tomoyuki Kajiwara, Yuki Arase
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts
Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao
Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism
Mengyu Ye, Tatsuki Kuribayashi, Jun Suzuki, Goro Kobayashi, Hiroaki Funayama
A Simple Baseline for Knowledge-Based Visual Question Answering
Alexandros Xenos, Themos Stafylakis, Ioannis Patras, Georgios Tzimiropoulos
Unveiling the Essence of Poetry: Introducing a Comprehensive Dataset and Benchmark for Poem Summarization
Ridwan Mahbub, Ifrad Khan, Samiha Anuva, Md Shahriar, Md Tahmid Rahman Laskar, Sabbir Ahmed
CoRec: An Easy Approach for Coordination Recognition
Qing Wang, Haojie Jia, Wenfei Song, Qi Li
FinEntity: Entity-level Sentiment Classification for Financial Texts
Yixuan Tang, Yi Yang, Allen Huang, Andy Tam, Justin Tang
Rationale-Enhanced Language Models are Better Continual Relation Learners
Weimin Xiong, Yifan Song, Peiyi Wang, Sujian Li
Inverse Scaling Can Become U-Shaped
Jason Wei, Najoung Kim, Yi Tay, Quoc Le
ScdNER: Span-Based Consistency-Aware Document-Level Named Entity Recognition
Ying Wei, Qi Li
NormDial: A Comparable Bilingual Synthetic Dialog Dataset for Modeling Social Norm Adherence and Violation
Oliver Li, Mallika Subramanian, Arkadiy Saakyan, Sky CH-Wang, Smaranda Muresan
ClimateBERT-NetZero: Detecting and Assessing Net Zero and Reduction Targets
Tobias Schimanski, Julia Bingler, Mathias Kraus, Camilla Hyslop, Markus Leippold
An Attribution Method for Siamese Encoders
Lucas Moeller, Dmitry Nikolaev, Sebastian Padó
Are Compressed Language Models Less Subgroup Robust?
Leonidas Gee, Andrea Zugarini, Novi Quadrianto
Length Does Matter: Summary Length can Bias Summarization Metrics
Xiaobo Guo, Soroush Vosoughi
Fine-grained Medical Vision-Language Representation Learning for Radiology Report Generation
Siyuan Wang, Bo Peng, Yichao Liu, Qi Peng
Automated Fact-Checking in Dialogue: Are Specialized Models Needed?
Eric Chamoun, Marzieh Saeidi, Andreas Vlachos
Assessing the influence of attractor-verb distance on grammatical agreement in humans and language models
Christos Zacharopoulos, Théo Desbordes, Mathias Sablé-Meyer
To Split or Not to Split: Composing Compounds in Contextual Vector Spaces
Christopher Jenkins, Filip Miletić, Sabine Schulte im Walde
TaskDiff: A Similarity Metric for Task-Oriented Conversations
Ankita Bhaumik, Praveen Venkateswaran, Yara Rizk, Vatche Isahagian
A Benchmark for Reasoning with Spatial Prepositions
Iulia Comsa, Srini Narayanan
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu, Alexander Fabbri, Yilun Zhao, Pengfei Liu, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Steven Wang, Antoine Scardigli, Leonard Tang, Wei Chen, Dmitry Levkin, Anya Chen, Spencer Ball, Thomas Woodside, Oliver Zhang, Dan Hendrycks
PK-ICR: Persona-Knowledge Interactive Multi-Context Retrieval for Grounded Dialogue
Minsik Oh, Joosung Lee, Jiwei Li, Guoyin Wang
A Self-training Framework for Automated Medical Report Generation
Siyuan Wang, Zheng Liu, Bo Peng
A Picture is Worth a Thousand Words: Language Models Plan from Pixels
Anthony Liu, Lajanugen Logeswaran, Sungryull Sohn, Honglak Lee
Relation-aware Ensemble Learning for Knowledge Graph Embedding
Ling Yue, Yongqi Zhang, Quanming Yao, Yong Li, Xian Wu, Ziheng Zhang, Zhenxi Lin, Yefeng Zheng
When Reviewers Lock Horns: Finding Disagreements in Scientific Peer Reviews
Sandeep Kumar, Tirthankar Ghosal, Asif Ekbal