Findings

Long Papers

Multi Document Summarization Evaluation in the Presence of Damaging Content
Avshalom Manevich, David Carmel, Nachshon Cohen, Elad Kravi, Ori Shapira

Guiding AMR Parsing with Reverse Graph Linearization
Bofei Gao, Liang Chen, Peiyi Wang, Zhifang Sui, Baobao Chang

Translate the Beauty in Songs: Jointly Learning to Align Melody and Translate Lyrics
Chengxi Li, Kai Fan, Jiajun Bu, Boxing Chen, Zhongqiang Huang, Zhi Yu

Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users
Yash Madhani, Sushane Parthan, Priyanka Bedekar, Gokul NC, Ruchi Khapra, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Khapra

Pretraining Without Attention
Junxiong Wang, Jing Yan, Albert Gu, Alexander Rush

EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics
Daniil Larionov, Jens Grünwald, Christoph Leiter, Steffen Eger

Unsupervised Opinion Summarization Using Approximate Geodesics
Somnath Basu Roy Chowdhury, Nicholas Monath, Kumar Dubey, Amr Ahmed, Snigdha Chaturvedi

Investigating the Frequency Distortion of Word Embeddings and Its Impact on Bias Metrics
Francisco Valentini, Juan Sosa, Diego Slezak, Edgar Altszyler

Improving Classifier Robustness through Active Generative Counterfactual Data Augmentation
Ananth Balashankar, Xuezhi Wang, Yao Qin, Ben Packer, Nithum Thain, Ed Chi, Jilin Chen, Alex Beutel

Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study
Injy Hamed, Nizar Habash, Thang Vu

Self-distilled Transitive Instance Weighting for Denoised Distantly Supervised Relation Extraction
Xiangyu Lin, Weijia Jia, Zhiguo Gong

MWE as WSD: Solving Multiword Expression Identification with Word Sense Disambiguation
Joshua Tanner, Jacob Hoffman

Dual Contrastive Learning Framework for Incremental Text Classification
Yigong Wang, Zhuoyi Wang, Yu Lin, Jinghui Guo, Sadaf Halim, Latifur Khan

Reference Free Domain Adaptation for Translation of Noisy Questions with Question Specific Rewards
Baban Gain, Ramakrishna Appicharla, Soumya Chennabasavaraj, Nikesh Garera, Asif Ekbal, Muthusamy Chelliah

Filtered Semi-Markov CRF
Urchade Zaratiana, Nadi Tomeh, Niama El Khbir, Pierre Holat, Thierry Charnois

Data Pruning for Efficient Model Pruning in Neural Machine Translation
Abdul Azeemi, Ihsan Qazi, Agha Raza

Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models
Arya McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Ke Wu

Re-Temp: Relation-Aware Temporal Representation Learning for Temporal Knowledge Graph Completion
Kunze Wang, Caren Han, Josiah Poon

Lexical Entrainment for Conversational Systems
Zhengxiang Shi, Procheta Sen, Aldo Lipani

Code Search Debiasing: Improve Search Results beyond Overall Ranking Performance
Sheng Zhang, Hui Li, Yanlin Wang, Zhao Wei, Yong Xu, Juhong Wang, Rongrong Ji

AutoReply: Detecting Nonsense in Dialogue with Discriminative Replies
Weiyan Shi, Emily Dinan, Adi Renduchintala, Daniel Fried, Athul Jacob, Zhou Yu, Mike Lewis

Follow-on Question Suggestion via Voice Hints for Voice Assistants
Besnik Fetahu, Pedro Faustini, Anjie Fang, Giuseppe Castellucci, Oleg Rokhlenko, Shervin Malmasi

Bidirectional Masked Self-attention and N-gram Span Attention for Constituency Parsing
Soohyeong Kim, Whanhee Cho, Minji Kim, Yong Choi

CR-COPEC: Causal Rationale of Corporate Performance Changes to learn from Financial Reports
Ye Chun, Sunjae Kwon, Kyunghwan Sohn, Nakwon Sung, Junyoup Lee, Byoung Seo, Kevin Compher, Seung-won Hwang, Jaesik Choi

Plausibility Processing in Transformer Language Models: Focusing on the Role of Attention Heads in GPT
Soo Ryu

Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
Philip Gorinski, Matthieu Zimmer, Gerasimos Lampouras, Derrick Goh Xin Deik, Ignacio Iacobacci

Unlocking the Heterogeneous Landscape of Big Data NLP with DUUI
Alexander Leonhardt, Giuseppe Abrami, Daniel Baumartz, Alexander Mehler

Towards Agile Text Classifiers for Everyone
Maximilian Mozes, Jessica Hoffmann, Katrin Tomanek, Muhamed Kouate, Nithum Thain, Ann Yuan, Tolga Bolukbasi, Lucas Dixon

Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good
Fernando Adauto, Zhijing Jin, Bernhard Schölkopf, Tom Hope, Mrinmaya Sachan, Rada Mihalcea

PAXQA: Generating Cross-lingual Question Answering Examples at Training Scale
Bryan Li, Chris Callison-Burch

Sharing, Teaching and Aligning: Knowledgeable Transfer Learning for Cross-Lingual Machine Reading Comprehension
Tingfeng Cao, Chengyu Wang, Chuanqi Tan, Jun Huang, Jinhui Zhu

BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification
Dmitri Roussinov, Serge Sharoff

Toward Stronger Textual Attack Detectors
Pierre Colombo, Marine Picot, Nathan Noiry, Guillaume Staerman, Pablo Piantanida

MEAL: Stable and Active Learning for Few-Shot Prompting
Abdullatif Köksal, Timo Schick, Hinrich Schuetze

Structure and Label Constrained Data Augmentation for Cross-domain Few-shot NER
Jingyi Zhang, Ying Zhang, Yufeng Chen, Jinan Xu

Weakly-supervised Deep Cognate Detection Framework for Low-Resourced Languages Using Morphological Knowledge of Closely-Related Languages
Koustava Goswami, Priya Rani, Theodorus Fransen, John McCrae

Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks
Xinsong Zhang, Yan Zeng, Jipeng Zhang, Hang Li

Pass-Tuning: Towards Structure-Aware Parameter-Efficient Tuning for Code Representation Learning
Nuo Chen, Qiushi Sun, Jianing Wang, Xiang Li, Ming Gao

Counterfactual Augmentation for Multimodal Learning Under Presentation Bias
Victoria Lin, Louis-Philippe Morency, Dimitrios Dimitriadis, Srinagesh Sharma

A Table-to-Text Framework with Heterogeneous Multidominance Attention and Self-Evaluated Multi-Pass Deliberation
Xi Chen, Xinjiang Lu, Haoran Xin, Wenjun Peng, Haoyang Duan, Feihu Jiang, Jingbo Zhou, Hui Xiong

Video-Text Retrieval by Supervised Sparse Multi-Grained Learning
Yimu Wang, Peng Shi

Zero-Shot-BERT-Adapters: a Zero-Shot Pipeline for Unknown Intent Detection
Daniele Comi, Dimitrios Christofidellis, Pier Piazza, Matteo Manica

ReFSQL: A Retrieval-Augmentation Framework for Text-to-SQL Generation
Kun Zhang, Xiexiong Lin, Yuanzhuo Wang, Xin Zhang, Fei Sun, Cen Jianhe, Hexiang Tan, Xuhui Jiang, Huawei Shen

Approximating Two-Layer Feedforward Networks for Efficient Transformers
Róbert Csordás, Kazuki Irie, Jürgen Schmidhuber

Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer
Zhiqiang Hu, Nancy Chen, Roy Lee

Solving the Right Problem is Key for Translational NLP: A Case Study in UMLS Vocabulary Insertion
Bernal Gutierrez, Yuqing Mao, Vinh Nguyen, Kin Fung, Yu Su, Olivier Bodenreider

Improving Cross-lingual Transfer through Subtree-aware Word Reordering
Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

Novel Slot Detection With an Incremental Setting
Chen Liang, Hongliang Li, Changhao Guan, Qingbin Liu, Jian Liu, Jinan Xu, Zhe Zhao

Automatic Model Selection with Large Language Models for Reasoning
James Zhao, Yuxi Xie, Kenji Kawaguchi, Junxian He, Michael Xie

ARKitSceneRefer: Text-based Localization of Small Objects in Diverse Real-World 3D Indoor Scenes
Shunya Kato, Shuhei Kurita, Chenhui Chu, Sadao Kurohashi

Improving Question Generation with Multi-level Content Planning
Zehua Xia, Qi Gou, Bowen Yu, Haiyang Yu, Fei Huang, Yongbin Li, Nguyen Cam-Tu

DelucionQA: Detecting Hallucinations in Domain-specific Question Answering
Mobashir Sadat, Zhengyu Zhou, Lukas Lange, Jun Araki, Arsalan Gundroo, Bingqing Wang, Rakesh Menon, Md Parvez, Zhe Feng

InvGC: Robust Cross-Modal Retrieval by Inverse Graph Convolution
Xiangru Jian, Yimu Wang

MixTEA: Semi-supervised Entity Alignment with Mixture Teaching
Feng Xie, Xin Song, Xiang Zeng, Xuechen Zhao, Lei Tian, Bin Zhou, Yusong Tan

Boot and Switch: Alternating Distillation for Zero-Shot Dense Retrieval
Fan Jiang, Qiongkai Xu, Tom Drummond, Trevor Cohn

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Shuhuai Ren, Sishuo Chen, Shicheng Li, Xu Sun, Lu Hou

Fusing Temporal Graphs into Transformers for Time-Sensitive Question Answering
Xin Su, Phillip Howard, Nagib Hakim, Steven Bethard

The Internal State of an LLM Knows When It’s Lying
Amos Azaria, Tom Mitchell

Factual Relation Discrimination for Factuality-oriented Abstractive Summarization
Zhiguang Gao, Peifeng Li, Feng Jiang, Xiaomin Chu, Qiaoming Zhu

Multi-Modal Knowledge Graph Transformer Framework for Multi-Modal Entity Alignment
Qian Li, Cheng Ji, Shu Guo, Zhaoji Liang, Lihong Wang, Jianxin Li

Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
Ziqiao Ma, Jacob Sansom, Run Peng, Joyce Chai

Text Augmented Spatial Aware Zero-shot Referring Image Segmentation
Yucheng Suo, Linchao Zhu, Yi Yang

IRFL: Image Recognition of Figurative Language
Ron Yosef, Yonatan Bitton, Dafna Shahaf

Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization
Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang

An Adaptive Prompt Generation Framework for Task-oriented Dialogue System
Jun Gao, Liuyu Xiang, Huijia Wu, Han Zhao, Yiqi Tong, Zhaofeng He

Temporal Knowledge Graph Reasoning Based on N-tuple Modeling
Zhongni Hou, Xiaolong Jin, Zixuan Li, Long Bai, Saiping Guan, Yutao Zeng, Jiafeng Guo, Xueqi Cheng

Make Your Decision Convincing! A Unified Two-Stage Framework: Self-Attribution and Decision-Making
Yanrui Du, Sendong Zhao, Haochun Wang, Yuhan Chen, Rui Bai, Zewen Qiang, Muzhen Cai, Bing Qin

Adaptive Structure Induction for Aspect-based Sentiment Analysis with Spectral Perspective
Hao Niu, Yun Xiong, Xiaosu Wang, Wenjing Yu, Yao Zhang, Zhonglei Guo

NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Peter West, Ronan Bras, Taylor Sorensen, Bill Lin, Liwei Jiang, Ximing Lu, Khyathi Chandu, Jack Hessel, Ashutosh Baheti, Chandra Bhagavatula, Yejin Choi

In-Context Demonstration Selection with Cross Entropy Difference
Dan Iter, Reid Pryzant, Ruochen Xu, Shuohang Wang, Yang Liu, Yichong Xu, Chenguang Zhu

Can ChatGPT Assess Human Personalities? A General Evaluation Framework
Haocong Rao, Cyril Leung, Chunyan Miao

MoqaGPT: Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model
Le Zhang, Yihong Wu, Fengran Mo, Jian-Yun Nie, Aishwarya Agrawal

Large Language Models Know Your Contextual Search Intent: A Prompting Framework for Conversational Search
Kelong Mao, Zhicheng Dou, Fengran Mo, Jiewen Hou, Haonan Chen, Hongjin Qian

Toxicity in ChatGPT: Analyzing persona-assigned language models
Ameet Deshpande, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan

Execution-Based Evaluation for Open-Domain Code Generation
Zhiruo Wang, Shuyan Zhou, Daniel Fried, Graham Neubig

Syntax-Aware Retrieval Augmented Code Generation
Xiangyu Zhang, Yu Zhou, Guang Yang, Taolue Chen

Selecting Key Views for Zero-Shot Entity Linking
Xuhui Sui, Ying Zhang, Kehui Song, Baohang Zhou, Xiaojie Yuan, Wensheng Zhang

Improving the Robustness of Summarization Models by Detecting and Removing Input Noise
Kundan Krishna, Yao Zhao, Jie Ren, Balaji Lakshminarayanan, Jiaming Luo, Mohammad Saleh, Peter Liu

How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts
Tharindu Kumarage, Paras Sheth, Raha Moraffah, Joshua Garland, Huan Liu

Knowledge is a Region in Weight Space for Fine-tuned Language Models
Almog Gueta, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen

Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance
Pritam Kadasi, Mayank Singh

On the Risk of Misinformation Pollution with Large Language Models
Yikang Pan, Liangming Pan, Wenhu Chen, Preslav Nakov, Min-Yen Kan, William Wang

Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
El Moatez Billah Nagoudi, AbdelRahim Elmadany, Ahmed El-Shangiti, Muhammad Abdul-Mageed

Hierarchical Enhancement Framework for Aspect-based Argument Mining
Yujie Fu, Yang Li, Suge Wang, Xiaoli Li, Deyu Li, Jian Liao, JianXing Zheng

MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models
Yifan Wei, Yisong Su, Huanhuan Ma, Xiaoyan Yu, Fangyu Lei, Yuanzhe Zhang, Jun Zhao, Kang Liu

What Makes Chain-of-Thought Prompting Effective? A Counterfactual Study
Aman Madaan, Katherine Hermann, Amir Yazdanbakhsh

An Empirical Study of Multimodal Model Merging
Yi-Lin Sung, Linjie Li, Kevin Lin, Zhe Gan, Mohit Bansal, Lijuan Wang

Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document
Xiangnan Chen, Qian Xiao, Juncheng Li, Duo Dong, Jun Lin, Xiaozhong Liu, Siliang Tang

Learning to Compose Representations of Different Encoder Layers towards Improving Compositional Generalization
Lei Lin, Shuangtao Li, Yafang Zheng, Biao Fu, Shan Liu, Yidong Chen, Xiaodong Shi

SelectNoise: Unsupervised Noise Injection to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
Maharaj Brahma, Kaushal Maurya, Maunendra Desarkar

Breaking Boundaries in Retrieval Systems: Unsupervised Domain Adaptation with Denoise-Finetuning
Che Chen, Ching Yang, Chun-Yi Lin, Hung-Yu Kao

Simpler neural networks prefer subregular languages
Charles Torres, Richard Futrell

R³ Prompting: Review, Rephrase and Resolve for Chain-of-Thought Reasoning in Large Language Models under Noisy Context
Qingyuan Tian, Hanlun Zhu, Lei Wang, Yang Li, Yunshi Lan

Quality Estimation-Assisted Automatic Post-Editing
Sourabh Deoghare, Diptesh Kanojia, Fred Blain, Tharindu Ranasinghe, Pushpak Bhattacharyya

Self-Supervised Rule Learning to Link Text Segments to Relational Elements of Structured Knowledge
Shajith Ikbal, Udit Sharma, Hima Karanam, Sumit Neelam, Ronny Luss, Dheeraj Sreedhar, Pavan Kapanipathi, Naweed Khan, Kyle Erwin, Ndivhuwo Makondo, Ibrahim Abdelaziz, Achille Fokoue, Alexander Gray, Maxwell Crouse, Subhajit Chaudhury, Chitra Subramanian

TaTA: A Multilingual Table-to-Text Dataset for African Languages
Sebastian Gehrmann, Sebastian Ruder, Vitaly Nikolaev, Jan Botha, Michael Chavinda, Ankur Parikh, Clara Rivera

Explain-then-translate: an analysis on improving program translation with self-generated explanations
Zilu Tang, Mayank Agarwal, Alexander Shypula, Bailin Wang, Derry Wijaya, Jie Chen, Yoon Kim

DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction
Gang Zhao, Xiaocheng Gong, Xinjie Yang, Guanting Dong, Shudong Lu, Si Li

GLGR: Question-aware Global-to-Local Graph Reasoning for Multi-party Dialogue Reading Comprehension
Yanling Li, Bowei Zou, Yifan Fan, Xibo Li, Ai Ti Aw, Yu Hong

Towards Mitigating LLM Hallucination via Self Reflection
Ziwei Ji, Tiezheng Yu, Yan Xu, Nayeon Lee, Etsuko Ishii, Pascale Fung

Making Body Movement in Sign Language Corpus Accessible for Linguists and Machines with Three-Dimensional Normalization of MediaPipe
Victor Skobov, Mayumi Bono

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder, Jonathan Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson, Dmitry Panteleev, Partha Talukdar

DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models
Shengguang Wu, Mei Yuan, Qi Su

DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias
Mahdi Zakizadeh, Kaveh Miandoab, Mohammad Pilehvar

ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination
Dongfang Li, Jindi Yu, Baotian Hu, Zhenran Xu, Min Zhang

CLASS: A Design Framework for Building Intelligent Tutoring Systems Based on Learning Science principles
Shashank Sonkar, Naiming Liu, Debshila Mallick, Richard Baraniuk

Normal-Abnormal Decoupling Memory for Medical Report Generation
Guosheng Zhao, Yan Yan, Zijian Zhao

mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Jonas Pfeiffer, Francesco Piccinno, Massimo Nicosia, Xinyi Wang, Machel Reid, Sebastian Ruder

ImageNetVC: Zero- and Few-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories
Heming Xia, Qingxiu Dong, Lei Li, Jingjing Xu, Tianyu Liu, Ziwei Qin, Zhifang Sui

A Query-Parallel Machine Reading Comprehension Framework for Low-resource NER
Yuhao Zhang, Yongliang Wang

BiSPN: Generating Entity Set and Relation Set Coherently in One Pass
Yuxin He, Buzhou Tang

MEEP: Is this Engaging? Prompting Large Language Models for Dialogue Evaluation in Multilingual Settings
Amila Ferron, Amber Shore, Ekata Mitra, Ameeta Agrawal

LLMDet: A Third Party Large Language Models Generated Text Detection Tool
Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng, Tat-Seng Chua

RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning
Wenjun Hou, Yi Cheng, Kaishuai Xu, Wenjie Li, Jiang Liu

Causal Intervention for Abstractive Related Work Generation
Jiachang Liu, Qi Zhang, Chongyang Shi, Usman Naseem, Shoujin Wang, Liang Hu, Ivor Tsang

G-SPEED: General SParse Efficient Editing MoDel
Haoke Zhang, Yue Wang, Juntao Li, Xiabing Zhou, Min Zhang

Attack Prompt Generation for Red Teaming and Defending Large Language Models
Boyi Deng, Wenjie Wang, Fuli Feng, Yang Deng, Qifan Wang, Xiangnan He

Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning
Yougang Lyu, Jitai Hao, Zihan Wang, Kai Zhao, Shen Gao, Pengjie Ren, Zhumin Chen, Fang Wang, Zhaochun Ren

Interpreting Indirect Answers to Yes-No Questions in Multiple Languages
Zijie Wang, Md Hossain, Shivam Mathur, Terry Melo, Kadir Ozler, Keun Park, Jacob Quintero, MohammadHossein Rezaei, Shreya Shakya, Md Uddin, Eduardo Blanco

Generalizing Few-Shot Named Entity Recognizers to Unseen Domains with Type-Related Features
Zihan Wang, Ziqi Zhao, Zhumin Chen, Pengjie Ren, Maarten de Rijke, Zhaochun Ren

Intervention-Based Alignment of Code Search with Execution Feedback
Hojae Han, Minsoo Kim, Seung-won Hwang, Nan Duan, Shuai Lu

Enhancing Neural Machine Translation with Semantic Units
Langlin Huang, Shuhao Gu, Zhang Zhuocheng, Yang Feng

DRAFT: Dense Retrieval Augmented Few-shot Topic classifier Framework
Keonwoo Kim, Younggun Lee

A Framework for Exploring Player Perceptions of LLM-Generated Dialogue in Commercial Video Games
Nader Akoury, Qian Yang, Mohit Iyyer

Generative Calibration for In-context Learning
Zhongtao Jiang, Yuanzhe Zhang, Cao Liu, Jun Zhao, Kang Liu

Chain of Thought with Explicit Evidence Reasoning for Few-shot Relation Extraction
Xilai Ma, Jing Li, Min Zhang

Manipulating the Perceived Personality Traits of Language Models
Graham Caron, Shashank Srivastava

WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia
Sina Semnani, Violet Yao, Heidi Zhang, Monica Lam

Automated Few-Shot Classification with Instruction-Finetuned Language Models
Rami Aly, Xingjian Shi, Kaixiang Lin, Aston Zhang, Andrew Wilson

Meta-Learning of Prompt Generation for Lightweight Prompt Engineering on Language-Model-as-a-Service
Hyeonmin Ha, Jihye Lee, Wookje Han, Byung-Gon Chun

Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies after Structure Abduction
Siyu Yuan, Jiangjie Chen, Xuyang Ge, Yanghua Xiao, Deqing Yang

HiCL: Hierarchical Contrastive Learning of Unsupervised Sentence Embeddings
Zhuofeng Wu, Chaowei Xiao, VG Vinod Vydiswaran

Density-Aware Prototypical Network for Few-Shot Relation Classification
Jianfeng Wu, Mengting Hu, Yike Wu, Bingzhe Wu, Yalan Xie, Mingming Liu, Renhong Cheng

RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling
Jingcheng Deng, Liang Pang, Huawei Shen, Xueqi Cheng

RefGPT: Dialogue Generation of GPT, by GPT, and for GPT
Dongjie Yang, Ruifeng Yuan, Yuantao Fan, Yifei Yang, Zili Wang, Shusen Wang, Hai Zhao

INA: An Integrative Approach for Enhancing Negotiation Strategies with Reward-Based Dialogue Agent
Zishan Ahmad, Suman Saurabh, Vaishakh Menon, Asif Ekbal, Roshni Ramnani, Anutosh Maitra

Large Language Models are Better Reasoners with Self-Verification
Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Shengping Liu, Bin Sun, Kang Liu, Jun Zhao

Accuracy is not enough: Evaluating Personalization in Summarizers
Rahul Vansh, Darsh Rank, Sourish Dasgupta, Tanmoy Chakraborty

Complexity-Guided Curriculum Learning for Text Graphs
Nidhi Vakil, Hadi Amiri

CoVariance-based Causal Debiasing for Entity and Relation Extraction
Lin Ren, Yongbin Liu, Yixin Cao, Chunping Ouyang

In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages
Asım Ersoy, Gerson Vizcarra, Tahsin Mayeesha, Benjamin Muller

MaXM: Towards Multilingual Visual Question Answering
Soravit Changpinyo, Linting Xue, Michal Yarom, Ashish Thapliyal, Idan Szpektor, Julien Amelot, Xi Chen, Radu Soricut

Efficient Latent Variable Modeling for Knowledge-Grounded Dialogue Generation
Gunsoo Han, Daejin Jo, Daniel Nam, Eunseop Yoon, Taehwan Kwon, Seungeun Rho, Kyoung-Woon On, Chang Yoo, Sungwoong Kim

Ask To The Point: Open-Domain Entity-Centric Question Generation
Yuxiang Liu, Jie Huang, Kevin Chang

Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning
Jinyuan Wang, Junlong Li, Hai Zhao

CASE: Commonsense-Augmented Score with an Expanded Answer Space
Wenkai Chen, Sahithya Ravi, Vered Shwartz

GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs
Yichuan Li, Kaize Ding, Kyumin Lee

Sources of Hallucination by Large Language Models on Inference Tasks
Nick McKenna, Tianyi Li, Liang Cheng, Mohammad Hosseini, Mark Johnson, Mark Steedman

Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer
Qingru Zhang, Dhananjay Ram, Cole Hawkins, Sheng Zha, Tuo Zhao

Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge
Jinyuan Li, Han Li, Zhuo Pan, Di Sun, Jiahao Wang, Wenkun Zhang, Gang Pan

Understanding HTML with Large Language Models
Izzeddin Gur, Ofir Nachum, Yingjie Miao, Mustafa Safdari, Austin Huang, Aakanksha Chowdhery, Sharan Narang, Noah Fiedel, Aleksandra Faust

The PEACE-Reviews dataset: Modeling Cognitive Appraisals in Emotion Text Analysis
Gerard Yeo, Kokil Jaidka

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Lin, Fei Huang

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
Wei Shen, Rui Zheng, Wenyu Zhan, Jun Zhao, Shihan Dou, Tao Gui, Qi Zhang, Xuanjing Huang

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
Ziyue Wang, Chi Chen, Peng Li, Yang Liu

Take a Closer Look at Multilinguality! Improve Multilingual Pre-Training Using Monolingual Corpora Only
Jinliang Lu, Yu Lu, Jiajun Zhang

LogiCoT: Logical Chain-of-Thought Instruction Tuning
Hanmeng Liu, Zhiyang Teng, Leyang Cui, Chaoli Zhang, Qiji Zhou, Yue Zhang

Reducing Spurious Correlations in Aspect-based Sentiment Analysis with Explanation from Large Language Models
Qianlong Wang, Keyang Ding, Bin Liang, Min Yang, Ruifeng Xu

High-quality argumentative information in low resources approaches improve counter-narrative generation
Damián Furman, Pablo Torres, José Rodríguez, Diego Letzen, Maria Martinez, Laura Alemany

A Reference-free Segmentation Quality Index (SegReFree)
Evan Lucas, Dylan Kangas, Timothy Havens

In-context Learning for Few-shot Multimodal Named Entity Recognition
Chenran Cai, Qianlong Wang, Bin Liang, Bing Qin, Min Yang, Kam-Fai Wong, Ruifeng Xu

Handshape-Aware Sign Language Recognition: Extended Datasets and Exploration of Handshape-Inclusive Methods
Xuan Zhang, Kevin Duh

SimCKP: Simple Contrastive Learning of Keyphrase Representations
Minseok Choi, Chaeheon Gwak, Seho Kim, Si Kim, Jaegul Choo

LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias Stürmer, Ilias Chalkidis

Simultaneous Machine Translation with Tailored Reference
Shoutao Guo, Shaolei Zhang, Yang Feng

Dynamic Voting for Efficient Reasoning in Large Language Models
Mingfeng Xue, Dayiheng Liu, Wenqiang Lei, Xingzhang Ren, Baosong Yang, Jun Xie, Yidan Zhang, Dezhong Peng, Jiancheng Lv

AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models
Siqi Ouyang, Lei Li

Measuring Faithful and Plausible Visual Grounding in VQA
Daniel Reich, Felix Putze, Tanja Schultz

Can you Summarize my learnings? Towards Perspective-based Educational Dialogue Summarization
Raghav Jain, Tulika Saha, Jhagrut Lalwani, Sriparna Saha

Adaptive Textual Label Noise Learning based on Pre-trained Models
Shaohuan Cheng, Wenyu Chen, Fu Mingsheng, Xuanting Xie, Hong Qu

Towards Informative Open-ended Text Generation with Dynamic Knowledge Triples
Zixuan Ren, Yang Zhao, Chengqing Zong

Novel Relation Detection: Discovering Unknown Relation Types via Multi-Strategy Self-Supervised Learning
Qingbin Liu, Yin Kung, Yanchao Hao, Dianbo Sui, Siyuan Cheng, Xi Chen, Ningyu Zhang, Jiaoyan Chen

Ask Language Model to Clean Your Noisy Translation Data
Quinten Bolding, Baohao Liao, Brandon Denis, Jun Luo, Christof Monz

Multi-User MultiWOZ: Task-Oriented Dialogues among Multiple Users
Yohan Jo, Xinyan Zhao, Arijit Biswas, Nikoletta Basiou, Vincent Auvray, Nikolaos Malandrakis, Angeliki Metallinou, Alexandros Potamianos

MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
Yuyan Chen, Zhihao Wen, Ge Fan, Zhengyu Chen, Wei Wu, Dayiheng Liu, Zhixu Li, Bang Liu, Yanghua Xiao

PsyCoT: Psychological Questionnaire as Powerful Chain-of-Thought for Personality Detection
Tao Yang, Tianyuan Shi, Fanqi Wan, Xiaojun Quan, Qifan Wang, Bingzhe Wu, Jiaxiang Wu

Harnessing the power of LLMs: Evaluating human-AI text co-creation through the lens of news headline generation
Zijian Ding, Alison Smith-Renner, Wenjuan Zhang, Joel Tetreault, Alejandro Jaimes

NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval
Uri Katz, Matan Vetzler, Amir Cohen, Yoav Goldberg

SWEET - Weakly Supervised Person Name Extraction for Fighting Human Trafficking
Javin Liu, Hao Yu, Vidya Sujaya, Pratheeksha Nair, Kellin Pelrine, Reihaneh Rabbany

Watermarking LLMs with Weight Quantization
Linyang Li, Botian Jiang, Pengyu Wang, Ke Ren, Hang Yan, Xipeng Qiu

Disentangling Extraction and Reasoning in Multi-hop Spatial Reasoning
Roshanak Mirzaee, Parisa Kordjamshidi

PsyAttention: Psychological Attention Model for Personality Detection
Baohua Zhang, Yongyi Huang, Wenyao Cui, Zhang Huaping, Jianyun Shang

RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training
Jaehyung Kim, Yuning Mao, Rui Hou, Hanchao Yu, Davis Liang, Pascale Fung, Qifan Wang, Fuli Feng, Lifu Huang, Madian Khabsa

Symbolization, Prompt, and Classification: A Framework for Implicit Speaker Identification in Novels
Yue Chen, Tianwei He, Hongbin Zhou, Jia-Chen Gu, Heng Lu, Zhen-Hua Ling

Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models
Gabriel Sarch, Yue Wu, Michael Tarr, Katerina Fragkiadaki

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
Hanchong Zhang, Ruisheng Cao, Lu Chen, Hongshen Xu, Kai Yu

Manifold-Preserving Transformers are Effective for Short-Long Range Encoding
Ayan Sengupta, Md Akhtar, Tanmoy Chakraborty

Detecting Syntactic Change with Pre-trained Transformer Models
Liwen Hou, David Smith

A Word Sense Distribution-based approach for Semantic Change Prediction
Xiaohang Tang, Yi Zhou, Taichi Aida, Procheta Sen, Danushka Bollegala

Gold: A Global and Local-aware Denoising Framework for Commonsense Knowledge Graph Noise Detection
Zheye Deng, Weiqi Wang, Zhaowei Wang, Xin Liu, Yangqiu Song

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation
Xi Wang, Hossein Rahmani, Jiqun Liu, Emine Yilmaz

Exploring Graph Pre-training for Aspect-based Sentiment Analysis
Xiaoyi Bao, Zhongqing Wang, Guodong Zhou

DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy Nguyen, See-Kiong Ng, Anh Luu

Methodological Insights in Detecting Subtle Semantic Shifts with Contextualized and Static Language Models
Sanne Hoeken, Özge Alacam, Antske Fokkens, Pia Sommerauer

Disfluent Cues for Enhanced Speech Understanding in Large Language Models
Morteza Rohanian, Farhad Nooralahzadeh, Omid Rohanian, David Clifton, Michael Krauthammer

Watermarking PLMs on Classification Tasks by Combining Contrastive Learning with Weight Perturbation
Chenxi Gu, Xiaoqing Zheng, Jianhan Xu, Muling Wu, Cenyuan Zhang, Chengsong Huang, Hua Cai, Xuanjing Huang

BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer
Sadia Afrin, Md. Shahad Mahmud Chowdhury, Md. Islam, Faisal Khan, Labib Chowdhury, Md. Mahtab, Nazifa Chowdhury, Massud Forkan, Neelima Kundu, Hakim Arif, Mohammad Mamun Or Rashid, Mohammad Amin, Nabeel Mohammed

Search Augmented Instruction Learning
Hongyin Luo, Tianhua Zhang, Yung-Sung Chuang, Yuan Gong, Yoon Kim, Xixin Wu, Helen Meng, James Glass

“Kelly is a Warm Person, Joseph is a Role Model”: Gender Biases in LLM-Generated Reference Letters
Yixin Wan, George Pu, Jiao Sun, Aparna Garimella, Kai-Wei Chang, Nanyun Peng

TextMixer: Mixing Multiple Inputs for Privacy-Preserving Inference
Xin Zhou, Yi Lu, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang

Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning
Aditi Chaudhary, Arun Sampath, Ashwin Sheshadri, Antonios Anastasopoulos, Graham Neubig

Allies: Prompting Large Language Model with Beam Search
Hao Sun, Xiao Liu, Yeyun Gong, Yan Zhang, Daxin Jiang, Linjun Yang, Nan Duan

Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
Liangming Pan, Alon Albalak, Xinyi Wang, William Wang

SiMFy: A Simple Yet Effective Approach for Temporal Knowledge Graph Reasoning
Zhengtao Liu, Lei Tan, Mengfan Li, Yao Wan, Hai Jin, Xuanhua Shi

Understanding Translationese in Cross-Lingual Summarization
Jiaan Wang, Fandong Meng, Yunlong Liang, Tingyi Zhang, Jiarong Xu, Zhixu Li, Jie Zhou

The Truth, The Whole Truth, and Nothing but the Truth: A New Benchmark Dataset for Hebrew Text Credibility Assessment
Ben Hagag, Reut Tsarfaty

Answer-state Recurrent Relational Network (AsRRN) for Constructed Response Assessment and Feedback Grouping
Zhaohui Li, Susan Lloyd, Matthew Beckman, Rebecca Passonneau

A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection
Shiping Yang, Renliang Sun, Xiaojun Wan

Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation
Heming Xia, Tao Ge, Peiyi Wang, Si-Qing Chen, Furu Wei, Zhifang Sui

APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection
Pei Wang, Keqing He, Yutao Mou, Xiaoshuai Song, Yanan Wu, Jingang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu

2INER: Instructive and In-Context Learning on Few-Shot Named Entity Recognition
Jiasheng Zhang, Xikai Liu, Xinyi Lai, Yan Gao, Shusen Wang, Yao Hu, Yiqing Lin

Generative Emotion Cause Triplet Extraction in Conversations with Commonsense Knowledge
Fanfan Wang, Jianfei Yu, Rui Xia

Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models
Sean Xie, Soroush Vosoughi, Saeed Hassanpour

GROVE: A Retrieval-augmented Complex Story Generation Framework with A Forest of Evidence
Zhihua Wen, Zhiliang Tian, Wei Wu, Yuxin Yang, Yanqi Shi, Zhen Huang, Dongsheng Li

KAPALM: Knowledge grAPh enhAnced Language Models for Fake News Detection
Jing Ma, Chen Chen, Chunyan Hou, Xiaojie Yuan

Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models
Sonia Murthy, Kiera Parece, Sophie Bridgers, Peng Qian, Tomer Ullman

InstructExcel: A Benchmark for Natural Language Instruction in Excel
Justin Payan, Swaroop Mishra, Mukul Singh, Carina Negreanu, Christian Poelitz, Chitta Baral, Subhro Roy, Rasika Chakravarthy, Benjamin Van Durme, Elnaz Nouri

ECHo: A Visio-Linguistic Dataset for Event Causality Inference via Human-Centric Reasoning
Yuxi Xie, Guanzhen Li, Min-Yen Kan

An Empirical Study of Instruction-tuning Large Language Models in Chinese
Qingyi Si, Tong Wang, Zheng Lin, Xu Zhang, Yanan Cao, Weiping Wang

Debiasing Multimodal Models via Causal Information Minimization
Vaidehi Patil, Adyasha Maharana, Mohit Bansal

Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis
Daniela Teodorescu, Saif Mohammad

Multi-step Jailbreaking Privacy Attacks on ChatGPT
Haoran Li, Dadi Guo, Wei Fan, Mingshi Xu, Jie Huang, Fanpu Meng, Yangqiu Song

Descriptive Prompt Paraphrasing for Target-Oriented Multimodal Sentiment Classification
Dan Liu, Lin Li, Xiaohui Tao, Jian Cui, Qing Xie

Joint Semantic and Strategy Matching for Persuasive Dialogue
Chuhao Jin, Yutao Zhu, Lingzhen Kong, Shijie Li, Xiao Zhang, Ruihua Song, Xu Chen, Huan Chen, Yuchong Sun, Yu Chen, Jun Xu

Non-Autoregressive Sentence Ordering
Yi Bin, Wenhao Shi, Bin Ji, Jipeng Zhang, Yujuan Ding, Yang Yang

Large Language Models are Not Yet Human-Level Evaluators for Abstractive Summarization
Chenhui Shen, Liying Cheng, Xuan-Phi Nguyen, Yang You, Lidong Bing

FREDSum: A Dialogue Summarization Corpus for French Political Debates
Virgile Rennard, Guokan Shang, Damien Grari, Julie Hunter, Michalis Vazirgiannis

Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
Zilong Wang, Jingbo Shang

Narrative Style and the Spread of Health Misinformation on Twitter
Achyutarama Ganti, Eslam Ali Hassan Hussein, Steven Wilson, Zexin Ma, Xinyan Zhao

HadSkip: Homotopic and Adaptive Layer Skipping of Pre-trained Language Models for Efficient Inference
Haoyu Wang, Yaqing Wang, Tianci Liu, Tuo Zhao, Jing Gao

Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models
Amirhossein Kazemnejad, Mehdi Rezagholizadeh, Prasanna Parthasarathi, Sarath Chandar

Non-compositional Expression Generation Based on Curriculum Learning and Continual Learning
Jianing Zhou, Ziheng Zeng, Hongyu Gong, Suma Bhat

Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution
Jaap Jumelet, Willem Zuidema

Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition
Xiaoshuai Song, Yutao Mou, Keqing He, Yueyan Qiu, Jinxu Zhao, Pei Wang, Weiran Xu

Frugal Prompting for Dialog Models
Bishal Santra, Sakya Basak, Abhinandan De, Manish Gupta, Pawan Goyal

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation
Mutian He, Philip Garner

MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space
Hanxing Ding, Liang Pang, Zihao Wei, Huawei Shen, Xueqi Cheng, Tat-Seng Chua

HPE: Answering Complex Questions over Text by Hybrid Question Parsing and Execution
Ye Liu, Semih Yavuz, Rui Meng, Dragomir Radev, Caiming Xiong, Shafiq Joty, Yingbo Zhou

Length-Adaptive Distillation: Customizing Small Language Model for Dynamic Token Pruning
Chang Liu, Chongyang Tao, Jianxin Liang, Jiazhan Feng, Tao Shen, Quzhe Huang, Dongyan Zhao

Toxicity, Morality, and Speech Act Guided Stance Detection
Apoorva Upadhyaya, Marco Fisichella, Wolfgang Nejdl

A Framework for Bidirectional Decoding: Case Study in Morphological Inflection
Marc Canby, Julia Hockenmaier

Text-guided 3D Human Generation from 2D Collections
Tsu-Jui Fu, Wenhan Xiong, Yixin Nie, Jingyu Liu, Barlas Oguz, William Wang

Statistically Profiling Biases in Natural Language Reasoning Datasets and Models
Shanshan Huang, Kenny Zhu

MUX-PLMs: Data Multiplexing for High-throughput Language Models
Vishvak Murahari, Ameet Deshpande, Carlos Jimenez, Izhak Shafran, Mingqiu Wang, Yuan Cao, Karthik Narasimhan

That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?
Jaechan Lee, Alisa Liu, Orevaoghene Ahia, Hila Gonen, Noah Smith

RobustEmbed: Robust Sentence Embeddings Using Self-Supervised Contrastive Pre-Training
Javad Asl, Eduardo Blanco, Daniel Takabi

Automatic Evaluation of Attribution by Large Language Models
Xiang Yue, Boshi Wang, Ziru Chen, Kai Zhang, Yu Su, Huan Sun

Modeling Highlighting of Metaphors in Multitask Contrastive Learning Paradigms
Meghdut Sengupta, Milad Alshomary, Ingrid Scharlau, Henning Wachsmuth

LDM²: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Xingjin Wang, Linjing Li, Daniel Zeng

ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
Wei-Lin Chen, An-Zi Yen, Cheng-Kuang Wu, Hen-Hsen Huang, Hsin-Hsi Chen

Mind the Gap: Automated Corpus Creation for Enthymeme Detection and Reconstruction in Learner Arguments
Maja Stahl, Nick Düsterhus, Mei-Hua Chen, Henning Wachsmuth

Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog Generation
Tianyu Yang, Thy Tran, Iryna Gurevych

Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Do Long, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty

The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Dung Nguyen, Le Nam, Anh Dau, Anh Nguyen, Khanh Nghiem, Jin Guo, Nghi Bui

Complex Event Schema Induction with Knowledge-Enriched Diffusion Model
Yupu Hao, Pengfei Cao, Yubo Chen, Kang Liu, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Jun Zhao

Exploiting Emotion-Semantic Correlations for Empathetic Response Generation
Zhou Yang, Zhaochun Ren, Wang Yufeng, Xiaofei Zhu, Zhihao Chen, Tiecheng Cai, Wu Yunbing, Yisong Su, Sibo Ju, Xiangwen Liao

Long-Range Language Modeling with Selective Cache
Xinting Huang, Nora Hollenstein

FaLA: Fast Linear Adaptation for Replacing Backbone Models on Edge Devices
Shuo Huang, Lizhen Qu, Xingliang Yuan, Chunyang Chen

Controllable Chest X-Ray Report Generation from Longitudinal Representations
Francesco Serra, Chaoyang Wang, Fani Deligianni, Jeff Dalton, Alison O’Neil

Is ChatGPT a Good Multi-Party Conversation Solver?
Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling

Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis
Jianqiao Lu, Wenyong Huang, Nianzu Zheng, Xingshan Zeng, Yu Yeung, Xiao Chen

Bipartite Graph Pre-training for Unsupervised Extractive Summarization with Graph Convolutional Auto-Encoders
Qianren Mao, Shaobo Zhao, Jiarui Li, Xiaolei Gu, Shizhu He, Bo Li, Jianxin Li

Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
Haeju Lee, Minchan Jeong, Se-Young Yun, Kee-Eung Kim

A Critical Analysis of Document Out-of-Distribution Detection
Jiuxiang Gu, Yifei Ming, Yi Zhou, Jason Kuen, Vlad Morariu, Handong Zhao, Ruiyi Zhang, Nikolaos Barmpalios, Anqi Liu, Yixuan Li, Tong Sun, Ani Nenkova

Improving Neural Machine Translation by Multi-Knowledge Integration with Prompting
Ke Wang, Jun Xie, Yuqi Zhang, Yu Zhao

Active Learning Principles for In-Context Learning with Large Language Models
Katerina Margatina, Timo Schick, Nikolaos Aletras, Jane Dwivedi-Yu

InteMATs: Integrating Granularity-Specific Multilingual Adapters for Cross-Lingual Transfer
Meizhen Liu, Xu Guo, He Jiakai, Jianye Chen, Fengyu Zhou, Siu Hui

PlugMed: Improving Specificity in Patient-Centered Medical Dialogue Generation using In-Context Learning
Chengfeng Dou, Zhi Jin, Wenpin Jiao, Haiyan Zhao, Yongqiang Zhao, Zhengwei Tao

CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation
Weixiang Yan, Yuchen Tian, Yunzhe Li, Qian Chen, Wen Wang

Impact of Sample Selection on In-Context Learning for Entity Extraction from Scientific Writing
Necva Bölücü, Maciej Rybinski, Stephen Wan

Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models
Luiza Pozzobon, Beyza Ermis, Patrick Lewis, Sara Hooker

Time-Considerable Dialogue Models via Reranking by Time Dependency
Yuiko Tsunomori, Masakazu Ishihata, Hiroaki Sugiyama

MPrompt: Exploring Multi-level Prompt Tuning for Machine Reading Comprehension
Guoxin Chen, Yiming Qian, Bowen Wang, Liangzhi Li

DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading
Hao Wang, Qingxuan Wang, Yue Li, Changqing Wang, Chenhui Chu, Rui Wang

Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs
Jiefeng Chen, Jinsung Yoon, Sayna Ebrahimi, Sercan Arik, Tomas Pfister, Somesh Jha

Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization
Shoujie Tong, Heming Xia, Damai Dai, Runxin Xu, Tianyu Liu, Binghuai Lin, Yunbo Cao, Zhifang Sui

ClozEx: A Task toward Generation of English Cloze Explanation
Zizheng Zhang, Masato Mita, Mamoru Komachi

Is Probing All You Need? Indicator Tasks as an Alternative to Probing Embedding Spaces
Tal Levy, Omer Goldman, Reut Tsarfaty

The Cost of Compression: Investigating the Impact of Compression on Parametric Knowledge in Language Models
Satya Sai Srinath Namburi, Makesh Sreedhar, Srinath Srinivasan, Frederic Sala

CoEdIT: Text Editing by Task-Specific Instruction Tuning
Vipul Raheja, Dhruv Kumar, Ryan Koo, Dongyeop Kang

Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection
Yi Dai, Hao Lang, Kaisheng Zeng, Fei Huang, Yongbin Li

DeltaScore: Fine-Grained Story Evaluation with Perturbations
Zhuohan Xie, Miao Li, Trevor Cohn, Jey Lau

MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields
Jiaying Lu, Yongchen Qian, Shifan Zhao, Yuanzhe Xi, Carl Yang

Black-Box Tuning of Vision-Language Models with Effective Gradient Approximation
Zixian Guo, Yuxiang Wei, Ming Liu, Zhilong Ji, Jinfeng Bai, Yiwen Guo, Wangmeng Zuo

How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical Survey
Jun Bai, Xiaofeng Zhang, Chen Li, Hanhua Hong, Xi Xu, Chenghua Lin, Wenge Rong

Licon: A Diverse, Controllable and Challenging Linguistic Concept Learning Benchmark
Shenglong Yu, Ying Zhang, Wenya Guo, Zhengkun Zhang, Ru Zhou, Xiaojie Yuan

InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations
Nils Feldhus, Qianli Wang, Tatiana Anikina, Sahil Chopra, Cennet Oguz, Sebastian Möller

Multimodal Automated Fact-Checking: A Survey
Mubashara Akhtar, Michael Schlichtkrull, Zhijiang Guo, Oana Cocarascu, Elena Simperl, Andreas Vlachos

PROTEGE: Prompt-based Diverse Question Generation from Web Articles
Vinayak Puranik, Anirban Majumder, Vineet Chaoji

Mulan: A Multi-Level Alignment Model for Video Question Answering
Yu Fu, Cong Cao, Yuling Yang, Yuhai Lu, Fangfang Yuan, Dakui Wang, Yanbing Liu

Decomposing Complex Queries for Tip-of-the-tongue Retrieval
Kevin Lin, Kyle Lo, Joseph Gonzalez, Dan Klein

Values, Ethics, Morals? On the Use of Moral Concepts in NLP Research
Karina Vida, Judith Simon, Anne Lauscher

Adapting Pretrained Text-to-Text Models for Long Text Sequences
Wenhan Xiong, Anchit Gupta, Shubham Toshniwal, Yashar Mehdad, Scott Yih

xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Chen Zhang, Luis D’Haro, Chengguang Tang, Ke Shi, Guohua Tang, Haizhou Li

MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems
Jakub Macina, Nico Daheim, Sankalan Chowdhury, Tanmay Sinha, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan

Towards Making the Most of ChatGPT for Machine Translation
Keqin Peng, Liang Ding, Qihuang Zhong, Li Shen, Xuebo Liu, Min Zhang, Yuanxin Ouyang, Dacheng Tao

Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets
Han Jiang, Rui Wang, Zhihua Wei, Yu Li, Xinpeng Wang

Disentangling Structure and Style: Political Bias Detection in News by Inducing Document Hierarchy
Jiwoo Hong, Yejin Cho, Jiyoung Han, Jaemin Jung, James Thorne

Measuring and Narrowing the Compositionality Gap in Language Models
Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah Smith, Mike Lewis

Unsupervised Candidate Answer Extraction through Differentiable Masker-Reconstructor Model
Zhuoer Wang, Yicheng Wang, Ziwei Zhu, James Caverlee

HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science
Yu Song, Santiago Miret, Huan Zhang, Bang Liu

Prompt-Based Editing for Text Style Transfer
Guoqing Luo, Yu Han, Lili Mou, Mauajama Firdaus

Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation
A. Seza Doğruöz, Sunayana Sitaram, Zheng Yong

Sparse Black-Box Multimodal Attack for Vision-Language Adversary Generation
Zhen Yu, Zhou Qin, Zhenhua Chen, Meihui Lian, Haojun Fu, Weigao Wen, Hui Xue, Kun He

Towards a Unified Framework for Reference Retrieval and Related Work Generation
Zhengliang Shi, Shen Gao, Zhen Zhang, Xiuying Chen, Zhumin Chen, Pengjie Ren, Zhaochun Ren

Visual Storytelling with Question-Answer Plans
Danyang Liu, Mirella Lapata, Frank Keller

Investigating Online Community Engagement through Stancetaking
Jai Aggarwal, Brian Diep, Julia Watson, Suzanne Stevenson

ASSERT: Automated Safety Scenario Red Teaming for Evaluating the Robustness of Large Language Models
Alex Mei, Sharon Levy, William Wang

Learning to Correct Noisy Labels for Fine-Grained Entity Typing via Co-Prediction Prompt Tuning
Minghao Tang, Yongquan He, Yongxiu Xu, Hongbo Xu, Wenyuan Zhang, Yang Lin

Co²PT: Mitigating Bias in Pre-trained Language Models through Counterfactual Contrastive Prompt Tuning
Xiangjue Dong, Ziwei Zhu, Zhuoer Wang, Maria Teleki, James Caverlee

A Hierarchical Encoding-Decoding Scheme for Abstractive Multi-document Summarization
Chenhui Shen, Liying Cheng, Xuan-Phi Nguyen, Yang You, Lidong Bing

Universal Domain Adaptation for Robust Handling of Distributional Shifts in NLP
Hyuhng Kim, Hyunsoo Cho, Sang-Woo Lee, Junyeob Kim, Choonghyun Park, Sang-goo Lee, Kang Yoo, Taeuk Kim

Aligning Language Models to User Opinions
EunJeong Hwang, Bodhisattwa Majumder, Niket Tandon

CCSRD: Content-Centric Speech Representation Disentanglement Learning for End-to-End Speech Translation
Xiaohu Zhao, Haoran Sun, Yikun Lei, Shaolin Zhu, Deyi Xiong

Miracle: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control
Zhenyi Lu, Wei Wei, Xiaoye Qu, Xian-Ling Mao, Dangyang Chen, Jixiong Chen

Towards Multilingual Interlinear Morphological Glossing
Shu Okabe, François Yvon

Transformer Working Memory Enables Regular Language Reasoning And Natural Language Length Extrapolation
Ta-Chung Chi, Ting-Han Fan, Alexander Rudnicky, Peter Ramadge

Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting
Fanghua Ye, Meng Fang, Shenghui Li, Emine Yilmaz

Distilling ChatGPT for Explainable Automated Student Answer Assessment
Jiazheng Li, Lin Gui, Yuxiang Zhou, David West, Cesare Aloisi, Yulan He

Grammatical Error Correction via Mixed-Grained Weighted Training
Jiahao Li, Quan Wang, Chiwei Zhu, Zhendong Mao, Yongdong Zhang

A Unified Framework for Synaesthesia Analysis
Kun Sheng, Zhongqing Wang, Qingqing Zhao, Xiaotong Jiang, Guodong Zhou

Visual Elements Mining as Prompts for Instruction Learning for Target-Oriented Multimodal Sentiment Classification
Bin Yang, Jinlong Li

NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models
Jongwoo Ko, Seungjoon Park, Yujin Kim, Sumyeong Ahn, Du-Seong Chang, Euijai Ahn, Se-Young Yun

GBT: Generative Boosting Training Approach for Paraphrase Identification
Rui Peng, Zhiling Jin, Yu Hong

DeCrisisMB: Debiased Semi-Supervised Learning for Crisis Tweet Classification via Memory Bank
Henry Zou, Yue Zhou, Weizhi Zhang, Cornelia Caragea

Probing LLMs for hate speech detection: strengths and vulnerabilities
Sarthak Roy, Ashish Harshvardhan, Animesh Mukherjee, Punyajoy Saha

From Simple to Complex: A Progressive Framework for Document-level Informative Argument Extraction
Quzhe Huang, Yanxi Zhang, Dongyan Zhao

MultiCMET: A Novel Chinese Benchmark for Understanding Multimodal Metaphor
Dongyu Zhang, Jingwei Yu, Senyuan Jin, Liang Yang, Hongfei Lin

GlotLID: Language Identification for Low-Resource Languages
Amir Kargaran, Ayyoob Imani, François Yvon, Hinrich Schuetze

Finding Support Examples for In-Context Learning
Xiaonan Li, Xipeng Qiu

Uncovering the Root of Hate Speech: A Dataset for Identifying Hate Instigating Speech
Hyoungjun Park, Ho Shim, Kyuhan Lee

Responsible AI Considerations in Text Summarization Research: A Review of Current Practices
Yu Liu, Meng Cao, Su Blodgett, Jackie Cheung, Alexandra Olteanu, Adam Trischler

Improving Speech Translation by Fusing Speech and Text
Wenbiao Yin, Zhicheng Liu, Chengqi Zhao, Tao Wang, Jian Tong, Rong Ye

Narrative Order Aware Story Generation via Bidirectional Pretraining Model with Optimal Transport Reward
Zhicong Lu, Li Jin, Guangluan Xu, Linmei Hu, Nayu Liu, Xiaoyu Li, Xian Sun, Zequn Zhang, Kaiwen Wei

Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models
Haoran Wang, Kai Shu

Efficient Continue Training of Temporal Language Model with Structural Information
Zhaochen Su, Juntao Li, Zikang Zhang, Zihan Zhou, Min Zhang

Retrieval-Augmented Parsing for Complex Graphs by Exploiting Structure and Uncertainty
Zi Lin, Quan Yuan, Panupong Pasupat, Jeremiah Liu, Jingbo Shang

When it Rains, it Pours: Modeling Media Storms and the News Ecosystem
Benjamin Litterer, David Jurgens, Dallas Card

Intra-Event and Inter-Event Dependency-Aware Graph Network for Event Argument Extraction
Hao Li, Yanan Cao, Yubing Ren, Fang Fang, Lanxue Zhang, Yingjie Li, Shi Wang

How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval
Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

Schema-adaptable Knowledge Graph Construction
Hongbin Ye, Honghao Gui, Xin Xu, Xi Chen, Huajun Chen, Ningyu Zhang

Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset
Haoyi Wu, Wenyang Hui, Yezeng Chen, Weiqi Wu, Kewei Tu, Yi Zhou

DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in Text
Shuaiyi Li, Yang Deng, Wai Lam

TK-KNN: A Balanced Distance-Based Pseudo Labeling Approach for Semi-Supervised Intent Classification
Nicholas Botzer, David Vazquez, Tim Weninger, Issam Laradji

Inductive Relation Inference of Knowledge Graph Enhanced by Ontology Information
Wentao Zhou, Jun Zhao, Tao Gui, Qi Zhang, Xuanjing Huang

Dynamic Stance: Modeling Discussions by Labeling the Interactions
Blanca Figueras, Irene Baucells, Tommaso Caselli

Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements
Yushan Qian, Weinan Zhang, Ting Liu

DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog
Xin Zheng, Tianyu Liu, Haoran Meng, Xu Wang, Yufan Jiang, Mengliang Rao, Binghuai Lin, Yunbo Cao, Zhifang Sui

Inverse Reinforcement Learning for Text Summarization
Yu Fu, Deyi Xiong, Yue Dong

MM-Reasoner: A Multi-Modal Knowledge-Aware Framework for Knowledge-Based Visual Question Answering
Mahmoud Khademi, Ziyi Yang, Felipe Frujeri, Chenguang Zhu

Toward Joint Language Modeling for Speech Units and Text
Ju-Chieh Chou, Chung-Ming Chien, Wei-Ning Hsu, Karen Livescu, Arun Babu, Alexis Conneau, Alexei Baevski, Michael Auli

From Chaos to Clarity: Claim Normalization to Empower Fact-Checking
Megha Sundriyal, Tanmoy Chakraborty, Preslav Nakov

Mitigating Biases in Hate Speech Detection from A Causal Perspective
Zhehao Zhang, Jiaao Chen, Diyi Yang

PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
Linyang Li, Ke Ren, Yunfan Shao, Pengyu Wang, Xipeng Qiu

InstructoR: Instructing Unsupervised Conversational Dense Retrieval with Large Language Models
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

The Iron(ic) Melting Pot: Reviewing Human Evaluation in Humour, Irony and Sarcasm Generation
Tyler Loakman, Aaron Maladry, Chenghua Lin

INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models
H S V N S Kowndinya Renduchintala, Krishnateja Killamsetty, Sumit Bhatia, Milan Aggarwal, Ganesh Ramakrishnan, Rishabh Iyer, Balaji Krishnamurthy

Towards General Error Diagnosis via Behavioral Testing in Machine Translation
Junjie Wu, Lemao Liu, Dit-Yan Yeung

Retrieval-Augmented Few-shot Text Classification
Guoxin Yu, Lemao Liu, Haiyun Jiang, Shuming Shi, Xiang Ao

Temporal Extrapolation and Knowledge Transfer for Lifelong Temporal Knowledge Graph Reasoning
Zhongwu Chen, Chengjin Xu, Fenglong Su, Zhen Huang, Yong Dou

The student becomes the master: Outperforming GPT3 on Scientific Factual Error Correction
Dhananjay Ashok, Atharva Kulkarni, Hai Pham, Barnabas Poczos

Leveraging Structured Information for Explainable Multi-hop Question Answering and Reasoning
Ruosen Li, Xinya Du

Hierarchical Catalogue Generation for Literature Review: A Benchmark
Kun Zhu, Xiaocheng Feng, Xiachong Feng, Yingsheng Wu, Bing Qin

MCC-KD: Multi-CoT Consistent Knowledge Distillation
Hongzhan Chen, Siyue Wu, Xiaojun Quan, Rui Wang, Ming Yan, Ji Zhang

An Empirical Study of Frame Selection for Text-to-Video Retrieval
Mengxia Wu, Min Cao, Yang Bai, Ziyin Zeng, Chen Chen, Liqiang Nie, Min Zhang

Conditional Natural Language Inference
Youngwoo Kim, Razieh Rahimi, James Allan

Contrastive Distant Supervision for Debiased and Denoised Machine Reading Comprehension
Ning Bian, Hongyu Lin, Xianpei Han, Ben He, Le Sun

KEPLET: Knowledge-Enhanced Pretrained Language Model with Topic Entity Awareness
Yichuan Li, Jialong Han, Kyumin Lee, Chengyuan Ma, Benjamin Yao, Xiaohu Liu

Revisiting Large Language Models as Zero-shot Relation Extractors
Guozheng Li, Peng Wang, Wenjun Ke

Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenario Multi-Domain Dialogue Summarization
Weixiao Zhou, Gengyao Li, Xianfu Cheng, Xinnian Liang, Junnan Zhu, Feifei Zhai, Zhoujun Li

Towards large language model-based personal agents in the enterprise: Current trends and open problems
Vinod Muthusamy, Yara Rizk, Kiran Kate, Praveen Venkateswaran, Vatche Isahagian, Ashu Gulati, Parijat Dube

CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
Cheng Qian, Chi Han, Yi Fung, Yujia Qin, Zhiyuan Liu, Heng Ji

Query-based Image Captioning from Multi-context 360° Images
Koki Maeda, Shuhei Kurita, Taiki Miyanishi, Naoaki Okazaki

Auto Search Indexer for End-to-End Document Retrieval
Tianchi Yang, Minghui Song, Zihan Zhang, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang

‘Person’ == Light-skinned, Western Man, and Sexualization of Women of Color: Stereotypes in Stable Diffusion
Sourojit Ghosh, Aylin Caliskan

Task-Attentive Transformer Architecture for Continual Learning of Vision-and-Language Tasks Using Knowledge Distillation
Yuliang Cai, Jesse Thomason, Mohammad Rostami

Evaluating Verifiability in Generative Search Engines
Nelson Liu, Tianyi Zhang, Percy Liang

Enhancing Abstractiveness of Summarization Models through Calibrated Distillation
Hwanjun Song, Igor Shalyminov, Hang Su, Siffi Singh, Kaisheng Yao, Saab Mansour

Visually Grounded Continual Language Learning with Selective Specialization
Kyra Ahrens, Lennart Bengtson, Jae Lee, Stefan Wermter

RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
Victor Zhong, Weijia Shi, Wen-tau Yih, Luke Zettlemoyer

Leveraging Multiple Teachers for Test-Time Adaptation of Language-Guided Classifiers
Kangda Wei, Sayan Ghosh, Rakesh Menon, Shashank Srivastava

Summarizing Multiple Documents with Conversational Structure for Meta-Review Generation
Miao Li, Eduard Hovy, Jey Lau

VIPHY: Probing “Visible” Physical Commonsense Knowledge
Shikhar Singh, Ehsan Qasemi, Muhao Chen

Two Directions for Clinical Data Generation with Large Language Models: Data-to-Label and Label-to-Data
Rumeng Li, Xun Wang, Hong Yu

Stylized Dialogue Generation with Feature-Guided Knowledge Augmentation
Jinpeng Li, Zekai Zhang, Xiuying Chen, Dongyan Zhao, Rui Yan

Probing LLMs for Joint Encoding of Linguistic Categories
Giulio Starace, Konstantinos Papakostas, Rochelle Choenni, Apostolos Panagiotopoulos, Matteo Rosati, Alina Leidinger, Ekaterina Shutova

On Robustness of Finetuned Transformer-based NLP Models
Pavan Kalyan Reddy Neerudu, Subba Oota, Mounika Marreddy, Venkateswara Kagita, Manish Gupta

Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing
Shufan Wang, Sébastien Jean, Sailik Sengupta, James Gung, Nikolaos Pappas, Yi Zhang

Topic-DPR: Topic-based Prompts for Dense Passage Retrieval
Qingfa Xiao, Shuangyin Li, Lei Chen

Quantifying the Dialect Gap and its Correlates Across Languages
Anjali Kantharuban, Ivan Vulić, Anna Korhonen

RECAL: Sample-Relation Guided Confidence Calibration over Tabular Data
Wang HaoTian, Zhen Zhang, Mengting Hu, Qichao Wang, Liang Chen, Yatao Bian, Bingzhe Wu

Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment
Zhen Zhang, Jialu Wang, Xin Wang

Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries
Prafulla Choubey, Alexander Fabbri, Caiming Xiong, Chien-Sheng Wu

GDA: Grammar-based Data Augmentation for Text Classification using Slot Information
Joonghyuk Hahn, Hyunjoon Cheon, Elizabeth Orwig, Su-Hyeon Kim, Sang-Ki Ko, Yo-Sub Han

VISTA: Visual-Textual Knowledge Graph Representation Learning
Jaejun Lee, Chanyoung Chung, Hochang Lee, Sungho Jo, Joyce Whang

A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction
Ruihao Shui, Yixin Cao, Xiang Wang, Tat-Seng Chua

Automatic Evaluate Dialogue Appropriateness by Using Dialogue Act
Bao Chen, Yuanjie Wang, Zeming Liu, Yuhang Guo

TabPrompt: Graph-based Pre-training and Prompting for Few-shot Table Understanding
Rihui Jin, Jianan Wang, Wei Tan, Yongrui Chen, Guilin Qi, Wang Hao

Improving Seq2Seq Grammatical Error Correction via Decoding Interventions
Houquan Zhou, Yumeng Liu, Zhenghua Li, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang

Good Meta-tasks Make A Better Cross-lingual Meta-transfer Learning for Low-resource Languages
Linjuan Wu, Zongyi Guo, Baoliang Cui, Haihong Tang, Weiming Lu

Reasoning Makes Good Annotators: An Automatic Task-specific Rules Distilling Framework for Low-resource Relation Extraction
Yilin Lu, Juncheng Li, Xiaoqiang Wang, Haochen Shi, Tao Chen, Siliang Tang

Co-training and Co-distillation for Quality Improvement and Compression of Language Models
Hayeon Lee, Rui Hou, Jongpil Kim, Davis Liang, Hongbo Zhang, Sung Hwang, Alexander Min

ReadPrompt: A Readable Prompting Method for Reliable Knowledge Probing
Zezhong Wang, Luyao Ye, Hongru Wang, Wai-Chung Kwan, David Ho, Kam-Fai Wong

Coherent Entity Disambiguation via Modeling Topic and Categorical Dependency
Zilin Xiao, Linjun Shou, Xingyao Zhang, Jie Wu, Ming Gong, Daxin Jiang

How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench
Qinyuan Ye, Harvey Fu, Xiang Ren, Robin Jia

POSQA: Probe the World Models of LLMs with Size Comparisons
Chang Shu, Jiuzhou Han, Fangyu Liu, Ehsan Shareghi, Nigel Collier

Hierarchical Fusion for Online Multimodal Dialog Act Classification
Md Messal Monem Miah, Adarsh Pyarelal, Ruihong Huang

STEER: Unified Style Transfer with Expert Reinforcement
Skyler Hallinan, Faeze Brahman, Ximing Lu, Jaehun Jung, Sean Welleck, Yejin Choi

Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
Kai Xiong, Xiao Ding, Yixin Cao, Ting Liu, Bing Qin

Culturally Aware Natural Language Inference
Jing Huang, Diyi Yang

End-to-End Autoregressive Retrieval via Bootstrapping for Smart Reply Systems
Benjamin Towle, Ke Zhou

Evaluating Dependencies in Fact Editing for Language Models: Specificity and Implication Awareness
Zichao Li, Ines Arous, Siva Reddy, Jackie Cheung

Effects of Human Adversarial and Affable Samples on BERT Generalization
Aparna Elangovan, Estrid He, Yuan Li, Karin Verspoor

Logic Unveils Truth, While Disguise Obscures It: Transition Logic Augmented Response Selection for Multi-Turn Dialogue
Tingchen Fu, Xueliang Zhao, Lemao Liu, Rui Yan

A Sequence-to-Structure Approach to Document-level Targeted Sentiment Analysis
Nan Song, Hongjie Cai, Rui Xia, Jianfei Yu, Zhen Wu, Xinyu Dai

Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification
Jiahuan Yan, Haojun Gao, Zhang Kai, Weize Liu, Danny Chen, Jian Wu, Jintai Chen

Impact of Co-occurrence on Factual Knowledge of Large Language Models
Cheongwoong Kang, Jaesik Choi

CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Aswanth M, Ratish Puduppully, Raj Dabre, Anoop Kunchukuttan

Swap and Predict – Predicting the Semantic Changes in Words across Corpora by Context Swapping
Taichi Aida, Danushka Bollegala

Beyond Layout Embedding: Layout Attention with Gaussian Biases for Structured Document Understanding
Xi Zhu, Xue Han, Shuyuan Peng, Shuo Lei, Chao Deng, Junlan Feng

ESPVR: Entity Spans Position Visual Regions for Multimodal Named Entity Recognition
Xiujiao Li, Guanglu Sun, Xinyu Liu

Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency
Lingfeng Shen, Weiting Tan, Boyuan Zheng, Daniel Khashabi

Detecting Erroneously Recognized Handwritten Byzantine Text
John Pavlopoulos, Vasiliki Kougia, Paraskevi Platanou, Holger Essler

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
Boyang Xue, Weichao Wang, Hongru Wang, Fei Mi, Rui Wang, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

TRIP: Accelerating Document-level Multilingual Pre-training via Triangular Document-level Pre-training on Parallel Data Triplets
Hongyuan Lu, Haoyang Huang, Shuming Ma, Dongdong Zhang, Wai Lam, Zhaochuan Gao, Anthony Aue, Arul Menezes, Furu Wei

Frequency Balanced Datasets Lead to Better Language Models
Rodolfo Zevallos, Mireia Farrús, Núria Bel

Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language Understanding
Jianing Wang, Qiushi Sun, Nuo Chen, Chengyu Wang, Jun Huang, Ming Gao, Xiang Li

TR-Rules: Rule-based Model for Link Forecasting on Temporal Knowledge Graph Considering Temporal Redundancy
Ningyuan Li, Haihong E, Shi Li, Mingzhi Sun, Tianyu Yao, Meina Song, Yong Wang, Haoran Luo

On the Transferability of Visually Grounded PCFGs
Yanpeng Zhao, Ivan Titov

Analysis of Style-Shifting on Social Media: Using Neural Language Model Conditioned by Social Meanings
Seiya Kawano, Shota Kanezaki, Angel Fernando Garcia Contreras, Akishige Yuguchi, Marie Katsurai, Koichiro Yoshino

Linguistic Compression in Single-Sentence Human-Written Summaries
Fangcong Yin, Marten van Schijndel

MCLF: A Multi-grained Contrastive Learning Framework for ASR-robust Spoken Language Understanding
Zhiqi Huang, Dongsheng Chen, Zhihong Zhu, Xuxin Cheng

Beyond Candidates: Adaptive Dialogue Agent Utilizing Persona and Knowledge
Jungwoo Lim, Myunghoon Kang, Jinsung Kim, Jeongwook Kim, Yuna Hur, Heuiseok Lim

SmartSpanNER: Making SpanNER Robust in Low Resource Scenarios
Min Zhang, Xiaosong Qiao, Yanqing Zhao, Shimin Tao, Hao Yang

ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding
Uri Shaham, Maor Ivgi, Avia Efrat, Jonathan Berant, Omer Levy

Measuring bias in Instruction-Following models with P-AT
Dario Onorati, Elena Ruzzetti, Davide Venditti, Leonardo Ranaldi, Fabio Zanzotto

Open-ended Commonsense Reasoning with Unrestricted Answer Candidates
Chen Ling, Xuchao Zhang, Xujiang Zhao, Yanchi Liu, Wei Cheng, Mika Oishi, Takao Osaki, Katsushi Matsuda, Haifeng Chen, Liang Zhao

Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units
Gallil Maimon, Yossi Adi

Knowledge-Selective Pretraining for Attribute Value Extraction
Hui Liu, Qingyu Yin, Zhengyang Wang, Chenwei Zhang, Haoming Jiang, Yifan Gao, Zheng Li, Xian Li, Chao Zhang, Bing Yin, William Wang, Xiaodan Zhu

New Datasets and Controllable Iterative Data Augmentation Method for Code-switching ASR Error Correction
Zhaohong Wan, Xiaojun Wan, Wei Peng, Rongjun Li

Isotropic Representation Can Improve Zero-Shot Cross-Lingual Transfer on Multilingual Language Models
Yixin Ji, Jikai Wang, Juntao Li, Hai Ye, Min Zhang

Blackbird language matrices (BLM), a new task for rule-like generalization in neural networks: Can Large Language Models pass the test?
Paola Merlo

DistillCSE: Distilled Contrastive Learning for Sentence Embeddings
Jiahao Xu, Wei Shao, Lihui Chen, Lemao Liu

GSAP-NER: A Novel Task, Corpus, and Baseline for Scholarly Entity Extraction Focused on Machine Learning Models and Datasets
Wolfgang Otto, Matthäus Zloch, Lu Gan, Saurav Karmakar, Stefan Dietze

Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval
John Giorgi, Luca Soldaini, Bo Wang, Gary Bader, Kyle Lo, Lucy Wang, Arman Cohan

Few-shot Unified Question Answering: Tuning Models or Prompts?
Srijan Bansal, Semih Yavuz, Bo Pang, Meghana Bhat, Yingbo Zhou

Finding Common Ground: Annotating and Predicting Common Ground in Spoken Conversations
Magdalena Markowska, Mohammad Taghizadeh, Adil Soubki, Seyed Mirroshandel, Owen Rambow

Getting MoRE out of Mixture of Language Model Reasoning Experts
Chenglei Si, Weijia Shi, Chen Zhao, Luke Zettlemoyer, Jordan Boyd-Graber

Zero-Shot Data Maps. Efficient Dataset Cartography Without Model Training
Angelo Basile, Marc Franco-Salvador, Paolo Rosso

Isotropy-Enhanced Conditional Masked Language Models
Pei Guo, Yisheng Xiao, Juntao Li, Yixin Ji, Min Zhang

Scaling Law for Document Neural Machine Translation
Zhang Zhuocheng, Shuhao Gu, Min Zhang, Yang Feng

Automatic Pronunciation Assessment - A Review
Yassine Kheir, Ahmed Ali, Shammur Chowdhury

Segmented Recurrent Transformer: An Efficient Sequence-to-Sequence Model
Yinghan Long, Sayeed Chowdhury, Kaushik Roy

PUNR: Pre-training with User Behavior Modeling for News Recommendation
Guangyuan Ma, Hongtao Liu, Xing W, Wanhui Qian, Zhepeng Lv, Qing Yang, Songlin Hu

Measure Children’s Mindreading Ability with Machine Reading
Yuliang Yan, Xiaohua Wang, Xiang Zhou, Xiaoqing Zheng, Xuanjing Huang

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs
Yihong Liu, Haotian Ye, Leonie Weissweiler, Renhao Pei, Hinrich Schuetze

Injecting structural hints: Using language models to study inductive biases in language learning
Isabel Papadimitriou, Dan Jurafsky

Machine Reading Comprehension using Case-based Reasoning
Dung Thai, Dhruv Agarwal, Mudit Chaudhary, Wenlong Zhao, Rajarshi Das, Jay-Yoon Lee, Hannaneh Hajishirzi, Manzil Zaheer, Andrew McCallum

Unleashing the Power of Language Models in Text-Attributed Graph
Haoyu Kuang, Jiarong Xu, Haozhe Zhang, Zuyu Zhao, Qi Zhang, Xuanjing Huang, Zhongyu Wei

Locally Differentially Private Document Generation Using Zero Shot Prompting
Saiteja Utpala, Sara Hooker, Pin-Yu Chen

Contrastive Deterministic Autoencoders For Language Modeling
Amur Ghose, Pascal Poupart

CHiLL: Zero-shot Custom Interpretable Feature Extraction from Clinical Notes with Large Language Models
Denis McInerney, Geoffrey Young, Jan-Willem van de Meent, Byron Wallace

Large Language Models Meet Harry Potter: A Dataset for Aligning Dialogue Agents with Characters
Nuo Chen, Yan Wang, Haiyun Jiang, Deng Cai, Yuhan Li, Ziyang Chen, Longyue Wang, Jia Li

Quick Back-Translation for Unsupervised Machine Translation
Benjamin Brimacombe, Jiawei Zhou

SIR-ABSC: Incorporating Syntax into RoBERTa-based Sentiment Analysis Models with a Special Aggregator Token
Ikhyun Cho, Yoonhwa Jung, Julia Hockenmaier

Citance-Contextualized Summarization of Scientific Papers
Shahbaz Syed, Ahmad Hakimi, Khalid Al-Khatib, Martin Potthast

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Ioannis Tsiamas, José Fonollosa, Marta Costa-jussà

Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Zhecan Wang, Long Chen, Haoxuan You, Keyang Xu, Yicheng He, Wenhao Li, Noel Codella, Kai-Wei Chang, Shih-Fu Chang

The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who
Michael Schlichtkrull, Nedjma Ousidhoum, Andreas Vlachos

Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu, Jiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai, Dongyan Zhao, Ran Wang, Rui Yan

KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion
Yanbin Wei, Qiushi Huang, Yu Zhang, James Kwok

Show, Write, and Retrieve: Entity-aware Article Generation and Retrieval
Zhongping Zhang, Yiwen Gu, Bryan Plummer

A Language Model with Limited Memory Capacity Captures Interference in Human Sentence Processing
William Timkey, Tal Linzen

Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding
Xiang Fang, Daizong Liu, Wanlong Fang, Pan Zhou, Yu Cheng, Keke Tang, Kai Zou

Parameter Efficient Multi-task Fine-tuning by Learning to Transfer Token-wise Prompts
Muling Wu, Wenhao Liu, Jianhan Xu, Changze Lv, Zixuan Ling, Tianlong Li, Longtao Huang, Xiaoqing Zheng, Xuanjing Huang

A Rewriting Approach for Gender Inclusivity in Portuguese
Leonor Veloso, Luisa Coheur, Rui Ribeiro

EARA: Improving Biomedical Semantic Textual Similarity with Entity-Aligned Attention and Retrieval Augmentation
Ying Xiong, Xin Yang, Linjing Liu, Ka-Chun Wong, Qingcai Chen, Yang Xiang, Buzhou Tang

Neuro-Symbolic Sentiment Analysis with Dynamic Word Sense Disambiguation
Xulang Zhang, Rui Mao, Kai He, Erik Cambria

CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text
Abhilash Nandy, Manav Kapadnis, Pawan Goyal, Niloy Ganguly

USB: A Unified Summarization Benchmark Across Tasks and Domains
Kundan Krishna, Prakhar Gupta, Sanjana Ramprasad, Byron Wallace, Jeffrey Bigham, Zachary Lipton

tagE: Enabling an Embodied Agent to Understand Human Instructions
Chayan Sarkar, Avik Mitra, Pradip Pramanick, Tapas Nayak

Instances and Labels: Hierarchy-aware Joint Supervised Contrastive Learning for Hierarchical Multi-Label Text Classification
Simon Chi Lok Yu, Jie He, Victor Basulto, Jeff Pan

Uncovering Limitations in Text-to-Image Generation: A Contrastive Approach with Structured Semantic Alignment
Qianyu Feng, Yulei Sui, Hongyu Zhang

An Intent-based and Annotation-free Method for Duplicate Question Detection in CQA Forums
Yubo Shu, Hansu Gu, Peng Zhang, Tun Lu, Ning Gu

Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation
Xuxin Cheng, Zhihong Zhu, Wanshi Xu, Yaowei Li, Hongxiang Li, Yuexian Zou

Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition
Yongqi Li, Yu Yu, Tieyun Qian

Connecting the Dots: What Graph-Based Text Representations Work Best for Text Classification using Graph Neural Networks?
Margarita Bugueño, Gerard de Melo

Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition
Isaac Slaughter, Craig Greenberg, Reva Schwartz, Aylin Caliskan

Text Classification via Large Language Models
Xiaofei Sun, Xiaoya Li, Jiwei Li, Fei Wu, Shangwei Guo, Tianwei Zhang, Guoyin Wang

On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval
Jiayi Chen, Hanjun Dai, Bo Dai, Aidong Zhang, Wei Wei

Semi-Structured Object Sequence Encoders
Rudra Murthy, Riyaz Bhat, Chulaka Gunasekara, Siva Patel, Hui Wan, Tejas Dhamecha, Danish Contractor, Marina Danilevsky

DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM
Weijie Xu, Wenxiang Hu, Fanyou Wu, Srinivasan Sengamedu

Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models
Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Zhenhailong Wang, Heng Ji

Chinese Metaphorical Relation Extraction
Guihua Chen, Tiantian Wu, MiaoMiao Cheng, Xu Han, Jiefu Gong, Shijin Wang, Wei Song

Example-based Hypernetworks for Multi-source Adaptation to Unseen Domains
Tomer Volk, Eyal Ben-David, Ohad Amosy, Gal Chechik, Roi Reichart

Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Models
Hongzhan Lin, Ziyang Luo, Jing Ma, Long Chen

Domain Adaptation for Conversational Query Production with the RAG Model Feedback
Ante Wang, Linfeng Song, Ge Xu, Jinsong Su

LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation
Zhitao He, Pengfei Cao, Yubo Chen, Kang Liu, Ruopeng Li, Mengshu Sun, Jun Zhao

Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation
Markus Freitag, Behrooz Ghorbani, Patrick Fernandes

The language of prompting: What linguistic properties make a prompt successful?
Alina Leidinger, Robert van Rooij, Ekaterina Shutova

When and Why Does Bias Mitigation Work?
Abhilasha Ravichander, Joe Stacey, Marek Rei

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
Zhihong Shao, Yeyun Gong, Yelong Shen, Minlie Huang, Nan Duan, Weizhu Chen

Dynamic Low-rank Estimation for Transformer-based Language Models
Ting Hua, Xiao Li, Shangqian Gao, Yen-Chang Hsu, Yilin Shen, Hongxia Jin

Non-parallel Accent Transfer based on Fine-grained Controllable Accent Modelling
Linqin Wang, Zhengtao Yu, Yuanzhang Yang, Shengxiang Gao, Cunli Mao, Yuxin Huang

Compositional Generalization for Data-to-Text Generation
Xinnuo Xu, Ivan Titov, Mirella Lapata

TalkUp: Paving the Way for Understanding Empowering Language
Lucille Njoo, Chan Park, Octavia Stappart, Marvin Thielk, Yi Chu, Yulia Tsvetkov

Unifying Text, Tables, and Images for Multimodal Question Answering
Haohao Luo, Ying Shen, Yang Deng

Multilingual Lottery Tickets to Pretrain Language Models
Jaeseong Lee, Seung-won Hwang

Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamic Audio-Visual Scenarios
Yuanyuan Jiang, Jianqin Yin

Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention
Negar Foroutan, Mohammadreza Banaei, Karl Aberer, Antoine Bosselut

CITB: A Benchmark for Continual Instruction Tuning
Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad

Mixture-of-Linguistic-Experts Adapters for Improving and Interpreting Pre-trained Language Models
Raymond Li, Gabriel Murray, Giuseppe Carenini

Towards Better Representations for Multi-Label Text Classification with Multi-granularity Information
Fangfang Li, Puzhen Su, Junwen Duan, Weidong Xiao

PCMID: Multi-Intent Detection through Supervised Prototypical Contrastive Learning
Yurun Song, Junchen Zhao, Spencer Koehler, Amir Abdullah, Ian Harris

Is GPT-4 a Good Data Analyst?
Liying Cheng, Xingxuan Li, Lidong Bing

DiffusionRet: Diffusion-Enhanced Generative Retriever using Constrained Decoding
Shanbao Qiao, Xuebing Liu, Seung-Hoon Na

Estimating Large Language Model Capabilities without Labeled Test Data
Harvey Fu, Qinyuan Ye, Albert Xu, Xiang Ren, Robin Jia

Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues
Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, Kam-Fai Wong

Toxicity in Multilingual Machine Translation at Scale
Marta Costa-jussà, Eric Smith, Christophe Ropers, Daniel Licht, Jean Maillard, Javier Ferrando, Carlos Escolano

Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue
Yuanxing Liu, Weinan Zhang, Yifan Chen, Yuchi Zhang, Haopeng Bai, Fan Feng, Hengbin Cui, Yongbin Li, Wanxiang Che

VIP5: Towards Multimodal Foundation Models for Recommendation
Shijie Geng, Juntao Tan, Shuchang Liu, Zuohui Fu, Yongfeng Zhang

Learning to Follow Object-Centric Image Editing Instructions Faithfully
Tuhin Chakrabarty, Kanishk Singh, Arkadiy Saakyan, Smaranda Muresan

Zero-shot Topical Text Classification with LLMs - an Experimental Study
Shai Gretz, Alon Halfon, Ilya Shnayderman, Orith Toledo-Ronen, Artem Spector, Lena Dankin, Yannis Katsis, Ofir Arviv, Yoav Katz, Noam Slonim, Liat Ein-Dor

Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems
Yixin Wan, Jieyu Zhao, Aman Chadha, Nanyun Peng, Kai-Wei Chang

A Black-Box Attack on Code Models via Representation Nearest Neighbor Search
Jie Zhang, Wei Ma, Qiang Hu, Shangqing Liu, Xiaofei Xie, Yves Le Traon, Yang Liu

How Well Do Text Embedding Models Understand Syntax?
Yan Zhang, Zhaopeng Feng, Zhiyang Teng, Zuozhu Liu, Haizhou Li

CASSI: Contextual and Semantic Structure-based Interpolation Augmentation for Low-Resource NER
Tanmay Surana, Thi-Nga Ho, Kyaw Tun, Eng Siong Chng

NEWTON: Are Large Language Models Capable of Physical Reasoning?
Yi Wang, Jiafei Duan, Dieter Fox, Siddhartha Srinivasa

Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language
Jimin Mun, Emily Allaway, Akhila Yerukola, Laura Vianna, Sarah-Jane Leslie, Maarten Sap

On the Calibration of Large Language Models and Alignment
Chiwei Zhu, Benfeng Xu, Quan Wang, Yongdong Zhang, Zhendong Mao

TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction
Junyi Liu, Liangzhi Li, Tong Xiang, Bowen Wang, Yiming Qian

Identifying Conspiracy Theories News based on Event Relation Graph
Yuanyuan Lei, Ruihong Huang

Salespeople vs SalesBot: Exploring the Role of Educational Value in Conversational Recommender Systems
Lidiya Murakhovs’ka, Philippe Laban, Tian Xie, Caiming Xiong, Chien-Sheng Wu

Dynamic Open-book Prompt for Conversational Recommender System
Xuan Ma, Tieyun Qian, Ke Sun

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Zhihan Zhang, Shuohang Wang, Wenhao Yu, Yichong Xu, Dan Iter, Qingkai Zeng, Yang Liu, Chenguang Zhu, Meng Jiang

Learn Your Tokens: Word-Pooled Tokenization for Language Modeling
Avijit Thawani, Saurabh Ghanekar, Xiaoyuan Zhu, Jay Pujara

Towards Detecting Contextual Real-Time Toxicity for In-Game Chat
Zachary Yang, Nicolas Grenon-Godbout, Reihaneh Rabbany

JWSign: A Highly Multilingual Corpus of Bible Translations for more Diversity in Sign Language Processing
Shester Gueuwou, Sophie Siake, Colin Leong, Mathias Müller

Do Stochastic Parrots have Feelings Too? Improving Neural Detection of Synthetic Text via Emotion Recognition
Alan Cowap, Yvette Graham, Jennifer Foster

Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules
Chaojun Xiao, Yuqi Luo, Wenbin Zhang, Pengle Zhang, Xu Han, Yankai Lin, Zhengyan Zhang, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou

PivotFEC: Enhancing Few-shot Factual Error Correction with a Pivot Task Approach using Large Language Models
Xingwei He, A-Long Jin, Jun Ma, Yuan Yuan, Siu Yiu

Semantic Similarity Covariance Matrix Shrinkage
Guillaume Becquin, Saher Esmeir

Investigating Multilingual Coreference Resolution by Universal Annotations
Haixia Chai, Michael Strube

FactSpotter: Evaluating the Factual Faithfulness of Graph-to-Text Generation
Kun Zhang, Oana Balalau, Ioana Manolescu

LayoutDIT: Layout-Aware End-to-End Document Image Translation with Multi-Step Conductive Decoder
Zhiyang Zhang, Yaping Zhang, Yupu Liang, Lu Xiang, Yang Zhao, Yu Zhou, Chengqing Zong

Balaur: Language Model Pretraining with Lexical Semantic Relations
Andrei Mircea, Jackie Cheung

Exploring In-Context Learning for Knowledge Grounded Dialog Generation
Qinyu Chen, Wenhao Wu, Sujian Li

Towards Enhancing Relational Rules for Knowledge Graph Link Prediction
Shuhan Wu, Huaiyu Wan, Wei Chen, Yuting Wu, Junfeng Shen, Youfang Lin

Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding
Lixing Zhu, Runcong Zhao, Lin Gui, Yulan He

Who is Speaking? Speaker-Aware Multiparty Dialogue Act Classification
Ayesha Qamar, Adarsh Pyarelal, Ruihong Huang

Demystifying Prompts in Language Models via Perplexity Estimation
Hila Gonen, Srini Iyer, Terra Blevins, Noah Smith, Luke Zettlemoyer

C2D2 Dataset: A Resource for the Cognitive Distortion Analysis and Its Impact on Mental Health
Bichen Wang, Pengfei Deng, Yanyan Zhao, Bing Qin

MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction
Jingheng Ye, Yinghui Li, Yangning Li, Hai-Tao Zheng

ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense
Kankan Zhou, Eason Lai, Wei Bin Au Yeong, Kyriakos Mouratidis, Jing Jiang

Automatic Analysis of Substantiation in Scientific Peer Reviews
Yanzhu Guo, Guokan Shang, Virgile Rennard, Michalis Vazirgiannis, Chloé Clavel

RealBehavior: A Framework for Faithfully Characterizing Foundation Models’ Human-like Behavior Mechanisms
Enyu Zhou, Rui Zheng, Zhiheng Xi, Songyang Gao, Xiaoran Fan, Zichu Fei, Jingting Ye, Tao Gui, Qi Zhang, Xuanjing Huang

Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance
Thiemo Wambsganss, Xiaotian Su, Vinitra Swamy, Seyed Neshaei, Roman Rietsche, Tanja Käser

VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing
Do Min, Veronica Perez-Rosas, Ken Resnicow, Rada Mihalcea

Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Yile Wang, Peng Li, Maosong Sun, Yang Liu

Pretraining Language Models with Text-Attributed Heterogeneous Graphs
Tao Zou, Le Yu, Yifei Huang, Leilei Sun, Bowen Du

On the Dimensionality of Sentence Embeddings
Hongwei Wang, Hongming Zhang, Dong Yu

Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention
Huiyin Xue, Nikolaos Aletras

StyleBART: Decorate Pretrained Model with Style Adapters for Unsupervised Stylistic Headline Generation
Hanqing Wang, Yajing Luo, Boya Xiong, Guanhua Chen, Yun Chen

RSVP: Customer Intent Detection via Agent Response Contrastive and Generative Pre-Training
Yu-Chien Tang, Wei-Yao Wang, An-Zi Yen, Wen-Chih Peng

InstructSafety: A Unified Framework for Building Multidimensional and Explainable Safety Detector through Instruction Tuning
Zhexin Zhang, Jiale Cheng, Hao Sun, Jiawen Deng, Minlie Huang

“A Tale of Two Movements”: Identifying and Comparing Perspectives in #BlackLivesMatter and #BlueLivesMatter Movements-related Tweets using Weakly Supervised Graph-based Structured Prediction
Shamik Roy, Dan Goldwasser

ClusterPrompt: Cluster Semantic Enhanced Prompt Learning for New Intent Discovery
Jinggui Liang, Lizi Liao

Investigating the Effect of Pre-finetuning BERT Models on NLI Involving Presuppositions
Jad Kabbara, Jackie Cheung

MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling
Xuxin Cheng, Zhihong Zhu, Bowen Cao, Qichen Ye, Yuexian Zou

MEGClass: Extremely Weakly Supervised Text Classification via Mutually-Enhancing Text Granularities
Priyanka Kargupta, Tanay Komarlu, Susik Yoon, Xuan Wang, Jiawei Han

Causal Inference from Text: Unveiling Interactions between Variables
Yuxiang Zhou, Yulan He

Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!
Yubo Ma, Yixin Cao, Yong Hong, Aixin Sun

Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration
Yang Deng, Lizi Liao, Liang Chen, Hongru Wang, Wenqiang Lei, Tat-Seng Chua

SummIt: Iterative Text Summarization via ChatGPT
Haopeng Zhang, Xiao Liu, Jiawei Zhang

Orthogonal Subspace Learning for Language Model Continual Learning
Xiao Wang, Tianze Chen, Qiming Ge, Han Xia, Rong Bao, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang

Attention-Enhancing Backdoor Attacks Against BERT-based Models
Weimin Lyu, Songzhu Zheng, Lu Pang, Haibin Ling, Chao Chen

Hi-ToM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models
Yufan Wu, Yinghui He, Yilin Jia, Rada Mihalcea, Yulong Chen, Naihao Deng

Image and Text: Fighting the same Battle? Super Resolution Learning for Imbalanced Text Classification
Romain Meunier, Benamara Farah, Véronique Moriceau, Patricia Stolf

SELFOOD: Self-Supervised Out-Of-Distribution Detection via Learning to Rank
Dheeraj Mekala, Adithya Samavedhi, Chengyu Dong, Jingbo Shang

Mind the Gap Between Conversations for Improved Long-Term Dialogue Generation
Qiang Zhang, Jason Naradowsky, Yusuke Miyao

A Structure-Aware Generative Adversarial Network for Bilingual Lexicon Induction
Bocheng Han, Qian Tao, Lusi Li, Zhihao Xiong

Argument mining as a multi-hop generative machine reading comprehension task
Boyang Liu, Viktor Schlegel, Riza Batista-Navarro, Sophia Ananiadou

HuatuoGPT, Towards Taming Language Model to Be a Doctor
Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Guiming Chen, Jianquan Li, Xiangbo Wu, Zhang Zhiyi, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li

Debias NLU Datasets via Training-free Perturbations
Qi Guo, Yuanhang Tang, Yawen Ouyang, Zhen Wu, Xinyu Dai

Aspect-to-Scope Oriented Multi-view Contrastive Learning for Aspect-based Sentiment Analysis
Heyan Chai, Ziyi Yao, Siyu Tang, Ye Wang, Liqiang Nie, Binxing Fang, Qing Liao

Legally Enforceable Hate Speech Detection for Public Forums
Chu Luo, Rohan Bhambhoria, Samuel Dahan, Xiaodan Zhu

ConPrompt: Pre-training a Language Model with Machine-Generated Data for Implicit Hate Speech Detection
Youngwook Kim, Shinwoo Park, Youngsoo Namgoong, Yo-Sub Han

Incorporating Syntactic Knowledge into Pre-trained Language Model using Optimization for Overcoming Catastrophic Forgetting
Ran Iwamoto, Issei Yoshida, Hiroshi Kanayama, Takuya Ohko, Masayasu Muraoka

Toward Human Readable Prompt Tuning: Kubrick’s The Shining is a good movie, and a good prompt too?
Weijia Shi, Xiaochuang Han, Hila Gonen, Ari Holtzman, Yulia Tsvetkov, Luke Zettlemoyer

Chain-of-Thought Reasoning in Tabular Language Models
Mingyu Zheng, Hao Yang, Wenbin Jiang, Zheng Lin, Yajuan Lyu, Qiaoqiao She, Weiping Wang

Diffusion Language Model with Query-Document Relevance for Query-Focused Summarization
Shaoyao Huang, Luozheng Qin, Ziqiang Cao

Grounded and well-rounded: a methodological approach to the study of cross-modal and cross-lingual grounding
Timothee Mickus, Elaine Zosa, Denis Paperno

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models
Weize Chen, Xiaoyue Xu, Xu Han, Yankai Lin, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou

Natural Response Generation for Chinese Reading Comprehension
Nuo Chen, Hongguang Li, Yinan Bao, Baoyuan Wang, Jia Li

Is ChatGPT a Good Causal Reasoner? A Comprehensive Evaluation
Jinglong Gao, Xiao Ding, Bing Qin, Ting Liu

How Many Demonstrations Do You Need for In-context Learning?
Jiuhai Chen, Lichang Chen, Chen Zhu, Tianyi Zhou

Improving word mover’s distance by leveraging self-attention matrix
Hiroaki Yamagiwa, Sho Yokoi, Hidetoshi Shimodaira

Long-Horizon Dialogue Understanding for Role Identification in the Game of Avalon with Large Language Models
Simon Stepputtis, Joseph Campbell, Yaqi Xie, Zhengyang Qi, Wenxin Zhang, Ruiyi Wang, Sanketh Rangreji, Charles Lewis, Katia Sycara

Improving Sequential Model Editing with Fact Retrieval
Xiaoqi Han, Ru Li, Hongye Tan, Wang Yuanlong, Qinghua Chai, Jeff Pan

Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT - A Text-to-SQL Parsing Comparison
Shuo Sun, Yuchen Zhang, Jiahuan Yan, Yuze Gao, Donovan Ong, Bin Chen, Jian Su

KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model
Lei Geng, Xu Yan, Ziqiang Cao, Juntao Li, Wenjie Li, Sujian Li, Xinjie Zhou, Yang Yang, Jun Zhang

A Zero-Shot Language Agent for Computer Control with Structured Reflection
Tao Li, Gang Li, Zhiwei Deng, Bryan Wang, Yang Li

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Yi Dong, Zhilin Wang, Makesh Sreedhar, Xianchao Wu, Oleksii Kuchaiev

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad Ayyubi, Kai-Wei Chang, Shih-Fu Chang

GRI: Graph-based Relative Isomorphism of Word Embedding Spaces
Muhammad Ali, Yan Hu, Jianbin Qin, Di Wang

PersonaLM: Language Model Personalization via Domain-distributed Span Aggregated K-Nearest N-gram Retrieval Augmentation
Puneet Mathur, Zhe Liu, Ke Li, Yingyi Ma, Gil Keren, Zeeshan Ahmed, Dinesh Manocha, Xuedong Zhang

Scaling Vision-Language Models with Sparse Mixture of Experts
Sheng Shen, Zhewei Yao, Chunyuan Li, Trevor Darrell, Kurt Keutzer, Yuxiong He

Aspect-Category Enhanced Learning with a Neural Coherence Model for Implicit Sentiment Analysis
Jin Cui, Fumiyo Fukumoto, Xinfeng Wang, Yoshimi Suzuki, Jiyi Li, Wanzeng Kong

End-to-end Adversarial Sample Generation for Data Augmentation
Tianyuan Liu, Yuqing Sun

Query2Triple: Unified Query Encoding for Answering Diverse Complex Queries over Knowledge Graphs
Yao Xu, Shizhu He, Cunguang Wang, Li Cai, Kang Liu, Jun Zhao

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Jia Liu, Tao Gui, Qi Zhang, Xuanjing Huang

Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection
Jianwei Li, Weizhi Gao, Qi Lei, Dongkuan Xu

Eyes Show the Way: Modelling Gaze Behaviour for Hallucination Detection
Kishan Maharaj, Ashita Saxena, Raja Kumar, Abhijit Mishra, Pushpak Bhattacharyya

Noisy Pair Corrector for Dense Retrieval
Hang Zhang, Yeyun Gong, Xingwei He, Dayiheng Liu, Daya Guo, Jiancheng Lv, Jian Guo

Diversifying language models for lesser-studied languages and language-usage contexts: A case of second language Korean
Hakyung Sung, Gyu-Ho Shin

Improving generalization in large language model by learning prefix subspaces
Louis Falissard, Vincent Guigue, Laure Soulier

Domain Adaptation for Sentiment Analysis Using Robust Internal Representations
Mohammad Rostami, Digbalay Bose, Shrikanth Narayanan, Aram Galstyan

KeFVP: Knowledge-enhanced Financial Volatility Prediction
Hao Niu, Yun Xiong, Xiaosu Wang, Wenjing Yu, Yao Zhang, Weizu Yang

A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check
Haojing Huang, Jingheng Ye, Qingyu Zhou, Yinghui Li, Yangning Li, Feng Zhou, Hai-Tao Zheng

Asking Clarification Questions to Handle Ambiguity in Open-Domain QA
Dongryeol Lee, Segwang Kim, Minwoo Lee, Hwanhee Lee, Joonsuk Park, Sang-Woo Lee, Kyomin Jung

Addressing the Length Bias Challenge in Document-Level Neural Machine Translation
Zhang Zhuocheng, Shuhao Gu, Min Zhang, Yang Feng

EconBERTa: Towards Robust Extraction of Named Entities in Economics
Karim Lasri, Pedro Vitor Quinta de Castro, Mona Schirmer, Luis Eduardo San Martin, Linxi Wang, Tomáš Dulka, Haaya Naushan, John Pougué-Biyong, Arianna Legovini, Samuel Fraiberger

Consonant is all you need: a compact representation of English text for efficient NLP
Maged Al-shaibani, Irfan Ahmad

Detrimental Contexts in Open-Domain Question Answering
Philhoon Oh, James Thorne

PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India
Ashok Urlana, Pinzhen Chen, Zheng Zhao, Shay Cohen, Manish Shrivastava, Barry Haddow

Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture
Bingsheng Yao, Ishan Jindal, Lucian Popa, Yannis Katsis, Sayan Ghosh, Lihong He, Yuxuan Lu, Shashank Srivastava, Yunyao Li, James Hendler, Dakuo Wang

Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition
Yige Xu, Zhiwei Zeng, Zhiqi Shen

SYMPTOMIFY: Transforming Symptom Annotations with Language Model Knowledge Harvesting
Bosung Kim, Ndapa Nakashole

TokenDrop + BucketSampler: Towards Efficient Padding-free Fine-tuning of Language Models
Amrit Nagarajan, Anand Raghunathan

Unified Representation for Non-compositional and Compositional Expressions
Ziheng Zeng, Suma Bhat

Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering
Kosuke Akimoto, Kunihiro Takeoka, Masafumi Oyamada

Error Detection for Text-to-SQL Semantic Parsing
Shijie Chen, Ziru Chen, Huan Sun, Yu Su

Ultra-Fine Entity Typing with Prior Knowledge about Labels: A Simple Clustering Based Strategy
Na Li, Zied Bouraoui, Steven Schockaert

Do “English” Named Entity Recognizers Work Well on Global Englishes?
Alexander Shan, John Bauer, Riley Carlson, Christopher Manning

Affective and Dynamic Beam Search for Story Generation
Tenghao Huang, Ehsan Qasemi, Bangzheng Li, He Wang, Faeze Brahman, Muhao Chen, Snigdha Chaturvedi

Multiview Clickbait Detection via Jointly Modeling Subjective and Objective Preference
Chongyang Shi, Yijun Yin, Qi Zhang, Liang Xiao, Usman Naseem, Shoujin Wang, Liang Hu

Let’s Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
Ruida Wang, Wangchunshu Zhou, Mrinmaya Sachan

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Mohammad Shoeybi, Ming-Yu Liu, Yuke Zhu, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar

Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate
Boshi Wang, Xiang Yue, Huan Sun

Using In-Context Learning to Improve Dialogue Safety
Nicholas Meade, Spandana Gella, Devamanyu Hazarika, Prakhar Gupta, Di Jin, Siva Reddy, Yang Liu, Dilek Hakkani-Tur

HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue
Sunjae Yoon, Dahyun Kim, Eunseop Yoon, Hee Yoon, Junyeong Kim, Chang Yoo

Defining a New NLP Playground
Sha Li, Chi Han, Pengfei Yu, Carl Edwards, Manling Li, Xingyao Wang, Yi Fung, Charles Yu, Joel Tetreault, Eduard Hovy, Heng Ji

UPTON: Preventing Authorship Leakage from Public Text Release via Data Poisoning
Ziyao Wang, Thai Le, Dongwon Lee

IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding
Peijian Gu, Yaozong Shen, Lijie Wang, Quan Wang, Hua Wu, Zhendong Mao

Scene Graph Enhanced Pseudo-Labeling for Referring Expression Comprehension
Cantao Wu, Yi Cai, Liuwu Li, Jiexin Wang

Noisy Self-Training with Synthetic Queries for Dense Retrieval
Fan Jiang, Tom Drummond, Trevor Cohn

Leveraging GPT-4 for Automatic Translation Post-Editing
Vikas Raunak, Amr Sharaf, Yiren Wang, Hany Awadalla, Arul Menezes

Uniform Complexity for Text Generation
Joseph Imperial, Harish Madabushi

Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs
Hongru Wang, Rui Wang, Fei Mi, Yang Deng, Zezhong Wang, Bin Liang, Ruifeng Xu, Kam-Fai Wong

CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction
Rajdeep Mukherjee, Nithish Kannen, Saurabh Pandey, Pawan Goyal

Towards Anytime Fine-tuning: Continually Pre-trained Language Models with Hypernetwork Prompts
Gangwei Jiang, Caigao Jiang, Siqiao Xue, James Zhang, Jun Zhou, Defu Lian, Ying Wei

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Kashun Shum, Shizhe Diao, Tong Zhang

What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations
Kavel Rao, Liwei Jiang, Valentina Pyatkin, Yuling Gu, Niket Tandon, Nouha Dziri, Faeze Brahman, Yejin Choi

An Empirical Study on Multiple Knowledge from ChatGPT for Emotion Recognition in Conversations
Geng Tu, Bin Liang, Bing Qin, Kam-Fai Wong, Ruifeng Xu

Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction
Leilei Gan, Baokui Li, Kun Kuang, Yating Zhang, Lei Wang, Anh Luu, Yi Yang, Fei Wu

Dimensions of Online Conflict: Towards Modeling Agonism
Matt Canute, Mali Jin, hannah holtzclaw, Alberto Lusoli, Philippa Adams, Mugdha Pandya, Maite Taboada, Diana Maynard, Wendy Hui Kyong Chun

Learning under Label Proportions for Text Classification
Jatin Chauhan, Xiaoxuan Wang, Wei Wang

MetaReVision: Meta-Learning with Retrieval for Visually Grounded Compositional Concept Acquisition
Guangyue Xu, Parisa Kordjamshidi, Joyce Chai

PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
Yongil Kim, Yerin Hwang, Hyeongu Yun, Seunghyun Yoon, Trung Bui, Kyomin Jung

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding
Yu Zhang, Hao Cheng, Zhihong Shen, Xiaodong Liu, Ye-Yi Wang, Jianfeng Gao

BLM-s/lE: A structured dataset of English spray-load verb alternations for testing generalization in LLMs
Giuseppe Samo, Vivi Nastase, Chunyang Jiang, Paola Merlo

Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt
Seonghyeon Ye, Joel Jang, Doyoung Kim, Yongrae Jo, Minjoon Seo

Geographical Erasure in Language Generation
Pola Schwöbel, Jacek Golebiowski, Michele Donini, Cedric Archambeau, Danish Pruthi

Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?
Yuwei Bao, Keunwoo Yu, Yichi Zhang, Shane Storks, Itamar Bar-Yossef, Alex de la Iglesia, Megan Su, Xiao Zheng, Joyce Chai

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Yi Tay, Mostafa Dehghani, Samira Abnar, Hyung Chung, William Fedus, Jinfeng Rao, Sharan Narang, Vinh Tran, Dani Yogatama, Donald Metzler

Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting
Haoyang Huang, Tianyi Tang, Dongdong Zhang, Xin Zhao, Ting Song, Yan Xia, Furu Wei

DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
Jinyan Su, Terry Zhuo, Di Wang, Preslav Nakov

From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models
Yan Junbing, Chengyu Wang, Taolin Zhang, Xiaofeng He, Jun Huang, Wei Zhang

Macedon: Minimizing Representation Coding Rate Reduction for Cross-Lingual Natural Language Understanding
Haoyu Wang, Yaqing Wang, Huaxiu Yao, Jing Gao

Adversarial Robustness for Large Language NER models using Disentanglement and Word Attributions
Xiaomeng Jin, Bhanukiran Vinzamuri, Sriram Venkatapathy, Heng Ji, Pradeep Natarajan

You Are What You Annotate: Towards Better Models through Annotator Representations
Naihao Deng, Xinliang Zhang, Siyang Liu, Winston Wu, Lu Wang, Rada Mihalcea

Large Language Models Are Better Adversaries: Exploring Generative Clean-Label Backdoor Attacks Against Text Classifiers
Wencong You, Zayd Hammoudeh, Daniel Lowd

Noise-Robust Fine-Tuning of Pretrained Language Models via External Guidance
Song Wang, Zhen Tan, Ruocheng Guo, Jundong Li

Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions
Shulin Cao, Jiajie Zhang, Jiaxin Shi, Xin Lv, Zijun Yao, Qi Tian, Lei Hou, Juanzi Li

Ensemble-Instruct: Instruction Tuning Data Generation with a Heterogeneous Mixture of LMs
Young-Suk Lee, Md Sultan, Yousef El-Kurdi, Tahira Naseem, Asim Munawar, Radu Florian, Salim Roukos, Ramón Astudillo

The Less the Merrier? Investigating Language Representation in Multilingual Models
Hellina Nigatu, Atnafu Tonja, Jugal Kalita

SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research
Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Leonardo Neves, Kiamehr Rezaee, Luis Espinosa-Anke, Jiaxin Pei, Jose Camacho-Collados

Enabling Unsupervised Neural Machine Translation with Word-level Visual Representations
Chengpeng Fu, Xiaocheng Feng, Yichong Huang, Wenshuai Huo, Hui Wang, Bing Qin, Ting Liu

Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches
Daniel Fried, Nicholas Tomlin, Jennifer Hu, Roma Patel, Aida Nematzadeh

MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention
Thinh Pham, Tran Chi, Dat Nguyen

Enhancing Emotion Recognition in Conversation via Multi-view Feature Alignment and Memorization
Guiyang Hou, Yongliang Shen, Wenqi Zhang, Wei Xue, Weiming Lu

Mandarin classifier systems optimize to accommodate communicative pressures
Yamei Wang, Géraldine Walther

Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features
Li Zhou, Antonia Karamolegkou, Wenyu Chen, Daniel Hershcovich

Linguistically Motivated Sign Language Segmentation
Amit Moryossef, Zifan Jiang, Mathias Müller, Sarah Ebling, Yoav Goldberg

Language-Agnostic Bias Detection in Language Models with Bias Probing
Abdullatif Köksal, Omer Yalcin, Ahmet Akbiyik, M. Kilavuz, Anna Korhonen, Hinrich Schuetze

A Joint Matrix Factorization Analysis of Multilingual Representations
Zheng Zhao, Yftah Ziser, Bonnie Webber, Shay Cohen

Don’t Add, don’t Miss: Effective Content Preserving Generation from Pre-Selected Text Spans
Aviv Slobodkin, Avi Caciularu, Eran Hirsch, Ido Dagan

A Computational Interface to Translate Strategic Intent from Unstructured Language in a Low-Data Setting
Pradyumna Tambwekar, Lakshita Dodeja, Nathan Vaska, Wei Xu, Matthew Gombolay

HFMRE: Constructing Huffman Tree in Bags to Find Excellent Instances for Distantly Supervised Relation Extraction
Min Li, Cong Shao, Gang Li, Mingle Zhou

DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages
Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya

Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity
Haoran Xu, Maha Elbayad, Kenton Murray, Jean Maillard, Vedanuj Goswami

Misery Loves Complexity: Exploring Linguistic Complexity in the Context of Emotion Detection
Pranaydeep Singh, Luna De Bruyne, Orphée De Clercq, Els Lefever

Code-Switching with Word Senses for Pretraining in Neural Machine Translation
Vivek Iyer, Edoardo Barba, Alexandra Birch, Jeff Pan, Roberto Navigli

DiffusionSL: Sequence Labeling via Tag Diffusion Process
Ziyang Huang, Pengfei Cao, Jun Zhao, Kang Liu

COMET-M: Reasoning about Multiple Events in Complex Sentences
Sahithya Ravi, Raymond Ng, Vered Shwartz

AniEE: A Dataset of Animal Experimental Literature for Event Extraction
Dohee Kim, Ra Yoo, Soyoung Yang, Hee Yang, Jaegul Choo

From Words to Wires: Generating Functioning Electronic Devices from Natural Language Descriptions
Peter Jansen

Data-efficient Active Learning for Structured Prediction with Partial Annotation and Self-Training
Zhisong Zhang, Emma Strubell, Eduard Hovy

Explicit Alignment and Many-to-many Entailment Based Reasoning for Conversational Machine Reading
Yangyang Luo, Shiyu Tian, Caixia Yuan, Xiaojie Wang

Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers
Osman İnce, Tanin Zeraati, Semih Yagcioglu, Yadollah Yaghoobzadeh, Erkut Erdem, Aykut Erdem

Roles of Scaling and Instruction Tuning in Language Perception: Model vs. Human Attention
Changjiang Gao, Shujian Huang, Jixing Li, Jiajun Chen

Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning
Han Zhou, Xingchen Wan, Ivan Vulić, Anna Korhonen

LLMaAA: Making Large Language Models as Active Annotators
Ruoyu Zhang, Yanzeng Li, Yongliang Ma, Ming Zhou, Lei Zou

NLMs: Augmenting Negation in Language Models
Rituraj Singh, Rahul Kumar, Vivek Sridhar

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers
Weng Tam, Xiao Liu, Kaixuan Ji, Lilong Xue, Jiahua Liu, Tao Li, Yuxiao Dong, Jie Tang

X-SNS: Cross-Lingual Transfer Prediction through Sub-Network Similarity
Taejun Yun, Jinhyeon Kim, Deokyeong Kang, Seonghoon Lim, Jihoon Kim, Taeuk Kim

Noise-Robust Semi-Supervised Learning for Distantly Supervised Relation Extraction
Xin Sun, Qiang Liu, Shu Wu, Zilei Wang, Liang Wang

Towards Concept-Aware Large Language Models
Chen Shani, Jilles Vreeken, Dafna Shahaf

ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Lai, Nghia Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Nguyen

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training
Max Müller-Eberstein, Rob van der Goot, Barbara Plank, Ivan Titov

Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
Zhe Yang, Damai Dai, Peiyi Wang, Zhifang Sui

Difference-Masking: Choosing What to Mask in Continued Pretraining
Alex Wilf, Syeda Akter, Leena Mathur, Paul Liang, Sheryl Mathew, Mengrou Shou, Eric Nyberg, Louis-Philippe Morency

Learn From One Specialized Sub-Teacher: One-to-One Mapping for Feature-Based Knowledge Distillation
Khouloud Saadi, Jelena Mitrović, Michael Granitzer

Conditioning on Dialog Acts improves Empathy Style Transfer
Renyi Qu, Lyle Ungar, João Sedoc

From Speculation Detection to Trustworthy Relational Tuples in Information Extraction
Kuicai Dong, Aixin Sun, Jung-jae Kim, Xiaoli Li

Dialogue Medical Information Extraction with Medical-Item Graph and Dialogue-Status Enriched Representation
Lei Gao, Xinnan Zhang, Xian Wu, Shen Ge, Yefeng Zheng

Decomposed Prompt Tuning via Low-Rank Reparameterization
Yao Xiao, Lu Xu, Jiaxi Li, Wei Lu, Xiaoli Li

SGP-TOD: Building Task Bots Effortlessly via Schema-Guided LLM Prompting
Xiaoying Zhang, Baolin Peng, Kun Li, Jingyan Zhou, Helen Meng

Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs
Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury

Vector-Quantized Prompt Learning for Paraphrase Generation
Haotian Luo, Yixin Liu, Peidong Liu, Xianggen Liu

Rethinking the Construction of Effective Metrics for Understanding the Mechanisms of Pretrained Language Models
You Li, Jinhui Yin, Yuming Lin

PARROT: Zero-Shot Narrative Reading Comprehension via Parallel Reading
Chao Zhao, Anvesh Vijjini, Snigdha Chaturvedi

BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance
Karel D’Oosterlinck, François Remy, Johannes Deleu, Thomas Demeester, Chris Develder, Klim Zaporojets, Aneiss Ghodsi, Simon Ellershaw, Jack Collins, Christopher Potts

Coarse-to-Fine Dual Encoders are Better Frame Identification Learners
Kaikai An, Ce Zheng, Bofei Gao, Haozhe Zhao, Baobao Chang

Sound of Story: Multi-modal Storytelling with Audio
Jaeyeon Bae, Seokhoon Jeong, Seokun Kang, Namgi Han, Jae-Yon Lee, Hyounghun Kim, Taehwan Kim

Leveraging Contrastive Learning and Knowledge Distillation for Incomplete Modality Rumor Detection
Fan Xu, Pinyun Fu, Qi Huang, Bowei Zou, AiTi Aw, Mingwen Wang

Beyond Testers’ Biases: Guiding Model Testing with Knowledge Bases using LLMs
Chenyang Yang, Rishabh Rustogi, Rachel Brower-Sinning, Grace Lewis, Christian Kaestner, Tongshuang Wu

CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering
Weiqi Wang, Tianqing Fang, Wenxuan Ding, Baixuan Xu, Xin Liu, Yangqiu Song, Antoine Bosselut

kNN-CM: A Non-parametric Inference-Phase Adaptation of Parametric Text Classifiers
Rishabh Bhardwaj, Yingting Li, Navonil Majumder, Bo Cheng, Soujanya Poria

Cross-modality Data Augmentation for End-to-End Sign Language Translation
Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Hui Xiong

Relation-Aware Question Answering for Heterogeneous Knowledge Graphs
Haowei Du, Quzhe Huang, Chen Li, Chen Zhang, Yang Li, Dongyan Zhao

Less than One-shot: Named Entity Recognition via Extremely Weak Supervision
Letian Peng, Zihan Wang, Jingbo Shang

Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
Jungmin Yun, Mihyeon Kim, Youngbin Kim

Semantic Decomposition of Question and SQL for Text-to-SQL Parsing
Ben Eyal, Moran Mahabi, Ophir Haroche, Amir Bachar, Michael Elhadad

Time-Aware Language Modeling for Historical Text Dating
Han Ren, Hai Wang, Yajie Zhao, Yafeng Ren

A Read-and-Select Framework for Zero-shot Entity Linking
Zhenran Xu, Yulin Chen, Baotian Hu, Min Zhang

Multi-Task Learning of Query Generation and Classification for Generative Conversational Question Rewriting
Sarawoot Kongyoung, Craig MacDonald, Iadh Ounis

DepNeCTI: Dependency-based Nested Compound Type Identification for Sanskrit
Jivnesh Sandhan, Yaswanth Narsupalli, Sreevatsa Muppirala, Sriram Krishnan, Pavankumar Satuluri, Amba Kulkarni, Pawan Goyal

HeQ: a Large and Diverse Hebrew Reading Comprehension Benchmark
Amir Cohen, Hilla Merhav-Fine, Yoav Goldberg, Reut Tsarfaty

HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis
Nafis Tripto, Adaku Uchendu, Thai Le, Mattia Setzu, Fosca Giannotti, Dongwon Lee

Data Augmentation for Code Translation with Comparable Corpora and Multiple References
Yiqing Xie, Atharva Naik, Daniel Fried, Carolyn Rose

Multilingual Generation and Answering of Questions from Texts and Knowledge Graphs
Kelvin Han, Claire Gardent

InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation
Renzhi Wang, Jing Li, Piji Li

Enhancing Scalability of Pre-trained Language Models via Efficient Parameter Sharing
Peiyu Liu, Ze-Feng Gao, Yushuo Chen, Xin Zhao, Ji-Rong Wen

Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification
Yookyung Kho, Jaehee Kim, Pilsung Kang

On the Impact of Cross-Domain Data on German Language Models
Amin Dada, Aokun Chen, Cheng Peng, Kaleb Smith, Ahmad Idrissi-Yaghir, Constantin Seibold, Jianning Li, Lars Heiliger, Christoph Friedrich, Daniel Truhn, Jan Egger, Jiang Bian, Jens Kleesiek, Yonghui Wu

Dialect-to-Standard Normalization: A Large-Scale Multilingual Evaluation
Olli Kuparinen, Aleksandra Miletić, Yves Scherrer

A Parallel Corpus for Vietnamese Central-Northern Dialect Text Transfer
Thang Le, Anh Luu

A Comprehensive Evaluation of Tool-Assisted Generation Strategies
Alon Jacovi, Avi Caciularu, Jonathan Herzig, Roee Aharoni, Bernd Bohnet, Mor Geva

InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT
Yichong Xu, Ruochen Xu, Dan Iter, Yang Liu, Shuohang Wang, Chenguang Zhu, Michael Zeng

Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?
Xiaoxi Kang, Lizhen Qu, Lay-Ki Soon, Adnan Trakic, Terry Zhuo, Patrick Emerton, Genevieve Grant

Coverage-based Example Selection for In-Context Learning
Shivanshu Gupta, Matt Gardner, Sameer Singh

Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization
Ningyu Xu, Qi Zhang, Jingting Ye, Menghan Zhang, Xuanjing Huang

Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing
Lucie-Aimée Kaffee, Arnav Arora, Zeerak Talat, Isabelle Augenstein

BYOC: Personalized Few-Shot Classification with Co-Authored Class Descriptions
Arth Bohra, Govert Verkes, Artem Harutyunyan, Pascal Weinberger, Giovanni Campagna

Approximating CKY with Transformers
Ghazal Khalighinejad, Ollie Liu, Sam Wiseman

DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
Prakhar Gupta, Yang Liu, Di Jin, Behnam Hedayatnia, Spandana Gella, Sijia Liu, Patrick Lange, Julia Hirschberg, Dilek Hakkani-Tur

RWKV: Reinventing RNNs for the Transformer Era
Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Leon Derczynski, Xingjian Du, Matteo Grella, Kranthi GV, Xuzheng He, Haowen Hou, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartłomiej Koptyra, Hayden Lau, Jiaju Lin, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Johan Wind, Stanisław Woźniak, Zhenyuan Zhang, Qinghua Zhou, Jian Zhu, Rui-Jie Zhu

VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers
Shahar Katz, Yonatan Belinkov

Is Robustness Transferable across Languages in Multilingual Neural Machine Translation?
Leiyu Pan, Supryadi, Deyi Xiong

Interpreting Answers to Yes-No Questions in User-Generated Content
Shivam Mathur, Keun Park, Dhivya Chinnappa, Saketh Kotamraju, Eduardo Blanco

Task-Aware Self-Supervised Framework for Dialogue Discourse Parsing
Wei Li, Luyao Zhu, Wei Shao, Zonglin Yang, Erik Cambria

Selective Demonstrations for Cross-domain Text-to-SQL
Shuaichen Chang, Eric Fosler-Lussier

IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery
Bhavuk Singhal, Ashim Gupta, V P Shivasankaran, Amrith Krishna

INarIG: Iterative Non-autoregressive Instruct Generation Model For Word-Level Auto Completion
Hengchao Shang, Zongyao Li, Daimeng Wei, Jiaxin Guo, Minghan Wang, Xiaoyu Chen, Lizhi Lei, Hao Yang

Learning Dynamic Representations for Discourse Dependency Parsing
Tianyi Liu, Yansong Feng, Dongyan Zhao

K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings
Chaewon Park, Soohwan Kim, Kyubyong Park, Kunwoo Park

Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation
Wen Lai, Alexandra Chronopoulou, Alexander Fraser

BotPercent: Estimating Bot Populations in Twitter Communities
Zhaoxuan Tan, Shangbin Feng, Melanie Sclar, Herun Wan, Minnan Luo, Yejin Choi, Yulia Tsvetkov

The Locality and Symmetry of Positional Encodings
Lihu Chen, Gael Varoquaux, Fabian Suchanek

Towards a Deep Understanding of Multilingual End-to-End Speech Translation
Haoran Sun, Xiaohu Zhao, Yikun Lei, Shaolin Zhu, Deyi Xiong

An Empirical Investigation of Implicit and Explicit Knowledge-Enhanced Methods for Ad Hoc Dataset Retrieval
Weiqing Luo, Qiaosheng Chen, Zhiyang Zhang, Zixian Huang, Gong Cheng

A Multi-Modal Multilingual Benchmark for Document Image Classification
Yoshinari Fujinuma, Siddharth Varia, Nishant Sankaran, Srikar Appalaraju, Bonan Min, Yogarshi Vyas

Unnatural language processing: How do language models handle machine-generated prompts?
Corentin Kervadec, Francesca Franzon, Marco Baroni

Gradually Excavating External Knowledge for Implicit Complex Question Answering
Chang Liu, Xiaoguang Li, Lifeng Shang, Xin Jiang, Qun Liu, Edmund Lam, Ngai Wong

Evaluating Subjective Cognitive Appraisals of Emotions from Large Language Models
Hongli Zhan, Desmond Ong, Junyi Li

Exploring Linguistic Properties of Monolingual BERTs with Typological Classification among Languages
Elena Ruzzetti, Federico Ranaldi, Felicia Logozzo, Michele Mastromattei, Leonardo Ranaldi, Fabio Zanzotto

Discourse Sense Flows: Modelling the Rhetorical Style of Documents across Various Domains
Rene Knaebel, Manfred Stede

HierarchicalContrast: A Coarse-to-Fine Contrastive Learning Framework for Cross-Domain Zero-Shot Slot Filling
Junwen Zhang, Yin Zhang

A Confederacy of Models: a Comprehensive Evaluation of LLMs on Creative Writing
Carlos Gómez-Rodríguez, Paul Williams

1-PAGER: One Pass Answer Generation and Evidence Retrieval
Palak Jain, Livio Soares, Tom Kwiatkowski

Context-faithful Prompting for Large Language Models
Wenxuan Zhou, Sheng Zhang, Hoifung Poon, Muhao Chen

InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
Yifan Song, Peiyi Wang, Weimin Xiong, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li

Sparse Frame Grouping Network with Action Centered for Untrimmed Video Paragraph Captioning
Guorui Yu, Yimin Hu, Yuejie Zhang, Rui Feng, Tao Zhang, Shang Gao

Unsupervised Binary Code Translation with Application to Code Clone Detection and Vulnerability Discovery
Iftakhar Ahmad, Lannan Luo

Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering
Inderjeet Nair, Shwetha Somasundaram, Apoorv Saxena, Koustava Goswami

Alignment Precedes Fusion: Open-Vocabulary Named Entity Recognition as Context-Type Semantic Matching
Zhuoran Jin, Pengfei Cao, Zhitao He, Yubo Chen, Kang Liu, Jun Zhao

Representation Projection Invariance Mitigates Representation Collapse
Anastasia Razdaibiedina, Ashish Khetan, Zohar Karnin, Daniel Khashabi, Vivek Madan

Tunable Soft Prompts are Messengers in Federated Learning
Chenhe Dong, Yuexiang Xie, Bolin Ding, Ying Shen, Yaliang Li

Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting
Benjamin Yan, Ruochen Liu, David Kuo, Subathra Adithan, Eduardo Reis, Stephen Kwak, Vasantha Venugopal, Chloe O’Connell, Agustina Saenz, Pranav Rajpurkar, Michael Moor

Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs
Yuxin Zuo, Bei Li, Chuanhao Lv, Tong Zheng, Tong Xiao, JingBo Zhu

GenKIE: Robust Generative Multimodal Document Key Information Extraction
Panfeng Cao, Ye Wang, Qiang Zhang, Zaiqiao Meng

Improving Multimodal Sentiment Analysis: Supervised Angular margin-based Contrastive Learning for Enhanced Fusion Representation
Cong-Duy Nguyen, Thong Nguyen, Duc Vu, Anh Luu

Efficient Multilingual Language Model Compression through Vocabulary Trimming
Asahi Ushio, Yi Zhou, Jose Camacho-Collados

GTA: Gated Toxicity Avoidance for LM Performance Preservation
Heegyu Kim, Hyunsouk Cho

LMGQS: A Large-scale Dataset for Query-focused Summarization
Ruochen Xu, Song Wang, Yang Liu, Shuohang Wang, Yichong Xu, Dan Iter, Pengcheng He, Chenguang Zhu, Michael Zeng

ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
Zhipeng Chen, Kun Zhou, Beichen Zhang, Zheng Gong, Xin Zhao, Ji-Rong Wen

Non-Autoregressive Document-Level Machine Translation
Guangsheng Bao, Zhiyang Teng, Hao Zhou, Jianhao Yan, Yue Zhang

Exploring the Effectiveness of Multi-Lingual Commonsense Knowledge-Aware Open-Domain Dialogue Response Generation
Sixing Wu, Jiong Yu, Tianshi Che, Yang Zhou, Wei Zhou

Mixture of Soft Prompts for Controllable Data Generation
Derek Chen, Celine Lee, Yunan Lu, Domenic Rosati, Zhou Yu

A Boundary Offset Prediction Network for Named Entity Recognition
Minghao Tang, Yongquan He, Yongxiu Xu, Hongbo Xu, Wenyuan Zhang, Yang Lin

Prefix-Tuning Based Unsupervised Text Style Transfer
Huiyu Mai, Wenhao Jiang, Zhi-Hong Deng

Evaluating and Enhancing the Robustness of Code Pre-trained Models through Structure-Aware Adversarial Samples Generation
Nuo Chen, Qiushi Sun, Jianing Wang, Ming Gao, Xiaoli Li, Xiang Li

Annotation Sensitivity: Training Data Collection Methods Affect Model Performance
Christoph Kern, Stephanie Eckman, Jacob Beck, Rob Chew, Bolei Ma, Frauke Kreuter

Qualitative Code Suggestion: A Human-Centric Approach to Qualitative Coding
Cesare Piano, Samira Rahimi, Jackie Cheung

D$^2$TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization
Yunlong Liang, Fandong Meng, Jiaan Wang, Jinan Xu, Yufeng Chen, Jie Zhou

Improving Input-label Mapping with Demonstration Replay for In-context Learning
Zhuocheng Gong, Jiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai, Dongyan Zhao, Rui Yan

Enhancing Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies
Linyong Nan, Yilun Zhao, Weijin Zou, Narutatsu Ri, Jaesung Tae, Ellen Zhang, Arman Cohan, Dragomir Radev

Cross-lingual Open-Retrieval Question Answering for African Languages
Odunayo Ogundepo, Tajuddeen Gwadabe, Clara Rivera, Jonathan Clark, Sebastian Ruder, David Adelani, Bonaventure Dossou, Abdou DIOP, Claytone Sikasote, Gilles Hacheme, Happy Buzaaba, Ignatius Ezeani, Rooweither Mabuya, Salomey Osei, Chris Emezue, Albert Kahira, Shamsuddeen Muhammad, Akintunde Oladipo, Abraham Owodunni, Atnafu Tonja, Iyanuoluwa Shode, Akari Asai, Anuoluwapo Aremu, Ayodele Awokoya, Bernard Opoku, Chiamaka Chukwuneke, Christine Mwase, Clemencia Siro, Stephen Arthur, Tunde Ajayi, Verrah Otiende, Andre Rubungo, Boyd Sinkala, Daniel Ajisafe, Emeka Onwuegbuzia, Falalu Lawan, Ibrahim Ahmad, Jesujoba Alabi, Chinedu Mbonu, Mofetoluwa Adeyemi, Mofya Phiri, Orevaoghene Ahia, Ruqayya Iro, Sonia Adhiambo

Viewing Knowledge Transfer in Multilingual Machine Translation Through a Representational Lens
David Stap, Vlad Niculae, Christof Monz

Aligning Predictive Uncertainty with Clarification Questions in Grounded Dialog
Kata Naszadi, Putra Manggala, Christof Monz

ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback
Wenxiang Jiao, Jen-tse Huang, Wenxuan Wang, Zhiwei He, Tian Liang, Xing Wang, Shuming Shi, Zhaopeng Tu

Dense Retrieval as Indirect Supervision for Large-space Decision Making
Nan Xu, Fei Wang, Mingtao Dong, Muhao Chen

One-Model-Connects-All: A Unified Graph Pre-Training Model for Online Community Modeling
Ruoxue Ma, Jiarong Xu, Xinnong Zhang, Haozhe Zhang, Zuyu Zhao, Qi Zhang, Xuanjing Huang, Zhongyu Wei

In-Image Neural Machine Translation with Segmented Pixel Sequence-to-Sequence Model
Yanzhi Tian, Xiang Li, Zeming Liu, Yuhang Guo, Bin Wang

NarrativeXL: a Large-scale Dataset for Long-Term Memory Models
Arsenii Moskvichev, Ky-Vinh Mai

Sub-network Discovery and Soft-masking for Continual Learning of Mixed Tasks
Zixuan Ke, Bing Liu, Wenhan Xiong, Asli Celikyilmaz, Haoran Li

PIVOINE: Instruction Tuning for Open-world Entity Profiling
Keming Lu, Xiaoman Pan, Kaiqiang Song, Hongming Zhang, Dong Yu, Jianshu Chen

DiQAD: A Benchmark Dataset for Open-domain Dialogue Quality Assessment
Yukun Zhao, Lingyong Yan, Weiwei Sun, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin

Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li, Yiran Liu, Xingxing Zhang, Wei Lu, Furu Wei

A Causal View of Entity Bias in (Large) Language Models
Fei Wang, Wenjie Mo, Yiwei Wang, Wenxuan Zhou, Muhao Chen

T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics
Yiwei Qin, Weizhe Yuan, Graham Neubig, Pengfei Liu

T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks
Iker García-Ferrero, Rodrigo Agerri, German Rigau

MTGER: Multi-view Temporal Graph Enhanced Temporal Reasoning over Time-Involved Document
Zheng Chu, Zekun Wang, Jiafeng Liang, Ming Liu, Bing Qin

Dialect Transfer for Swiss German Speech Translation
Claudio Paonessa, Yanick Schraner, Jan Deriu, Manuela Hürlimann, Manfred Vogel, Mark Cieliebak

Masked Path Modeling for Vision-and-Language Navigation
Zi-Yi Dou, Feng Gao, Nanyun Peng

Learning Interpretable Style Embeddings via Prompting LLMs
Ajay Patel, Delip Rao, Ansh Kothary, Kathleen McKeown, Chris Callison-Burch

GRACE: Discriminator-Guided Chain-of-Thought Reasoning
Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

RexUIE: A Recursive Method with Explicit Schema Instructor for Universal Information Extraction
Chengyuan Liu, Fubang Zhao, Yangyang Kang, Jingyuan Zhang, Xiang Zhou, Changlong Sun, Kun Kuang, Fei Wu

PromptARA: Improving Deep Representation in Hybrid Automatic Readability Assessment with Prompt and Orthogonal Projection
Jinshan Zeng, Xianglong Yu, Xianchao Tong, Wenyan Xiao

Incorporating Object-Level Visual Context for Multimodal Fine-Grained Entity Typing
Ying Zhang, Wenbo Fan, Kehui Song, Yu Zhao, Xuhui Sui, Xiaojie Yuan

Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data
Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl

Assessing Privacy Risks in Language Models: A Case Study on Summarization Tasks
Ruixiang Tang, Gord Lueck, Rodolfo Quispe, Huseyin Inan, Janardhan Kulkarni, Xia Hu

Extrapolating Multilingual Understanding Models as Multilingual Generators
Bohong Wu, Fei Yuan, Hai Zhao, Lei Li, Jingjing Xu

SAC$^3$: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
Jiaxin Zhang, Zhuohang Li, Kamalika Das, Bradley Malin, Sricharan Kumar

Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model
Parishad BehnamGhader, Santiago Miret, Siva Reddy

BERTwich: Extending BERT’s Capabilities to Model Dialectal and Noisy Text
Aarohi Srivastava, David Chiang

Closed Boundary Learning for Classification Tasks with the Universum Class
Hanzhang Zhou, Zijian Feng, Kezhi Mao

Revisiting Entropy Rate Constancy in Text
Vivek Verma, Nicholas Tomlin, Dan Klein

Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing
Yanlin Feng, Adithya Pratapa, David Mortensen

Learning Semantic Role Labeling from Compatible Label Sequences
Tao Li, Ghazaleh Kazeminejad, Susan Brown, Vivek Srikumar, Martha Palmer

QUADRo: Dataset and Models for QUestion-Answer Database Retrieval
Stefano Campese, Ivano Lauriola, Alessandro Moschitti

Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
Paul Youssef, Osman Koraş, Meijie Li, Jörg Schlötterer, Christin Seifert

Enhanced Simultaneous Machine Translation with Word-level Policies
Kang Kim, Hankyu Cho

Causal Intervention-based Few-Shot Named Entity Recognition
Zhen Yang, Yongbin Liu, Chunping Ouyang

TADI: Topic-aware Attention and Powerful Dual-encoder Interaction for Recall in News Recommendation
Junxiang Jiang

Unveiling the Power of Argument Arrangement in Online Persuasive Discussions
Nailia Mirzakhmedova, Johannes Kiesel, Khalid Al-Khatib, Benno Stein

FFAEval: Evaluating Dialogue System via Free-For-All Ranking
Zeyao Ma, Zijun Yao, Jing Zhang, Jifan Yu, Xiaohan Zhang, Juanzi Li, Jie Tang

Orca: A Few-shot Benchmark for Chinese Conversational Machine Reading Comprehension
Nuo Chen, Hongguang Li, Junqing He, Yinan Bao, Xinshi Lin, Qi Yang, Jianfeng Liu, Ruyi Gan, Jiaxing Zhang, Baoyuan Wang, Jia Li

VER: Unifying Verbalizing Entities and Relations
Jie Huang, Kevin Chang

Adversarial Text Generation by Search and Learning
Guoyi Li, Bingkang Shi, Zongzhen Liu, Dehan Kong, Yulei Wu, Xiaodan Zhang, Longtao Huang, Honglei Lyu

Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly
Sheng Lu, Shan Chen, Yingya Li, Danielle Bitterman, Guergana Savova, Iryna Gurevych

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities
Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu

A Thorough Examination on Zero-shot Dense Retrieval
Ruiyang Ren, Yingqi Qu, Jing Liu, Xin Zhao, Qifei Wu, Yuchen Ding, Hua Wu, Haifeng Wang, Ji-Rong Wen

Contrastive Pre-training for Personalized Expert Finding
Qiyao Peng, Hongtao Liu, Zhepeng Lv, Qing Yang, Wenjun Wang

Mitigating Intrinsic Named Entity-Related Hallucinations of Abstractive Text Summarization
Jianbin Shen, Junyu Xuan, Christy Liang

Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning
Hongfu Liu, Ye Wang

Short Papers

Time-Aware Representation Learning for Time-Sensitive Question Answering
Jungbin Son, Alice Oh

On the Relation between Sensitivity and Accuracy in In-Context Learning
Yanda Chen, Chen Zhao, Zhou Yu, Kathleen McKeown, He He

RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification
Junjie Ye, Jie Zhou, Junfeng Tian, Rui Wang, Qi Zhang, Tao Gui, Xuanjing Huang

SQLPrompt: In-Context Text-to-SQL with Minimal Labeled Data
Ruoxi Sun, Sercan Arik, Rajarishi Sinha, Hootan Nakhost, Hanjun Dai, Pengcheng Yin, Tomas Pfister

Trigger Warnings: Bootstrapping a Violence Detector for Fan Fiction
Magdalena Wolska, Matti Wiegmann, Christopher Schröder, Ole Borchardt, Benno Stein, Martin Potthast

Crossing the Aisle: Unveiling Partisan and Counter-Partisan Events in News Reporting
Kaijian Zou, Xinliang Zhang, Winston Wu, Nicholas Beauchamp, Lu Wang

Self-supervised Post-processing Method to Enrich Pretrained Word Vectors
Hwiyeol Jo

Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing
Yue Guo, Zian Xu, Yi Yang

Dissecting In-Context Learning of Translations in GPT-3
Vikas Raunak, Arul Menezes, Hany Awadalla

Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
Revanth Reddy, Hao Bai, Wentao Yao, Sharath Chandra Etagi Suresh, Heng Ji, ChengXiang Zhai

Is a Prestigious Job the same as a Prestigious Country? A Case Study on Multilingual Sentence Embeddings and European Countries
Jindřich Libovický

The Past, Present, and Future of Typological Databases in NLP
Emi Baylor, Esther Ploeger, Johannes Bjerva

SoulChat: Improving LLMs’ Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations
Yirong Chen, Xiaofen Xing, Jingkai Lin, Huimin Zheng, Zhenyu Wang, Qi Liu, Xiangmin Xu

DocAsRef: An Empirical Study on Repurposing Reference-based Summary Quality Metrics as Reference-free Metrics
Forrest Bao, Ruixuan Tu, Ge Luo, Yinfei Yang, Hebi Li, Minghui Qiu, Youbiao He, Cen Chen

Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term
Yi-Li Hsu, Shih-Chieh Dai, Aiping Xiong, Lun-Wei Ku

Perceptual Structure in the absence of grounding: the impact of abstractedness and subjectivity in color language for LLMs
Pablo Loyola, Edison Marrese-Taylor, Andres Hoyos-Idrobo

A Dataset for Investigating the Impact of Context for Offensive Language Detection in Tweets
Musa İhtiyar, Ömer Özdemir, Mustafa Erengül, Arzucan Özgür

Remember what you did so you know what to do next
Manuel Ciosici, Alex Hedges, Yash Kankanampati, Justin Martin, Marjorie Freedman, Ralph Weischedel

Learning to Abstract with Nonparametric Variational Information Bottleneck
Melika Behjati, Fabio Fehr, James Henderson

Exploring the Cognitive Knowledge Structure of Large Language Models: An Educational Diagnostic Assessment Approach
Zheyuan Zhang, Jifan Yu, Juanzi Li, Lei Hou

Simple Hardware-Efficient PCFGs with Independent Left and Right Productions
Wei Liu, Songlin Yang, Yoon Kim, Kewei Tu

Adapter Pruning using Tropical Characterization
Rishabh Bhardwaj, Tushar Vaidya, Soujanya Poria

Can Brain Signals Reveal Inner Alignment with Human Languages?
Jielin Qiu, William Han, Jiacheng Zhu, Mengdi Xu, Douglas Weber, Bo Li, Ding Zhao

Transformer-Based Language Model Surprisal Predicts Human Reading Times Best with About Two Billion Training Tokens
Byung-Doh Oh, William Schuler

MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition
Besnik Fetahu, Zhiyu Chen, Sudipta Kar, Oleg Rokhlenko, Shervin Malmasi

Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models
Jaeyoung Choe, Keonwoong Noh, Nayeon Kim, Seyun Ahn, Woohwan Jung

Smart “Chef”: Verifying the Effect of Role-based Paraphrasing for Aspect Term Extraction
Jiaxiang Chen, Yu Hong, Qingting Xu, Jianmin Yao

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Xingshan Zeng, Liangyou Li, Qun Liu

No offence, Bert - I insult only humans! Multilingual sentence-level attack on toxicity detection networks
Sergey Berezin, Reza Farahbakhsh, Noel Crespi

Improved Training of Deep Text Clustering
Zonghao Yang, Wenpeng Hu, Yushan Tan, Zhunchen Luo

Multi-Granularity Information Interaction Framework for Incomplete Utterance Rewriting
Haowei Du, Dinghao Zhang, Chen Li, Yang Li, Dongyan Zhao

For Generated Text, Is NLI-Neutral Text the Best Text?
Michail Mersinias, Kyle Mahowald

Combining Counting Processes and Classification Improves a Stopping Rule for Technology Assisted Review
Reem Hezam, Mark Stevenson

Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection
Zhengyuan Liu, Hai Chieu, Nancy Chen

Hiding in Plain Sight: Tweets with Hate Speech Masked by Homoglyphs
Portia Cooper, Mihai Surdeanu, Eduardo Blanco

On Uncertainty Calibration and Selective Generation in Probabilistic Neural Summarization: A Benchmark Study
Polina Zablotskaia, Du Phan, Joshua Maynez, Shashi Narayan, Jie Ren, Jeremiah Liu

Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning
An-Zi Yen, Wei-Ling Hsu

On Surgical Fine-tuning for Language Encoders
Abhilasha Lodha, Gayatri Belapurkar, Saloni Chalkapurkar, Yuanming Tao, Reshmi Ghosh, Samyadeep Basu, Dmitrii Petrov, Soundararajan Srinivasan

Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering
Sukmin Cho, Jeongyeon Seo, Soyeong Jeong, Jong Park

Extractive Summarization via ChatGPT for Faithful Summary Generation
Haopeng Zhang, Xiao Liu, Jiawei Zhang

The Law and NLP: Bridging Disciplinary Disconnects
Robert Mahari, Dominik Stammbach, Elliott Ash, Alex Pentland

ASPIRO: Any-shot Structured Parsing-error-Induced ReprOmpting for Consistent Data-to-Text Generation
Martin Vejvar, Yasutaka Fujimoto

Test-time Augmentation for Factual Probing
Go Kamoda, Benjamin Heinzerling, Keisuke Sakaguchi, Kentaro Inui

Exploring the Sensitivity of LLMs’ Decision-Making Capabilities: Insights from Prompt Variations and Hyperparameters
Manikanta Loya, Divya Sinha, Richard Futrell

FinePrompt: Unveiling the Role of Finetuned Inductive Bias on Compositional Reasoning in GPT-4
Jeonghwan Kim, Giwon Hong, Sung-Hyon Myaeng, Joyce Whang

IndiSocialFT: Multilingual Word Representation for Indian languages in code-mixed environment
Saurabh Kumar, Ranbir Sanasam, Sukumar Nandi

Adaptive Hinge Balance Loss for Document-Level Relation Extraction
Jize Wang, Xinyi Le, Xiaodi Peng, Cailian Chen

Low-Resource Comparative Opinion Quintuple Extraction by Data Augmentation with Prompting
Qingting Xu, Yu Hong, Fubang Zhao, Kaisong Song, Yangyang Kang, Jiaxiang Chen, Guodong Zhou

Hallucination Detection for Grounded Instruction Generation
Lingjun Zhao, Khanh Nguyen, Hal Daumé III

Definitions Matter: Guiding GPT for Multi-label Classification
Youri Peskine, Damir Korenčić, Ivan Grubisic, Paolo Papotti, Raphael Troncy, Paolo Rosso

Chain-of-Thought Embeddings for Stance Detection on Social Media
Joseph Gatto, Omar Sharif, Sarah Preum

Using LLM for Improving Key Event Discovery: Temporal-Guided News Stream Clustering with Event Summaries
Nishanth Nakshatri, Siyi Liu, Sihao Chen, Dan Roth, Dan Goldwasser, Daniel Hopkins

Women Wearing Lipstick: Measuring the Bias Between an Object and Its Related Gender
Ahmed Sabir, Lluís Padró

Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting
Zhiyu Chen, Yujie Lu, William Wang

Information Extraction from Legal Wills: How Well Does GPT-4 Do?
Alice Kwak, Cheonkam Jeong, Gaetano Forte, Derek Bambauer, Clayton Morrison, Mihai Surdeanu

Reasoning about Ambiguous Definite Descriptions
Stefan Schouten, Peter Bloem, Ilia Markov, Piek Vossen

Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number
Sophie Hao, Tal Linzen

MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic
Damien Sileo, Antoine Lernould

LATENTLOGIC: Learning Logic Rules in Latent Space over Knowledge Graphs
Junnan Liu, Qianren Mao, Chenghua Lin, Yangqiu Song, Jianxin Li

More than Votes? Voting and Language based Partisanship in the US Supreme Court
Biaoyan Fang, Trevor Cohn, Timothy Baldwin, Lea Frermann

ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation
Zi Lin, Zihan Wang, Yongqi Tong, Yangkun Wang, Yuxin Guo, Yujia Wang, Jingbo Shang

Improving Contrastive Learning of Sentence Embeddings with Focal InfoNCE
Pengyue Hou, Xingyu Li

SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes
Adam Lelkes, Eric Loreaux, Tal Schuster, Ming-Jun Chen, Alvin Rajkomar

On the Zero-Shot Generalization of Machine-Generated Text Detectors
Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, Tianxing He

Medical Text Simplification: Optimizing for Readability with Unlikelihood Training and Reranked Beam Search Decoding
Lorenzo Jaime Yu Flores, Heyuan Huang, Kejian Shi, Sophie Chheang, Arman Cohan

Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model
Joanna Hong, Se Park, Yong Ro

CCIM: Cross-modal Cross-lingual Interactive Image Translation
Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong

TRAMS: Training-free Memory Selection for Long-range Language Modeling
Haofei Yu, Cunxiang Wang, Yue Zhang, Wei Bi

Robustness Tests for Automatic Machine Translation Metrics with Adversarial Attacks
Yichen Huang, Timothy Baldwin

Non-Compositionality in Sentiment: New Data and Analyses
Verna Dankers, Christopher Lucas

Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information
Alla Chepurova, Aydar Bulatov, Yuri Kuratov, Mikhail Burtsev

Don’t waste a single annotation: improving single-label classifiers through soft labels
Ben Wu, Yue Li, Yida Mu, Carolina Scarton, Kalina Bontcheva, Xingyi Song

INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations
Anil Ramakrishna, Rahul Gupta, Jens Lehmann, Morteza Ziyadi

GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions
Ting-Yao Hsu, Chieh-Yang Huang, Ryan Rossi, Sungchul Kim, C. Giles, Ting-Hao Huang

HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning
Yongjin Yang, Joonkee Kim, Yujin Kim, Namgyu Ho, James Thorne, Se-Young Yun

ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction
Yaorui Shi, An Zhang, Enzhi Zhang, Zhiyuan Liu, Xiang Wang

Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games
Ruoyao Wang, Peter Jansen

Enhancing Reasoning Capabilities by Instruction Learning and Chain-of-Thoughts for Implicit Discourse Relation Recognition
Yuxiang Lu, Yu Hong, Zhipang Wang, Guodong Zhou

Topic-Informed Dialogue Summarization using Topic Distribution and Prompt-based Modeling
Jaeah You, Youngjoong Ko

NERvous About My Health: Constructing a Bengali Medical Named Entity Recognition Dataset
Alvi Khan, Fida Kamal, Nuzhat Nower, Tasnim Ahmed, Sabbir Ahmed, Tareque Chowdhury

Domain Private Transformers for Multi-Domain Dialog Systems
Anmol Kabra, Ethan Elenberg

Strong and Efficient Baselines for Open Domain Conversational Question Answering
Andrei Coman, Gianni Barlacchi, Adrià de Gispert

From Relevance to Utility: Evidence Retrieval with Feedback for Fact Verification
Hengran Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng

Discovering Highly Influential Shortcut Reasoning: An Automated Template-Free Approach
Daichi Haraguchi, Kiyoaki Shirai, Naoya Inoue, Natthawut Kertkeidkachorn

Evaluating the Knowledge Base Completion Potential of GPT
Blerta Veseli, Simon Razniewski, Jan-Christoph Kalo, Gerhard Weikum

Late Fusion of Transformers for Sentiment Analysis of Code-Switched Data
Gagan Sharma, R Chinmay, Raksha Sharma

GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves
Denis Peskoff, Adam Visokay, Sander Schulhoff, Benjamin Wachspress, Alan Blinder, Brandon Stewart

Unmasking the Hidden Meaning: Bridging Implicit and Explicit Hate Speech Embedding Representations
Nicolas Ocampo, Elena Cabrio, Serena Villata

Comparing Prompt-Based and Standard Fine-Tuning for Urdu Text Classification
Faizad Ullah, Ubaid Azam, Ali Faheem, Faisal Kamiran, Asim Karim

Explore the Way: Exploring Reasoning Path by Bridging Entities for Effective Cross-Document Relation Extraction
Junyoung Son, Jinsung Kim, Jungwoo Lim, Yoonna Jang, Heuiseok Lim

Entity Disambiguation on a Tight Labeling Budget
Audi Primadhanty, Ariadna Quattoni

Pseudointelligence: A Unifying Lens on Language Model Evaluation
Shikhar Murty, Orr Paradise, Pratyusha Sharma

Implicit Sense-labeled Connective Recognition as Text Generation
Yui Oka, Tsutomu Hirao

Dynamic Stashing Quantization for Efficient Transformer Training
Guo Yang, Daniel Lo, Robert Mullins, Yiren Zhao

A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam, Miaoran Zhang, Dietrich Klakow

Towards Formality-Aware Neural Machine Translation by Leveraging Context Information
Dohee Kim, Yujin Baek, Soyoung Yang, Jaegul Choo

Exploring the Potential of Large Language Models in Generating Code-Tracing Questions for Introductory Programming Courses
Aysa Fan, Haoran Zhang, Luc Paquette, Rui Zhang

Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefix
Kuan-Hao Huang, Liang Tan, Rui Hou, Sinong Wang, Amjad Almahairi, Ruty Rinott

Enhancing Argument Structure Extraction with Efficient Leverage of Contextual Information
Yun Luo, Zhen Yang, Fandong Meng, Yingjie Li, Jie Zhou, Yue Zhang

Are Language Models Worse than Humans at Following Prompts? It’s Complicated
Albert Webson, Alyssa Loo, Qinan Yu, Ellie Pavlick

Generating Extractive Answers: Gated Recurrent Memory Reader for Conversational Question Answering
Xuanyu Zhang, Qing Yang

Data Selection Curriculum for Abstractive Text Summarization
Shichao Sun, Ruifeng Yuan, Jianfei He, Ziqiang Cao, Wenjie Li, Xiaohua Jia

Romanization-based Large-scale Adaptation of Multilingual Language Models
Sukannya Purkayastha, Sebastian Ruder, Jonas Pfeiffer, Iryna Gurevych, Ivan Vulić

Efficient k-NN Search with Cross-Encoders using Adaptive Multi-Round CUR Decomposition
Nishant Yadav, Nicholas Monath, Manzil Zaheer, Andrew McCallum

“You Are An Expert Linguistic Annotator”: Limits of LLMs as Analyzers of Abstract Meaning Representation
Allyson Ettinger, Jena Hwang, Valentina Pyatkin, Chandra Bhagavatula, Yejin Choi

Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design
Henry Sprueill, Carl Edwards, Mariefel Olarte, Udishnu Sanyal, Heng Ji, Sutanay Choudhury

Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers
Mosh Levy, Shauli Ravfogel, Yoav Goldberg

Intersectional Stereotypes in Large Language Models: Dataset and Analysis
Weicheng Ma, Brian Chiang, Tong Wu, Lili Wang, Soroush Vosoughi

COUNT: COntrastive UNlikelihood Text Style Transfer for Text Detoxification
Mohammad Mahdi Abdollah Pour, Parsa Farinneya, Manasa Bharadwaj, Nikhil Verma, Ali Pesaranghader, Scott Sanner

Role of Context in Unsupervised Sentence Representation Learning: the Case of Dialog Act Modeling
Rastislav Hronsky, Emmanuel Keuleers

Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking
Shengyao Zhuang, Bing Liu, Bevan Koopman, Guido Zuccon

On General Language Understanding
David Schlangen

A Closer Look into Using Large Language Models for Automatic Evaluation
Cheng-Han Chiang, Hung-yi Lee

Natural Language Annotations for Reasoning about Program Semantics
Marco Zocca

Energy and Carbon Considerations of Fine-Tuning BERT
Xiaorong Wang, Clara Na, Emma Strubell, Sorelle Friedler, Sasha Luccioni

Ranking LLM-Generated Loop Invariants for Program Verification
Saikat Chakraborty, Shuvendu Lahiri, Sarah Fakhoury, Akash Lal, Madanlal Musuvathi, Aseem Rastogi, Aditya Senthilnathan, Rahul Sharma, Nikhil Swamy

WordNet Is All You Need: A Surprisingly Effective Unsupervised Method for Graded Lexical Entailment
Joseph Renner, Pascal Denis, Rémi Gilleron

Knowledge Corpus Error in Question Answering
Yejoon Lee, Philhoon Oh, James Thorne

In-Context Learning Creates Task Vectors
Roee Hendel, Mor Geva, Amir Globerson

Unsupervised Lexical Simplification with Context Augmentation
Takashi Wada, Timothy Baldwin, Jey Lau

mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences
David Uthus, Santiago Ontanon, Joshua Ainslie, Mandy Guo

KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models
Jiho Kim, Yeonsu Kwon, Yohan Jo, Edward Choi

A Novel Contrastive Learning Method for Clickbait Detection on RoCliCo: A Romanian Clickbait Corpus of News Articles
Daria Broscoteanu, Radu Ionescu

A Spectral Viewpoint on Continual Relation Extraction
Huy Nguyen, Chien Nguyen, Linh Ngo, Anh Luu, Thien Nguyen

DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models
Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng Kong

M2C: Towards Automatic Multimodal Manga Complement
Hongcheng Guo, Boyang Wang, Jiaqi Bai, Jiaheng Liu, Jian Yang, Zhoujun Li

LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
Shih-Chieh Dai, Aiping Xiong, Lun-Wei Ku

LLM aided semi-supervision for efficient Extractive Dialog Summarization
Nishant Mishra, Gaurav Sahu, Iacer Calixto, Ameen Abu-Hanna, Issam Laradji

CCEval: A Representative Evaluation Benchmark for the Chinese-centric Multilingual Machine Translation
Lianzhang Lou, Xi Yin, Yutao Xie, Yang Xiang

Hierarchical Prompting Assists Large Language Model on Web Navigation
Abishek Sridhar, Robert Lo, Frank Xu, Hao Zhu, Shuyan Zhou

Can Large Language Models Fix Data Annotation Errors? An Empirical Study Using Debatepedia for Query-Focused Text Summarization
Md Tahmid Rahman Laskar, Mizanur Rahman, Israt Jahan, Enamul Hoque, Jimmy Huang

TSTR: Target Similarity Tuning Meets the Real World
Anirudh Khatry, Sumit Gulwani, Priyanshu Gupta, Vu Le, Mukul Singh, Ananya Singha, Gust Verbruggen

CReTIHC: Designing Causal Reasoning Tasks about Temporal Interventions and Hallucinated Confoundings
Changwoo Chun, SongEun Lee, Jaehyung Seo, Heuiseok Lim

Entity-Based Evaluation of Political Bias in Automatic Summarization
Karen Zhou, Chenhao Tan

Improving Low-resource Question Answering by Augmenting Question Information
Andong Chen, Yuan Sun, Xiaobing Zhao, Rosella Galindo Esparza, Kehai Chen, Yang Xiang, Tiejun Zhao, Min Zhang

DemoNSF: A Multi-task Demonstration-based Generative Framework for Noisy Slot Filling Task
Guanting Dong, Tingfeng Hui, Zhuoma GongQue, Jinxu Zhao, Daichi Guo, Gang Zhao, Keqing He, Weiran Xu

SHARCS: Efficient Transformers Through Routing with Dynamic Width Sub-networks
Mohammadreza Salehi, Sachin Mehta, Aditya Kusupati, Ali Farhadi, Hannaneh Hajishirzi

Always the Best Fit: Adaptive Domain Gap Filling from Causal Perspective for Few-Shot Relation Extraction
Ge Bai, Chenji Lu, Jiaxiang Geng, Shilong Li, Yidong Shi, Xiyan Liu, Ying Liu, Zhang Zhang, Ruifang Liu

Ecologically Valid Explanations for Label Variation in NLI
Nan-Jiang Jiang, Chenhao Tan, Marie-Catherine de Marneffe

A Benchmark for Semi-Inductive Link Prediction in Knowledge Graphs
Adrian Kochsiek, Rainer Gemulla

NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark
Oscar Sainz, Jon Campos, Iker García-Ferrero, Julen Etxaniz, Oier Lopez de Lacalle, Eneko Agirre

Improving Pacing in Long-Form Story Planning
Yichen Wang, Kevin Yang, Xiaoming Liu, Dan Klein

Robustness of Named-Entity Replacements for In-Context Learning
Saeed Goodarzi, Nikhil Kagita, Dennis Minn, Shufan Wang, Roberto Dessi, Shubham Toshniwal, Adina Williams, Jack Lanchantin, Koustuv Sinha

Contrastive Learning-based Sentence Encoders Implicitly Weight Informative Words
Hiroto Kurita, Goro Kobayashi, Sho Yokoi, Kentaro Inui

EMO-KNOW: A Large Scale Dataset on Emotion-Cause
Mia Nguyen, Yasith Samaradivakara, Prasanth Sasikumar, Chitralekha Gupta, Suranga Nanayakkara

Treepiece: Faster Semantic Parsing via Tree Tokenization
Sid Wang, Akshat Shrivastava, Aleksandr Livshits

Semantic Parsing by Large Language Models for Intricate Updating Strategies of Zero-Shot Dialogue State Tracking
Yuxiang Wu, Guanting Dong, Weiran Xu

Mitigating Framing Bias with Polarity Minimization Loss
Yejin Bang, Nayeon Lee, Pascale Fung

Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning
Duarte Alves, Nuno Guerreiro, João Alves, José Pombal, Ricardo Rei, José de Souza, Pierre Colombo, Andre Martins

Improving Span Representation by Efficient Span-Level Attention
Pengyu Ji, Songlin Yang, Kewei Tu

Words, Subwords, and Morphemes: What Really Matters in the Surprisal-Reading Time Relationship?
Sathvik Nair, Philip Resnik

Enhancing Accessible Communication: from European Portuguese to Portuguese Sign Language
Catarina Sousa, Luisa Coheur, Mara Moita

Decoding Stumpers: Large Language Models vs. Human Problem-Solvers
Alon Goldstein, Miriam Havin, Roi Reichart, Ariel Goldstein

Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a ChatGPT and Bard Newspaper
Cristina España-Bonet

Identifying Early Maladaptive Schemas from Mental Health Question Texts
Sujatha Gollapalli, Beng Ang, See-Kiong Ng

Syntax Matters: Towards Spoken Language Understanding via Syntax-Aware Attention
Yifeng Xie, Zhihong Zhu, Xuxin Cheng, Zhiqi Huang, Dongsheng Chen

Improving Consistency for Text Summarization with Energy Functions
Qi Zeng, Qingyu Yin, Zheng Li, Yifan Gao, Sreyashi Nag, Zhengyang Wang, Bing Yin, Heng Ji, Chao Zhang

Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts
Deepanway Ghosal, Navonil Majumder, Roy Lee, Rada Mihalcea, Soujanya Poria

XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words
Robin Algayres, Pablo Diego-Simon, Benoît Sagot, Emmanuel Dupoux

One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging for Cross-Lingual Transfer
Fabian Schmidt, Ivan Vulić, Goran Glavaš

LLMs – the Good, the Bad or the Indispensable?: A Use Case on Legal Statute Prediction and Legal Judgment Prediction on Indian Court Cases
Shaurya Vats, Atharva Zope, Somsubhra De, Anurag Sharma, Upal Bhattacharya, Shubham Nigam, Shouvik Guha, Koustav Rudra, Kripabandhu Ghosh

Probing Representations for Document-level Event Extraction
Barry Wang, Xinya Du, Claire Cardie

Re-weighting Tokens: A Simple and Effective Active Learning Strategy for Named Entity Recognition
Haocheng Luo, Wei Tan, Ngoc Nguyen, Lan Du

CompleQA: Benchmarking the Impacts of Knowledge Graph Completion Methods on Question Answering
Donghan Yu, Yu Gu, Chenyan Xiong, Yiming Yang

Improving Multi-Criteria Chinese Word Segmentation through Learning Sentence Representation
Chun Lin, Ying-Jia Lin, Chia-Jen Yeh, Yi-Ting Li, Ching Yang, Hung-Yu Kao

Probing the “Creativity” of Large Language Models: Can models produce divergent semantic association?
Honghua Chen, Nai Ding

On Event Individuation for Document-Level Information Extraction
William Gantt, Reno Kriz, Yunmo Chen, Siddharth Vashishtha, Aaron White

Efficient Data Learning for Open Information Extraction with Pre-trained Language Models
Zhiyuan Fan, Shizhu He

Towards Zero-shot Learning for End-to-end Cross-modal Translation Models
Jichen Yang, Kai Fan, Minpeng Liao, Boxing Chen, Zhongqiang Huang

IMU2CLIP: Language-grounded Motion Sensor Translation with Multimodal Contrastive Learning
Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Aparajita Saraf, Amy Bearman, Babak Damavandi

Systematic Assessment of Factual Knowledge in Large Language Models
Linhao Luo, Trang Vu, Dinh Phung, Reza Haf

Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks
Kaiser Sun, Peng Qi, Yuhao Zhang, Lan Liu, William Wang, Zhiheng Huang

LogicAttack: Adversarial Attacks for Evaluating Logical Consistency of Natural Language Inference
Mutsumi Nakamura, Santosh Mashetty, Mihir Parmar, Neeraj Varshney, Chitta Baral

Synthesize, if you do not have: Effective Synthetic Dataset Creation Strategies for Self-Supervised Opinion Summarization in E-commerce
Tejpalsingh Siledar, Suman Banerjee, Amey Patil, Sudhanshu Singh, Muthusamy Chelliah, Nikesh Garera, Pushpak Bhattacharyya

Consistency is Key: On Data-Efficient Modality Transfer in Speech Translation
Hojin Lee, Changmin Lee, Seung-won Hwang

InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators
Heng Yang, Ke Li

Re-Examining Summarization Evaluation across Multiple Quality Criteria
Ori Ernst, Ori Shapira, Ido Dagan, Ran Levy

Learning to love diligent trolls: Accounting for rater effects in the dialogue safety task
Michael Ilagan

Who Wrote it and Why? Prompting Large-Language Models for Authorship Verification
Chia-Yu Hung, Zhiqiang Hu, Yujia Hu, Roy Lee

Transitioning Representations between Languages for Cross-lingual Event Detection via Langevin Dynamics
Chien Nguyen, Huy Nguyen, Franck Dernoncourt, Thien Nguyen

Arabic Mini-ClimateGPT: A Climate Change and Sustainability Tailored Arabic LLM
Sahal Mullappilly, Abdelrahman Shaker, Omkar Thawakar, Hisham Cholakkal, Rao Anwer, Salman Khan, Fahad Khan

DocSplit: Simple Contrastive Pretraining for Large Document Embeddings
Yujie Wang, Mike Izbicki

TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
Shubhra Kanti Karmaker Santu, Dongji Feng

Is the Answer in the Text? Challenging ChatGPT with Evidence Retrieval from Instructive Text
Sophie Henning, Talita Anthonio, Wei Zhou, Heike Adel, Mohsen Mesgar, Annemarie Friedrich

PaRaDe: Passage Ranking using Demonstrations with LLMs
Andrew Drozdov, Honglei Zhuang, Zhuyun Dai, Zhen Qin, Razieh Rahimi, Xuanhui Wang, Dana Alon, Mohit Iyyer, Andrew McCallum, Donald Metzler, Kai Hui

Investigating the Effectiveness of Multiple Expert Models Collaboration
Ikumi Ito, Takumi Ito, Jun Suzuki, Kentaro Inui

Emergent Inabilities? Inverse Scaling Over the Course of Pretraining
James Michaelov, Ben Bergen

ICU: Conquering Language Barriers in Vision-and-Language Modeling by Dividing the Tasks into Image Captioning and Language Understanding
Guojun Wu

Cache me if you Can: an Online Cost-aware Teacher-Student framework to Reduce the Calls to Large Language Models
Ilias Stogiannidis, Stavros Vassos, Prodromos Malakasiotis, Ion Androutsopoulos

Dialogue Act-Aided Backchannel Prediction Using Multi-Task Learning
Wencke Liermann, Yo-Han Park, Yong-Seok Choi, Kong Lee

mReFinED: An Efficient End-to-End Multilingual Entity Linking System
Peerat Limkonchotiwat, Weiwei Cheng, Christos Christodoulopoulos, Amir Saffari, Jens Lehmann

Emptying the Ocean with a Spoon: Should We Edit Models?
Yuval Pinter, Michael Elhadad

MSCFFN: A New FFN with Multi-Space Cross to Accelerate Transformer
Tang Dongge, Qing Yang

Exploring Context-Aware Evaluation Metrics for Machine Translation
Xinyu Hu, Xunjian Yin, Xiaojun Wan

QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering
Haochen Shi, Weiqi Wang, Tianqing Fang, Baixuan Xu, Wenxuan Ding, Xin Liu, Yangqiu Song

Does Listener Gaze in Face-to-Face Interaction Follow the Entropy Rate Constancy Principle: An Empirical Study
Yu Wang, Hendrik Buschmeier

BERT Has More to Offer: BERT Layers Combination Yields Better Sentence Embeddings
MohammadSaleh Hosseini, Munawara Munia, Latifur Khan

Test-Time Self-Adaptive Small Language Models for Question Answering
Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Hwang, Jong Park

ExpNote: Black-box Large Language Models are better Task Solvers with Experience Notebook
Wangtao Sun, Xuanqing Yu, Shizhu He, Jun Zhao, Kang Liu

Evaluating Parameter-Efficient Finetuning Approaches for Pre-trained Models on the Financial Domain
Isabella Olariu, Cedric Lothritz, Jacques Klein, Tegawendé Bissyandé, Siwen Guo, Shohreh Haddadan

Is ChatGPT the ultimate Data Augmentation Algorithm?
Frédéric Piedboeuf, Philippe Langlais

The Linearity of the Effect of Surprisal on Reading Times across Languages
Weijie Xu, Jason Chon, Tianran Liu, Richard Futrell

Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration
Ercong Nie, Helmut Schmid, Hinrich Schuetze