Industry Track
Accepted Papers
BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis
Tingfeng Cao, Chengyu Wang, Bingyan Liu, Ziheng Wu, Jinhui Zhu and jun huang
Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation
Chenhui Mao, Xiexiong Lin, Xin Jin and Xin Zhang
A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models
Takuma Udagawa, Aashka Trivedi, Michele Merler and Bishwaranjan Bhattacharjee
Towards Effective Automatic Debt Collection with Persona Awareness
Tong Zhang, Junhong Liu, Chen Huang, Jia Liu, Hongru Liang, Zujie Wen and Wenqiang Lei
Gatekeeper to save COGS and improve efficiency of Text Prediction
Nidhi Tiwari, Sneha Kola, Milos Milunovic, Si-qing Chen and Marjan Slavkovski
Efficient Transformer Knowledge Distillation: A Performance Review
Nathan Brown, Ashton Williamson, Tahj Anderson and Logan Lawrence
CDD: A Large Scale Dataset for Legal Intelligence Research
Changzhen Ji, Yating Zhang, Adam Jatowt and Haipang Wu
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning
Noé Tits
Personalized Dense Retrieval on Global Index for Voice-enabled Conversational Systems
Masha Belyi, Charlotte Dzialo, Chaitanya Dwivedi, Prajit Reddy Muppidi and Kanna Shimizu
Text2Topic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities
Fengjun Wang, Moran Beladev, Ofri Kleinfeld, Elina Frayerman, Tal Shachar, Eran Fainman, Karen Lastmann Assaraf, Sarai Mizrachi and Benjamin Wang
Deep Metric Learning to Hierarchically Rank - An Application in Product Retrieval
KeeKiat Koo, Ashutosh Joshi, Nishaanth Reddy, Karim Bouyarmane, Ismail Tutar, Vaclav Petricek and Changhe Yuan
A Pretrained Language Model for Cyber Threat Intelligence
Youngja Park and Weiqiu You
SAMP: A Model Inference Toolkit of Post-Training Quantization for Text Processing via Self-Adaptive Mixed-Precision
Rong Tian, Zijing Zhao, Weijie Liu, Haoyan Liu, Weiquan Mao, Zhe Zhao and Kan Zhou
KD-Boost: Boosting Real-Time Semantic Matching in E-commerce with Knowledge Distillation
Sanjay Agrawal, Vivek Sembium and Ankith M S
Multi-teacher Distillation for Multilingual Spelling Correction
Jingfen Zhang, Xuan Guo, Sravan Bodapati and Christopher Potts
Does Named Entity Recognition Truly Not Scale Up to Real-world Product Attribute Extraction?
Wei-Te Chen, Keiji Shinzato, Naoki Yoshinaga and Yandi Xia
Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios
Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang and Arman Cohan
TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce
Tongxin Hu, Zhuang Li, Xin Jin, Lizhen Qu and Xin Zhang
Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations
Zhengyuan Liu, Siti Umairah Md Salleh, Hong Choon Oh, Pavitra Krishnaswamy and Nancy Chen
AdapterDistillation: Non-Destructive Task Composition with Knowledge Distillation
Junjie Wang, Yicheng Chen, Wangshu Zhang, Sen Hu, Teng Xu and Jing Zheng
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction
Yuqing Wang, Prashanth Vijayaraghavan and Ehsan Degan
Retrieval-Enhanced Dual Encoder Training for Product Matching
Justin Chiu
WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
JUN-YAN HE, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Wangmeng Xiang, Xianhui Lin, Xiaoyang Kang, Zengke Jin, Yusen Hu, Bin Luo, Yifeng Geng, xuansong xie and Jingren Zhou
Lattice Path Edit Distance: A Romanization-aware Edit Distance for Extracting Misspelling-Correction Pairs from Japanese Search Query Logs
Nobuhiro Kaji
Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu and Haifeng Wang
Unveiling Identity Biases in Toxicity Detection : A Game-Focused Dataset and Reactivity Analysis Approach
Josiane Van Dorpe, Zachary Yang, Nicolas Grenon-Godbout and Grégoire Winterstein
ORANGE: Text-video Retrieval via Watch-time-aware Heterogeneous Graph Contrastive Learning
Yucheng Lin, Tim Chang, Yaning Chang, Jianqiang Ma, Donghui Li, Ting Peng, Zang Li, Zhiyi Zhou and Feng Wang
Compute-Efficient Churn Reduction for Conversational Agents
Christopher Hidey and Sarthak Sarthak
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Fangkai Yang, Pu Zhao, Zezhong WANG, Lu Wang, Bo Qiao, Jue Zhang, Mohit Garg, Qingwei Lin, Saravan Rajmohan and Dongmei Zhang
Enhancing Extreme Multi-Label Text Classification: Addressing Challenges in Model, Data, and Evaluation
Dan Li, Zi Long Zhu, Janneke van de Loo, Agnes Masip Gomez, Vikrant Yadav, Georgios Tsatsaronis and Zubair Afzal
Query-aware Multi-modal based Ranking Relevance in Video Search
chengcan ye, Ting Peng, Tim Chang, Zhiyi Zhou and Feng Wang
Coordinated Replay Sample Selection for Continual Federated Learning
Jack Good, Jimit Majmudar, Christophe Dupuy, Jixuan Wang, Charith Peris, Clement Chung, Richard Zemel and Rahul Gupta
Building Real-World Meeting Summarization Systems using Large Language Models: A Practical Perspective
Md Tahmid Rahman Laskar, Xue-Yong Fu, Cheng Chen and Shashi Bhushan TN
Creator Context for Tweet Recommendation
Spurthi Amba Hombaiah, Tao Chen, Mingyang Zhang, Michael Bendersky, Marc Najork, Matt Colen, Sergey Levi, Vladimir Ofitserov and Tanvir Amin
AdaBERT-CTC: Leveraging BERT-CTC for Text-Only Domain Adaptation in ASR
Tyler Vuong, Karel Mundnich, Dhanush Bekal, Veera Raghavendra Elluru, Srikanth Ronanki and Sravan Bodapati
Conversing with databases: Practical Natural Language Querying
Denis Kochedykov, Fenglin Yin and Sreevidya Khatravath
AART: AI-Assisted Red-Teaming with Diverse Data Generation for New LLM-powered Applications
Bhaktipriya Radharapu, Kevin Robinson, Lora Aroyo and Preethi Lahoti
Speakerly: A Voice-based Writing Assistant for Text Composition
Dhruv Kumar, Vipul Raheja, Alice Kaiser-Schatzlein, Robyn Perry, Apurva Joshi, Justin Hugues-Nuger, Samuel Lou and Navid Chowdhury
Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks
Xianzhi Li, Samuel Chan, Xiaodan Zhu, Yulong Pei, Zhiqiang Ma, Xiaomo Liu and Sameena Shah
CL-QR: Cross-Lingual Enhanced Query Reformulation for Multi-lingual Conversational AI Agents
Zhongkai Sun, Zhengyang Zhao, Sixing Lu, Chengyuan Ma, Xiaohu Liu, Xing Fan, Wei Shen and Chenlei Guo
Improving Contextual Query Rewrite for Conversational AI Agents through User-preference Feedback Learning
Zhongkai Sun, Yingxue Zhou, Jie Hao, Xing Fan, Yanbin Lu, Chengyuan Ma, Wei Shen and Chenlei Guo
Scaling Neural ITN for Numbers and Temporal Expressions in Tamil: Findings for an Agglutinative Low-resource Language
Bhavuk Singhal, Sindhuja Gopalan, Amrith Krishna and Malolan Chetlur
EELBERT: Tiny Models through Dynamic Embeddings
Gabrielle Cohn, Rishika Agarwal, Deepanshu Gupta and Siddharth Patwardhan
Gold Standard Bangla OCR Dataset: An In-Depth Look at Data Preprocessing and Annotation Processes
Hasmot Ali, AKM Shahariar Azad Rabby, Md Majedul Islam, Fakhruddin Mahamud, Nazmul Hasan and Fuad Rahman
PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching
Zhenting Qi, Xiaoyu Tan, Shaojie Shi, Chao Qu, Yinghui Xu and Yuan Qi
Welcome to the Real World: Efficient, Incremental and Scalable Key Point Analysis
Lilach Eden, Yoav Kantor, Matan Orbach, Yoav Katz, Noam Slonim and Roy Bar-Haim
Automatic Linking of Judgements to UK Supreme Court Hearings
Hadeel Saadany and Constantin Orasan
Automatic Marketing Theme and Commodity Construction System for E-commerce
Zhiping Wang, Peng Lin, Hainan Zhang, Hongshen Chen, Tianhao Li, Zhuoye Ding, Sulong Xu and Jinghe Hu
Towards Safer Operations: An Expert-involved Dataset of High-Pressure Gas Incidents for Preventing Future Failures
Shumpei Inoue, Minh-Tien Nguyen, Hiroki Mizokuchi, Tuan-Anh D. Nguyen, Huu-Hiep Nguyen and Dung Le
An Auxiliary Task Boosted Multi-task Learning Method for Service Account Retrieval with Limited Human Annotation
Yuanzhou Yao, Zhao Zhang, Kaijia Yang, Huasheng Liang, Qiang Yan and Yongjun Xu
VKIE: The Application of Key Information Extraction on Video Text
Siyu An, Ye Liu, Haoyuan Peng and Di Yin
Investigating the Role and Impact of Disfluency on Summarization
Varun Nathan, Ayush Kumar and Jithendra Vepa
InsightNet : Structured Insight Mining from Customer Feedback
Sandeep Sricharan Mukku, Manan Soni, Chetan Aggarwal, Jitenkumar Rana, Promod Yenigalla, Rashmi Patange and Shyam Mohan
E2E Spoken Entity Extraction for Virtual Agents
Karan Singla, Yeon-Jun Kim and Srinivas Bangalore
Generative Models for Product Attribute Extraction
Ansel Blume, Nasser Zalmout, Heng Ji and Xian Li
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering
Md Rashad Al Hasan Rony, Christian Suess, Sinchana Ramakanth Bhat, Viju Sudhi, Julia Schneider, Maximilian Vogel, Roman Teucher, Ken E. Friedl and Soumya Sahoo
BUSTER: a “BUSiness Transaction Entity Recognition” dataset
Andrea Zugarini, Andrew Zamai, Marco Ernandes and Leonardo Rigutini
Multi-word Tokenization for Sequence Compression
Leonidas Gee, Leonardo Rigutini, Marco Ernandes and Andrea Zugarini
JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization
Shang-Ching Liu, ShengKun Wang, Tsungyao Chang, Wenqi Lin, Chung-Wei Hsiung, Yi-Chen Hsieh, Yu-Ping Cheng, Sian-Hong Luo and Jianwei Zhang
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs
Sai Muralidhar Jayanthi, Devang Kulshreshtha, Saket Dingliwal, Srikanth Ronanki and Sravan Bodapati
STEER: Semantic Turn Extension-Expansion Recognition for Voice Assistants
Leon Zhang, Jiarui Lu, Joel Ruben Antony Moniz, Aditya Kulkarni, Dhivya Piraviperumal, Tien Dung Tran, Nick Tzou and Hong Yu
Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness
Xiaoyu Tan, Shaojie Shi, Xihe Qiu, Chao Qu, Zhenting Qi, Yinghui Xu and Yuan Qi
InstructPTS: Instruction-Tuning LLMs for Product Title Summarization
Besnik Fetahu, Zhiyu Chen, Oleg Rokhlenko and Shervin Malmasi
LLM4Vis: Explainable Visualization Recommendation using ChatGPT
Lei Wang, Songheng Zhang, Yun Wang, Ee-Peng Lim and Yong Wang
DUBLIN: Visual Document Understanding By Language-Image Network
Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary and Saurabh Tiwary
DocumentNet: Bridging the Data Gap in Document Pre-training
Lijun Yu, Jin Miao, Xiaoyu Sun, Jiayi Chen, Alexander Hauptmann, Hanjun Dai and Wei Wei
Relevance-assisted Generation for Robust Zero-shot Retrieval
Jihyuk Kim, Minsoo Kim, Joonsuk Park and Seung-won Hwang
Too much of product information : Don’t worry, let’s look for evidence!
Aryan Jain, Jitenkumar Rana and Chetan Aggarwal
Harnessing LLMs for Temporal Data - A Study on Explainable Financial Time Series Forecasting
Xinli Yu, Zheng Chen and Yanbin Lu
ViGPTQA - State-of-the-Art LLMs for Vietnamese Question Answering: System Overview, Core Models Training, and Evaluations
Minh Thuan Nguyen, Khanh Tung Tran, Nhu Van Nguyen and Xuan-Son Vu
An Integrated Search System for Korea Weather Data
Jinkyung Jo, Dayeon Ki, Soyoung Yoon and Minjoon Seo
Adaptive Hyper-parameter Learning for Deep Semantic Retrieval
Mingming Li, Chunyuan Yuan, Huimu Wang, Peng Wang, Jingwei Zhuo, Binbin Wang, Lin Liu and Sulong Xu
On Sample-Efficient Code Generation
Hojae Han, Yu Jin Kim, Byoungjip Kim, Youngwon Lee, Kyungjae Lee, Kyungmin Lee, Moontae Lee, Kyunghoon Bae and Seung-won Hwang
Batch Prompting: Efficient Inference with Large Language Model APIs
Zhoujun Cheng, Jungo Kasai and Tao Yu
Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Zheng Chen, Ziyan Jiang, Fan Yang, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu and Aram Galstyan
DELPHI: Data for Evaluating LLMs’ Performance in Handling Controversial Issues
David Sun, Artem Abzaliev, Hadas Kotek, Christopher Klein, Zidi Xiu and Jason D Williams
Angel: Enterprise Search System for the Non-Profit Industry
Saiful Haq, Ashutosh Sharma and Pushpak Bhattacharyya