written on 2023-09-15
title | authors | categories | displaydate |
---|---|---|---|
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration | Rachneet Sachdeva, Martin Tutek, Iryna Gurevych | cs.CL | 2023-09-14 |
Semantic reconstruction of continuous language from MEG signals | Bo Wang, Xiran Xu, Longxiang Zhang, Boda Xiao, Xihong Wu, Jing Chen | cs.HC, eess.SP, q-bio.NC | 2023-09-14 |
Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity | Joseph Gatto, Omar Sharif, Parker Seegmiller, Philip Bohlman, Sarah Masud Preum | cs.CL | 2023-09-12 |
Unveiling the potential of large language models in generating semantic and cross-language clones | Palash R. Roy, Ajmain I. Alam, Farouq Al-omari, Banani Roy, Chanchal K. Roy, Kevin A. Schneider | cs.SE, cs.AI, cs.LG | 2023-09-12 |
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Wei Qi Leong, Jian Gang Ngui, Yosephine Susanto, Hamsawardhini Rengarajan, Kengatharaiyer Sarveswaran, William Chandra Tjhi | cs.CL | 2023-09-12 |
Neural Latent Geometry Search: Product Manifold Inference via Gromov-Hausdorff-Informed Bayesian Optimization | Haitz Saez de Ocariz Borde, Alvaro Arroyo, Ismael Morales, Ingmar Posner, Xiaowen Dong | cs.LG, stat.ML | 2023-09-09 |
Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation | Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz | cs.CL, cs.MM, cs.SD | 2023-09-06 |
Synthetic Text Generation using Hypergraph Representations | Natraj Raman, Sameena Shah | cs.CL, cs.AI | 2023-09-06 |
Persona-aware Generative Model for Code-mixed Language | Ayan Sengupta, Md Shad Akhtar, Tanmoy Chakraborty | cs.CL, cs.LG | 2023-09-06 |
HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus | Zhenpeng Su, Xing Wu, Wei Zhou, Guangyuan Ma, Songlin Hu | cs.CL, cs.AI | 2023-09-06 |
Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts | Hongyang Du, Guangyuan Liu, Dusit Niyato, Jiayi Zhang, Jiawen Kang, Zehui Xiong, Bo Ai, Dong In Kim | eess.IV, cs.LG, cs.NI | 2023-09-05 |
Optimizing Factual Accuracy in Text Generation through Dynamic Knowledge Selection | Hongjin Qian, Zhicheng Dou, Jiejun Tan, Haonan Chen, Haoqi Gu, Ruofei Lai, Xinyu Zhang, Zhao Cao, Ji-Rong Wen | cs.CL, cs.AI | 2023-08-30 |
Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP | Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang | cs.CL, cs.AI, cs.CV | 2023-08-27 |
Planning with Logical Graph-based Language Model for Instruction Generation | Fan Zhang, Kebing Jin, Hankz Hankui Zhuo | cs.CL, cs.AI | 2023-08-26 |
GeoExplainer: A Visual Analytics Framework for Spatial Modeling Contextualization and Report Generation | Fan Lei, Yuxin Ma, Stewart Fotheringham, Elizabeth Mack, Ziqi Li, Mehak Sachdeva, Sarah Bardin, Ross Maciejewski | cs.HC, cs.LG | 2023-08-25 |
ARTIST: ARTificial Intelligence for Simplified Text | Lorenzo Corti, Jie Yang | cs.CL | 2023-08-25 |
Ceci n’est pas une pomme: Adversarial Illusions in Multi-Modal Embeddings | Eugene Bagdasaryan, Vitaly Shmatikov | cs.CR, cs.AI, cs.LG | 2023-08-22 |
Semantic Consistency for Assuring Reliability of Large Language Models | Harsh Raj, Vipul Gupta, Domenic Rosati, Subhabrata Majumdar | cs.CL, cs.AI, cs.CY | 2023-08-17 |
Can Knowledge Graphs Simplify Text? | Anthony Colas, Haodi Ma, Xuanli He, Yang Bai, Daisy Zhe Wang | cs.CL | 2023-08-14 |
Classification of Human- and AI-Generated Texts: Investigating Features for ChatGPT | Lorenz Mindner, Tim Schlippe, Kristina Schaaff | cs.CL, cs.AI | 2023-08-10 |
Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning | Alexander Hanbo Li, Mingyue Shang, Evangelia Spiliopoulou, Jie Ma, Patrick Ng, Zhiguo Wang, Bonan Min, William Wang, Kathleen McKeown, Vittorio Castelli, Dan Roth, Bing Xiang | cs.CL | 2023-08-10 |
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao | cs.CL, cs.AI | 2023-08-07 |
Evaluating Generative Models for Graph-to-Text Generation | Shuzhou Yuan, Michael Färber | cs.CL, cs.AI | 2023-07-27 |
Controllable Generation of Dialogue Acts for Dialogue Systems via Few-Shot Response Generation and Ranking | Angela Ramirez, Karik Agarwal, Juraj Juraska, Utkarsh Garg, Marilyn A. Walker | cs.CL | 2023-07-26 |
Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy | Yu Fu, Deyi Xiong, Yue Dong | cs.CL, cs.CR | 2023-07-25 |
FacTool: Factuality Detection in Generative AI – A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios | I-Chun Chern, Steffi Chern, Shiqi Chen, Weizhe Yuan, Kehua Feng, Chunting Zhou, Junxian He, Graham Neubig, Pengfei Liu | cs.CL, cs.AI | 2023-07-25 |
Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels? | Cheng-En Wu, Yu Tian, Haichao Yu, Heng Wang, Pedro Morgado, Yu Hen Hu, Linjie Yang | cs.CV, cs.AI, cs.LG | 2023-07-22 |
OxfordTVG-HIC: Can Machine Make Humorous Captions from Images? | Runjia Li, Shuyang Sun, Mohamed Elhoseiny, Philip Torr | cs.CV, cs.CL | 2023-07-21 |
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models | Michael Günther, Louis Milliken, Jonathan Geuter, Georgios Mastrapas, Bo Wang, Han Xiao | cs.CL, cs.AI, cs.IR, cs.LG, 68T50, H.3.1; H.3.3; I.2.7; I.5.4 | 2023-07-20 |
Generative Language Models on Nucleotide Sequences of Human Genes | Musa Nuri Ihtiyar, Arzucan Ozgur | q-bio.GN, cs.CL, cs.LG | 2023-07-20 |
On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large Language Models | Sarah Gao, Andrew Kean Gao | cs.DL, cs.CL, I.2.1; H.5.0 | 2023-07-19 |
Unveiling Gender Bias in Terms of Profession Across LLMs: Analyzing and Addressing Sociological Implications | Vishesh Thakur | cs.CL | 2023-07-18 |
Large Language Models Perform Diagnostic Reasoning | Cheng-Kuang Wu, Wei-Lin Chen, Hsin-Hsi Chen | cs.CL | 2023-07-18 |
COLLIE: Systematic Construction of Constrained Text Generation Tasks | Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik Narasimhan | cs.CL, cs.AI, cs.LG | 2023-07-17 |
A Dialogue System for Assessing Activities of Daily Living: Improving Consistency with Grounded Knowledge | Zhecheng Sheng, Raymond Finzel, Michael Lucke, Sheena Dufresne, Maria Gini, Serguei Pakhomov | cs.CL, cs.AI | 2023-07-15 |
Using Large Language Models for Zero-Shot Natural Language Generation from Knowledge Graphs | Agnes Axelsson, Gabriel Skantze | cs.CL, 68T50, I.2.7; I.2.4 | 2023-07-14 |
Reading Radiology Imaging Like The Radiologist | Yuhao Wang | cs.CV, cs.AI | 2023-07-12 |
SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference | Luciano Del Corro, Allie Del Giorno, Sahaj Agarwal, Bin Yu, Ahmed Awadallah, Subhabrata Mukherjee | cs.CL | 2023-07-05 |
A ChatGPT Aided Explainable Framework for Zero-Shot Medical Image Diagnosis | Jiaxiang Liu, Tianxiang Hu, Yan Zhang, Xiaotang Gai, Yang Feng, Zuozhu Liu | eess.IV, cs.CV, cs.LG | 2023-07-05 |
Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data | Guangzhi Sun, Chao Zhang, Ivan Vulić, Paweł Budzianowski, Philip C. Woodland | cs.CL | 2023-07-04 |
PatternGPT :A Pattern-Driven Framework for Large Language Model Text Generation | Le Xiao, Xin Shan | cs.CL, cs.AI | 2023-07-02 |
Concept-Oriented Deep Learning with Large Language Models | Daniel T. Chang | cs.LG, cs.CL | 2023-06-29 |
Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio | Allen Roush, Sanjay Basu, Akshay Moorthy, Dmitry Dubovoy | cs.CL, cs.AI, cs.LG | 2023-06-28 |
Knowledge Graph-Augmented Korean Generative Commonsense Reasoning | Dahyun Jung, Jaehyung Seo, Jaewook Lee, Chanjun Park, Heuiseok Lim | cs.CL, cs.AI | 2023-06-26 |
Long-range Language Modeling with Self-retrieval | Ohad Rubin, Jonathan Berant | cs.CL | 2023-06-23 |
AudioPaLM: A Large Language Model That Can Speak and Listen | Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Frank | cs.CL, cs.AI, cs.SD, eess.AS, stat.ML | 2023-06-22 |
Learning to Generate Better Than Your LLM | Jonathan D. Chang, Kiante Brantley, Rajkumar Ramamurthy, Dipendra Misra, Wen Sun | cs.LG, cs.AI, cs.CL | 2023-06-20 |
ChatGPT is not Enough: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling | Linyao Yang, Hongyang Chen, Zhao Li, Xiao Ding, Xindong Wu | cs.CL, cs.AI | 2023-06-20 |
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue | Weihao Zeng, Keqing He, Yejie Wang, Chen Zeng, Jingang Wang, Yunsen Xian, Weiran Xu | cs.CL | 2023-06-17 |
Semi-supervised Relation Extraction via Data Augmentation and Consistency-training | Komal K. Teru | cs.CL, cs.IR | 2023-06-16 |
Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models | Geon Yeong Park, Jeongsol Kim, Beomsu Kim, Sang Wan Lee, Jong Chul Ye | cs.CV, cs.AI, cs.CL, cs.LG | 2023-06-16 |
Neural models for Factual Inconsistency Classification with Explanations | Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma | cs.CL, cs.AI | 2023-06-15 |
Knowledge Distillation of Large Language Models | Yuxian Gu, Li Dong, Furu Wei, Minlie Huang | cs.CL, cs.AI | 2023-06-14 |
PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation | Zhiyuan Hu, Chumin Liu, Yue Feng, Bryan Hooi | cs.CL | 2023-06-14 |
Unifying Large Language Models and Knowledge Graphs: A Roadmap | Shirui Pan, Linhao Luo, Yufei Wang, Chen Chen, Jiapu Wang, Xindong Wu | cs.CL, cs.AI | 2023-06-14 |
GBSD: Generative Bokeh with Stage Diffusion | Jieren Deng, Xin Zhou, Hao Tian, Zhihong Pan, Derek Aguiar | cs.CV, cs.AI | 2023-06-14 |
Question Decomposition Tree for Answering Complex Questions over Knowledge Bases | Xiang Huang, Sitao Cheng, Yiheng Shu, Yuheng Bao, Yuzhong Qu | cs.CL | 2023-06-13 |
Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization | Mohamed Trabelsi, Huseyin Uzunalioglu | cs.CL, cs.LG | 2023-06-07 |
Injecting knowledge into language generation: a case study in auto-charting after-visit care instructions from medical dialogue | Maksim Eremeev, Ilya Valmianski, Xavier Amatriain, Anitha Kannan | cs.CL | 2023-06-06 |
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model | Yizhe Zhang, Jiatao Gu, Zhuofeng Wu, Shuangfei Zhai, Josh Susskind, Navdeep Jaitly | cs.CL | 2023-06-05 |
Adaptive and Personalized Exercise Generation for Online Language Learning | Peng Cui, Mrinmaya Sachan | cs.CL, cs.AI | 2023-06-04 |
Commonsense Knowledge Transfer for Pre-trained Language Models | Wangchunshu Zhou, Ronan Le Bras, Yejin Choi | cs.CL, cs.AI, cs.LG | 2023-06-04 |
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference | Wangchunshu Zhou, Ronan Le Bras, Yejin Choi | cs.CL, cs.LG | 2023-06-04 |
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization | Jung Hyun Lee, Jeonghoon Kim, Se Jung Kwon, Dongsoo Lee | cs.LG, cs.AI | 2023-06-01 |
An Invariant Learning Characterization of Controlled Text Generation | Carolina Zheng, Claudia Shi, Keyon Vafa, Amir Feder, David M. Blei | cs.CL, cs.LG | 2023-05-31 |
Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation | Chunliu Wang, Huiyuan Lai, Malvina Nissim, Johan Bos | cs.CL | 2023-05-31 |
Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency | Maxwell Crouse, Ramon Astudillo, Tahira Naseem, Subhajit Chaudhury, Pavan Kapanipathi, Salim Roukos, Alexander Gray | cs.CL, cs.AI | 2023-05-31 |
Fine-grained Text Style Transfer with Diffusion-Based Language Models | Yiwei Lyu, Tiange Luo, Jiacheng Shi, Todd C. Hollon, Honglak Lee | cs.CL, cs.AI, cs.LG | 2023-05-31 |
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | Jingyuan Selena She, Christopher Potts, Samuel R. Bowman, Atticus Geiger | cs.CL, cs.LG | 2023-05-30 |
Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models | Zhen Lin, Shubhendu Trivedi, Jimeng Sun | cs.CL, cs.LG, stat.ML | 2023-05-30 |
Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps? | Seyed Mahed Mousavi, Simone Caldarella, Giuseppe Riccardi | cs.CL | 2023-05-25 |
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation | Niels Mündler, Jingxuan He, Slobodan Jenko, Martin Vechev | cs.CL, cs.AI, cs.LG | 2023-05-25 |
Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering | Avi Caciularu, Matthew E. Peters, Jacob Goldberger, Ido Dagan, Arman Cohan | cs.CL, cs.AI | 2023-05-24 |
Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing | Tianyi Tang, Hongyuan Lu, Yuchen Eleanor Jiang, Haoyang Huang, Dongdong Zhang, Wayne Xin Zhao, Furu Wei | cs.CL | 2023-05-24 |
The ACL OCL Corpus: advancing Open science in Computational Linguistics | Shaurya Rohatgi, Yanxia Qin, Benjamin Aw, Niranjana Unnithan, Min-Yen Kan | cs.CL, cs.DL | 2023-05-24 |
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding | Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Scott Wen-tau Yih | cs.CL | 2023-05-24 |
Diffusion Models in NLP: A Survey | Hao Zou, Zae Myung Kim, Dongyeop Kang | cs.CL | 2023-05-24 |
Enhancing Generation through Summarization Duality and Explicit Outline Control | Yunzhe Li, Qian Chen, Weixiang Yan, Wen Wang, Qinglin Zhang, Hari Sundaram | cs.CL, cs.AI | 2023-05-23 |
INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback | Wenda Xu, Danqing Wang, Liangming Pan, Zhenqiao Song, Markus Freitag, William Yang Wang, Lei Li | cs.CL, cs.AI | 2023-05-23 |
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | Sewon Min, Kalpesh Krishna, Xinxi Lyu, Mike Lewis, Wen-tau Yih, Pang Wei Koh, Mohit Iyyer, Luke Zettlemoyer, Hannaneh Hajishirzi | cs.CL, cs.AI, cs.LG | 2023-05-23 |
Evaluating Factual Consistency of Texts with Semantic Role Labeling | Jing Fan, Dennis Aumiller, Michael Gertz | cs.CL | 2023-05-22 |
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language | Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu | cs.CL | 2023-05-22 |
STOAT: Structured Data to Analytical Text With Controls | Deepanway Ghosal, Preksha Nema, Aravindan Raghuveer | cs.CL, cs.AI | 2023-05-19 |
Generating Visual Spatial Description via Holistic 3D Scene Understanding | Yu Zhao, Hao Fei, Wei Ji, Jianguo Wei, Meishan Zhang, Min Zhang, Tat-Seng Chua | cs.CV, cs.CL | 2023-05-19 |
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability | Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, Barbara Plank | cs.CL, cs.AI, cs.LG | 2023-05-19 |
Cross-modality Data Augmentation for End-to-End Sign Language Translation | Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Hui Xiong | cs.CL | 2023-05-18 |
What You See is What You Read? Improving Text-Image Alignment Evaluation | Michal Yarom, Yonatan Bitton, Soravit Changpinyo, Roee Aharoni, Jonathan Herzig, Oran Lang, Eran Ofek, Idan Szpektor | cs.CL, cs.CV | 2023-05-17 |
Boosting Event Extraction with Denoised Structure-to-Text Augmentation | bo wang, Heyan Huang, Xiaochi Wei, Ge Shi, Xiao Liu, Chong Feng, Tong Zhou, Shuaiqiang Wang, Dawei Yin | cs.CL | 2023-05-16 |
Watermarking Text Generated by Black-Box Language Models | Xi Yang, Kejiang Chen, Weiming Zhang, Chang Liu, Yuang Qi, Jie Zhang, Han Fang, Nenghai Yu | cs.CL, cs.AI | 2023-05-14 |
ProKnow: Process Knowledge for Safety Constrained and Explainable Question Generation for Mental Health Diagnostic Assistance | Kaushik Roy, Manas Gaur, Misagh Soltani, Vipula Rawte, Ashwin Kalyan, Amit Sheth | cs.CL | 2023-05-13 |
Autocorrelations Decay in Texts and Applicability Limits of Language Models | Nikolay Mikhaylovskiy, Ilya Churilov | cs.CL, I.2.7 | 2023-05-11 |
Long-Tailed Question Answering in an Open World | Yi Dai, Hao Lang, Yinhe Zheng, Fei Huang, Yongbin Li | cs.CL, cs.AI, cs.LG | 2023-05-11 |
Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? An Examination on Several Typical Tasks | Xianzhi Li, Xiaodan Zhu, Zhiqiang Ma, Xiaomo Liu, Sameena Shah | cs.CL, cs.AI | 2023-05-10 |
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages | Rahul Aralikatte, Ziling Cheng, Sumanth Doddapaneni, Jackie Chi Kit Cheung | cs.CL | 2023-05-10 |
Vision-Language Models in Remote Sensing: Current Progress and Future Trends | Congcong Wen, Yuan Hu, Xiang Li, Zhenghang Yuan, Xiao Xiang Zhu | cs.CV, cs.AI | 2023-05-09 |
Dreams Are More “Predictable’’ Than You Think | Lorenzo Bertolini | cs.CL | 2023-05-08 |
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation | Kun Zhou, Yifan Li, Wayne Xin Zhao, Ji-Rong Wen | cs.CL | 2023-05-06 |
Expository Text Generation: Imitate, Retrieve, Paraphrase | Nishant Balepur, Jie Huang, Kevin Chen-Chuan Chang | cs.CL | 2023-05-05 |
Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain | Liqiang Jing, Xuemeng Song, Xuming Lin, Zhongzhou Zhao, Wei Zhou, Liqiang Nie | cs.CL | 2023-05-05 |
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs | Jinyang Li, Binyuan Hui, Ge Qu, Binhua Li, Jiaxi Yang, Bowen Li, Bailin Wang, Bowen Qin, Rongyu Cao, Ruiying Geng, Nan Huo, Chenhao Ma, Kevin C. C. Chang, Fei Huang, Reynold Cheng, Yongbin Li | cs.CL | 2023-05-04 |
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training | Nitay Calderon, Subhabrata Mukherjee, Roi Reichart, Amir Kantor | cs.CL, cs.AI | 2023-05-03 |
The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers | Ariel Gera, Roni Friedman, Ofir Arviv, Chulaka Gunasekara, Benjamin Sznajder, Noam Slonim, Eyal Shnarch | cs.CL, cs.LG | 2023-05-02 |
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond | Jingfeng Yang, Hongye Jin, Ruixiang Tang, Xiaotian Han, Qizhang Feng, Haoming Jiang, Bing Yin, Xia Hu | cs.CL, cs.AI, cs.LG | 2023-04-26 |
Evaluating Inter-Bilingual Semantic Parsing for Indian Languages | Divyanshu Aggarwal, Vivek Gupta, Anoop Kunchukuttan | cs.CL | 2023-04-25 |
Divide and Prompt: Chain of Thought Prompting for Text-to-SQL | Xiping Liu, Zhao Tan | cs.CL, cs.AI | 2023-04-23 |
Translationese Reduction using Abstract Meaning Representation | Shira Wein, Nathan Schneider | cs.CL | 2023-04-23 |
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information | Qiao Jin, Yifan Yang, Qingyu Chen, Zhiyong Lu | cs.CL, cs.AI, q-bio.GN | 2023-04-19 |
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models | Yikang Liu, Ziyin Zhang, Wanyang Zhang, Shisen Yue, Xiaojing Zhao, Xinyuan Cheng, Yiwen Zhang, Hai Hu | cs.CL | 2023-04-16 |
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study | Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro | cs.CL, cs.AI, cs.IR, cs.LG | 2023-04-13 |
Training Large Language Models Efficiently with Sparsity and Dataflow | Venkat Srinivasan, Darshan Gandhi, Urmish Thakker, Raghu Prabhakar | cs.LG | 2023-04-11 |
To ChatGPT, or not to ChatGPT: That is the question! | Alessandro Pegoraro, Kavita Kumari, Hossein Fereidooni, Ahmad-Reza Sadeghi | cs.LG, cs.AI, cs.CL | 2023-04-04 |
Measuring and Manipulating Knowledge Representations in Language Models | Evan Hernandez, Belinda Z. Li, Jacob Andreas | cs.CL | 2023-04-03 |
Assessing Language Model Deployment with Risk Cards | Leon Derczynski, Hannah Rose Kirk, Vidhisha Balachandran, Sachin Kumar, Yulia Tsvetkov, M. R. Leiser, Saif Mohammad | cs.CL | 2023-03-31 |
ChatGPT or academic scientist? Distinguishing authorship with over 99% accuracy using off-the-shelf machine learning tools | Heather Desaire, Aleesa E. Chua, Madeline Isom, Romana Jarosova, David Hua | cs.LG, cs.CL | 2023-03-28 |
On Codex Prompt Engineering for OCL Generation: An Empirical Study | Seif Abukhalaf, Mohammad Hamdaqa, Foutse Khomh | cs.SE, cs.AI | 2023-03-28 |
Large Language Models are Diverse Role-Players for Summarization Evaluation | Ning Wu, Ming Gong, Linjun Shou, Shining Liang, Daxin Jiang | cs.CL | 2023-03-27 |
Paraphrase Detection: Human vs. Machine Content | Jonas Becker, Jan Philip Wahle, Terry Ruas, Bela Gipp | cs.CL, cs.AI | 2023-03-24 |
CoBIT: A Contrastive Bi-directional Image-Text Generation Model | Haoxuan You, Mandy Guo, Zhecan Wang, Kai-Wei Chang, Jason Baldridge, Jiahui Yu | cs.CV, cs.CL | 2023-03-23 |
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense | Kalpesh Krishna, Yixiao Song, Marzena Karpinska, John Wieting, Mohit Iyyer | cs.CL, cs.CR, cs.LG | 2023-03-23 |
Compositional Zero-Shot Domain Transfer with Text-to-Text Models | Fangyu Liu, Qianchu Liu, Shruthi Bannur, Fernando Pérez-García, Naoto Usuyama, Sheng Zhang, Tristan Naumann, Aditya Nori, Hoifung Poon, Javier Alvarez-Valle, Ozan Oktay, Stephanie L. Hyland | cs.CL, cs.LG | 2023-03-23 |
Language Model Behavior: A Comprehensive Survey | Tyler A. Chang, Benjamin K. Bergen | cs.CL | 2023-03-20 |
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models | Vithursan Thangarasa, Abhay Gupta, William Marshall, Tianda Li, Kevin Leong, Dennis DeCoste, Sean Lie, Shreyas Saxena | cs.LG, cs.CL | 2023-03-18 |
Input-length-shortening and text generation via attention values | Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock | cs.CL | 2023-03-14 |
Transformer-based Planning for Symbolic Regression | Parshin Shojaee, Kazem Meidani, Amir Barati Farimani, Chandan K. Reddy | cs.LG, cs.AI | 2023-03-13 |
Large Language Models as Zero-Shot Human Models for Human-Robot Interaction | Bowen Zhang, Harold Soh | cs.RO, cs.CL, cs.HC, cs.LG | 2023-03-06 |
A Universal Question-Answering Platform for Knowledge Graphs | Reham Omar, Ishika Dhall, Panos Kalnis, Essam Mansour | cs.AI, cs.CL, cs.DB | 2023-03-01 |
Inseq: An Interpretability Toolkit for Sequence Generation Models | Gabriele Sarti, Nils Feldhus, Ludwig Sickert, Oskar van der Wal | cs.CL, cs.AI, cs.HC, cs.LG | 2023-02-27 |
Toward Fairness in Text Generation via Mutual Information Minimization based on Importance Sampling | Rui Wang, Pengyu Cheng, Ricardo Henao | cs.CL, cs.AI | 2023-02-25 |
Few-Shot Table-to-Text Generation with Prompt-based Adapter | Zhixin Guo, Minyxuan Yan, Jiexing Qi, Jianping Zhou, Ziwei He, Zhouhan Lin, Guanjie Zheng, Xinbing Wang | cs.CL | 2023-02-24 |
How Does In-Context Learning Help Prompt Tuning? | Simeng Sun, Yang Liu, Dan Iter, Chenguang Zhu, Mohit Iyyer | cs.CL | 2023-02-22 |
KILM: Knowledge Injection into Encoder-Decoder Language Models | Yan Xu, Mahdi Namazifar, Devamanyu Hazarika, Aishwarya Padmakumar, Yang Liu, Dilek Hakkani-Tür | cs.CL, cs.AI | 2023-02-17 |
Do We Still Need Clinical Language Models? | Eric Lehman, Evan Hernandez, Diwakar Mahajan, Jonas Wulff, Micah J. Smith, Zachary Ziegler, Daniel Nadler, Peter Szolovits, Alistair Johnson, Emily Alsentzer | cs.CL | 2023-02-16 |
Tree-Based Representation and Generation of Natural and Mathematical Language | Alexander Scarlatos, Andrew Lan | cs.CL | 2023-02-15 |
Investigating the Effect of Relative Positional Embeddings on AMR-to-Text Generation with Structural Adapters | Sebastien Montella, Alexis Nasr, Johannes Heinecke, Frederic Bechet, Lina M. Rojas-Barahona | cs.CL | 2023-02-12 |
TextDefense: Adversarial Text Detection based on Word Importance Entropy | Lujia Shen, Xuhong Zhang, Shouling Ji, Yuwen Pu, Chunpeng Ge, Xing Yang, Yanghe Feng | cs.CL, cs.AI, cs.CR | 2023-02-12 |
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning | Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Ming-Yu Liu, Yuke Zhu, Mohammad Shoeybi, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar | cs.CV, cs.AI, cs.CL, cs.IR, cs.LG | 2023-02-09 |
Lightweight Transformers for Clinical Natural Language Processing | Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson, David A. Clifton | cs.CL, cs.AI, cs.LG, 68T50, I.2.7 | 2023-02-09 |
Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization | Zhixin Guo, Minyxuan Yan, Jiexing Qi, Jianping Zhou, Ziwei He, Zhouhan Lin, Guanjie Zheng, Xinbing Wang | cs.CL, cs.AI | 2023-02-09 |
Auto-Learning: An Adversarial Process of Two Pre-trained Models for Natural Language Generation | Zhengqing Yuan, Yuelin Lu, Chao Zhang, Huiwen Xue | cs.CL | 2023-02-08 |
Controlling Personality Style in Dialogue with Zero-Shot Prompt-Based Learning | Angela Ramirez, Mamon Alsalihy, Kartik Aggarwal, Cecilia Li, Liren Wu, Marilyn Walker | cs.CL | 2023-02-08 |
Execution-based Code Generation using Deep Reinforcement Learning | Parshin Shojaee, Aneesh Jain, Sindhu Tipirneni, Chandan K. Reddy | cs.LG, cs.AI, cs.CL, cs.PL | 2023-01-31 |
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature | Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D. Manning, Chelsea Finn | cs.CL, cs.AI | 2023-01-26 |
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER | Xiang Chen, Lei Li, Qiaoshuo Fei, Ningyu Zhang, Chuanqi Tan, Yong Jiang, Fei Huang, Huajun Chen | cs.CL, cs.AI, cs.DB, cs.IR, cs.LG | 2023-01-25 |
Semantic-aware Contrastive Learning for Electroencephalography-to-Text Generation with Curriculum Learning | Xiachong Feng, Xiaocheng Feng, Bing Qin | cs.HC, cs.CL | 2023-01-23 |
Rationalization for Explainable NLP: A Survey | Sai Gurrapu, Ajay Kulkarni, Lifu Huang, Ismini Lourentzou, Laura Freeman, Feras A. Batarseh | cs.CL, cs.LG | 2023-01-21 |
A Multi-Modal Geographic Pre-Training Method | Ruixue Ding, Boli Chen, Pengjun Xie, Fei Huang, Xin Li, Qiang Zhang, Yao Xu | cs.CL | 2023-01-11 |
Universal Multimodal Representation for Language Understanding | Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao | cs.CL, cs.AI, cs.CV | 2023-01-09 |
TegFormer: Topic-to-Essay Generation with Good Topic Coverage and High Text Coherence | Wang Qi, Rui Liu, Yuan Zuo, Yong Chen, Dell Zhang | cs.CL | 2022-12-27 |
Do DALL-E and Flamingo Understand Each Other? | Hang Li, Jindong Gu, Rajat Koner, Sahand Sharifzadeh, Volker Tresp | cs.CV, cs.LG | 2022-12-23 |
Understanding Stereotypes in Language Models: Towards Robust Measurement and Zero-Shot Debiasing | Justus Mattern, Zhijing Jin, Mrinmaya Sachan, Rada Mihalcea, Bernhard Schölkopf | cs.CL, cs.LG | 2022-12-20 |
Controllable Text Generation with Language Constraints | Howard Chen, Huihan Li, Danqi Chen, Karthik Narasimhan | cs.CL | 2022-12-20 |
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning | Xiaoming Liu, Zhaohan Zhang, Yichen Wang, Yu Lan, Chao Shen | cs.CL | 2022-12-20 |
One Embedder, Any Task: Instruction-Finetuned Text Embeddings | Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu | cs.CL | 2022-12-19 |
Synthesis and Evaluation of a Domain-specific Large Data Set for Dungeons & Dragons | Akila Peiris, Nisansa de Silva | cs.CL, cs.LG | 2022-12-18 |
Plansformer: Generating Symbolic Plans using Transformers | Vishal Pallagani, Bharath Muppasani, Keerthiram Murugesan, Francesca Rossi, Lior Horesh, Biplav Srivastava, Francesco Fabiano, Andrea Loreggia | cs.AI | 2022-12-16 |
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation | Swarnadeep Saha, Xinyan Velocity Yu, Mohit Bansal, Ramakanth Pasunuru, Asli Celikyilmaz | cs.CL, cs.AI, cs.LG | 2022-12-16 |
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning | Olga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz | cs.CL, cs.LG | 2022-12-15 |
Collaborating Heterogeneous Natural Language Processing Tasks via Federated Learning | Chenhe Dong, Yuexiang Xie, Bolin Ding, Ying Shen, Yaliang Li | cs.CL | 2022-12-12 |
Discovering Latent Knowledge in Language Models Without Supervision | Collin Burns, Haotian Ye, Dan Klein, Jacob Steinhardt | cs.CL, cs.AI, cs.LG | 2022-12-07 |
Harnessing Knowledge and Reasoning for Human-Like Natural Language Generation: A Brief Review | Jiangjie Chen, Yanghua Xiao | cs.CL | 2022-12-07 |
Momentum Decoding: Open-ended Text Generation As Graph Exploration | Tian Lan, Yixuan Su, Shuhang Liu, Heyan Huang, Xian-Ling Mao | cs.CL | 2022-12-05 |
Learning Automata-Based Task Knowledge Representation from Large-Scale Generative Language Models | Yunhao Yang, Jean-Raphaël Gaglione, Ufuk Topcu | cs.FL, cs.CL | 2022-12-04 |
What do you MEME? Generating Explanations for Visual Semantic Role Labelling in Memes | Shivam Sharma, Siddhant Agarwal, Tharun Suresh, Preslav Nakov, Md. Shad Akhtar, Tanmoy Charkraborty | cs.CY, cs.CL | 2022-12-01 |
CliMedBERT: A Pre-trained Language Model for Climate and Health-related Text | B. Jalalzadeh Fard, S. A. Hasan, J. E. Bell | cs.CL, 68T50, I.2.7 | 2022-12-01 |
On the Importance of Image Encoding in Automated Chest X-Ray Report Generation | Otabek Nazarov, Mohammad Yaqub, Karthik Nandakumar | cs.CV, cs.AI | 2022-11-24 |
Retrieval-Augmented Multimodal Language Modeling | Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih | cs.CV, cs.CL, cs.LG | 2022-11-22 |
Towards Computationally Verifiable Semantic Grounding for Language Models | Chris Alberti, Kuzman Ganchev, Michael Collins, Sebastian Gehrmann, Ciprian Chelba | cs.CL | 2022-11-16 |
kogito: A Commonsense Knowledge Inference Toolkit | Mete Ismayilzada, Antoine Bosselut | cs.CL | 2022-11-15 |
A Survey of Knowledge-Enhanced Pre-trained Language Models | Linmei Hu, Zeyi Liu, Ziwang Zhao, Lei Hou, Liqiang Nie, Juanzi Li | cs.CL | 2022-11-11 |
CCPrompt: Counterfactual Contrastive Prompt-Tuning for Many-Class Classification | Yang Li, Canran Xu, Tao Shen, Jing Jiang, Guodong Long | cs.CL | 2022-11-11 |
Measuring Reliability of Large Language Models through Semantic Consistency | Harsh Raj, Domenic Rosati, Subhabrata Majumdar | cs.CL, cs.AI, cs.CY | 2022-11-10 |
Generative Transformers for Design Concept Generation | Qihao Zhu, Jianxi Luo | cs.CL | 2022-11-07 |
CLSE: Corpus of Linguistically Significant Entities | Aleksandr Chuklin, Justin Zhao, Mihir Kale | cs.CL | 2022-11-04 |
Dialect-robust Evaluation of Generated Text | Jiao Sun, Thibault Sellam, Elizabeth Clark, Tu Vu, Timothy Dozat, Dan Garrette, Aditya Siddhant, Jacob Eisenstein, Sebastian Gehrmann | cs.CL | 2022-11-02 |
Seq2Seq-SC: End-to-End Semantic Communication Systems with Pre-trained Language Model | Ju-Hyung Lee, Dong-Ho Lee, Eunsoo Sheen, Thomas Choi, Jay Pujara, Joongheon Kim | eess.SP, cs.CL | 2022-10-27 |
Contrastive Search Is What You Need For Neural Text Generation | Yixuan Su, Nigel Collier | cs.CL | 2022-10-25 |
Mapping Process for the Task: Wikidata Statements to Text as Wikipedia Sentences | Hoang Thang Ta, Alexander Gelbukha, Grigori Sidorov | cs.CL, cs.AI | 2022-10-23 |
Hard Gate Knowledge Distillation – Leverage Calibration for Robust and Reliable Language Model | Dongkyu Lee, Zhiliang Tian, Yingxiu Zhao, Ka Chun Cheung, Nevin L. Zhang | cs.CL, cs.AI | 2022-10-22 |
What do Large Language Models Learn beyond Language? | Avinash Madasu, Shashank Srivastava | cs.CL | 2022-10-21 |
Image Semantic Relation Generation | Mingzhe Du | cs.CV, cs.CL | 2022-10-19 |
NGEP: A Graph-based Event Planning Framework for Story Generation | Chen Tang, Zhihao Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin | cs.CL, cs.AI | 2022-10-19 |
SafeText: A Benchmark for Exploring Physical Safety in Language Models | Sharon Levy, Emily Allaway, Melanie Subbiah, Lydia Chilton, Desmond Patton, Kathleen McKeown, William Yang Wang | cs.CL, cs.AI | 2022-10-18 |
DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation | Hanqing Zhang, Dawei Song | cs.CL | 2022-10-18 |
UniTune: Text-Driven Image Editing by Fine Tuning an Image Generation Model on a Single Image | Dani Valevski, Matan Kalman, Yossi Matias, Yaniv Leviathan | cs.CV, cs.GR, cs.LG | 2022-10-17 |
Deepfake Text Detection: Limitations and Opportunities | Jiameng Pu, Zain Sarwar, Sifat Muhammad Abdullah, Abdullah Rehman, Yoonjin Kim, Parantapa Bhattacharya, Mobin Javed, Bimal Viswanath | cs.CR, cs.CL, cs.LG | 2022-10-17 |
Towards a Unified Multi-Dimensional Evaluator for Text Generation | Ming Zhong, Yang Liu, Da Yin, Yuning Mao, Yizhu Jiao, Pengfei Liu, Chenguang Zhu, Heng Ji, Jiawei Han | cs.CL | 2022-10-13 |
RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval | Xing Wu, Chaochen Gao, Zijia Lin, Zhongyuan Wang, Jizhong Han, Songlin Hu | cs.CV, cs.AI | 2022-10-13 |
Equi-Tuning: Group Equivariant Fine-Tuning of Pretrained Models | Sourya Basu, Prasanna Sattigeri, Karthikeyan Natesan Ramamurthy, Vijil Chenthamarakshan, Kush R. Varshney, Lav R. Varshney, Payel Das | cs.LG, cs.CL | 2022-10-13 |
CHARD: Clinical Health-Aware Reasoning Across Dimensions for Text Generation Models | Steven Y. Feng, Vivek Khetan, Bogdan Sacaleanu, Anatole Gershman, Eduard Hovy | cs.CL, cs.AI, cs.LG | 2022-10-09 |
BLAB Reporter: Automated journalism covering the Blue Amazon | Yan V. Sym, João Gabriel M. Campos, Fabio G. Cozman | cs.CL | 2022-10-08 |
Bird-Eye Transformers for Text Generation Models | Lei Sha, Yuhang Song, Yordan Yordanov, Tommaso Salvatori, Thomas Lukasiewicz | cs.CL | 2022-10-08 |
A Unified Encoder-Decoder Framework with Entity Memory | Zhihan Zhang, Wenhao Yu, Chenguang Zhu, Meng Jiang | cs.CL | 2022-10-07 |
Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics | Zihao Wang, Jiaheng Dou, Yong Zhang | cs.CL | 2022-10-05 |
Synonym Detection Using Syntactic Dependency And Neural Embeddings | Dongqiang Yang, Pikun Wang, Xiaodong Sun, Ning Li | cs.CL, cs.AI | 2022-09-30 |
Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals | Piotr Mirowski, Kory W. Mathewson, Jaylen Pittman, Richard Evans | cs.HC, cs.CL | 2022-09-29 |
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing | Yanjun Gao, Dmitriy Dligach, Timothy Miller, John Caskey, Brihat Sharma, Matthew M Churpek, Majid Afshar | cs.CL, cs.AI | 2022-09-29 |
FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation | Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani | cs.CL, cs.IR | 2022-09-28 |
Informative Text Generation from Knowledge Triples | Zihao Fu, Yijiang River Dong, Lidong Bing, Wai Lam | cs.CL | 2022-09-26 |
Controllable Text Generation for Open-Domain Creativity and Fairness | Nanyun Peng | cs.CL, cs.AI | 2022-09-24 |
Can we do that simpler? Simple, Efficient, High-Quality Evaluation Metrics for NLG | Jens Grünwald, Christoph Leiter, Steffen Eger | cs.CL, cs.LG | 2022-09-20 |
Selective Token Generation for Few-shot Natural Language Generation | Daejin Jo, Taehwan Kwon, Eun-Sol Kim, Sungwoong Kim | cs.CL, cs.LG | 2022-09-17 |
TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations | Xinyang Zhang, Yury Malkov, Omar Florez, Serim Park, Brian McWilliams, Jiawei Han, Ahmed El-Kishky | cs.CL | 2022-09-15 |
Distribution Aware Metrics for Conditional Natural Language Generation | David M Chan, Yiming Ni, Austin Myers, Sudheendra Vijayanarasimhan, David A Ross, John Canny | cs.CL, cs.AI, cs.CV, cs.LG | 2022-09-15 |
vec2text with Round-Trip Translations | Geoffrey Cideron, Sertan Girgin, Anton Raichuk, Olivier Pietquin, Olivier Bachem, Léonard Hussenot | cs.CL, cs.LG | 2022-09-14 |
Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows | Keisuke Shirai, Atsushi Hashimoto, Taichi Nishimura, Hirotaka Kameko, Shuhei Kurita, Yoshitaka Ushiku, Shinsuke Mori | cs.CL, cs.AI | 2022-09-13 |
Elaboration-Generating Commonsense Question Answering at Scale | Wenya Wang, Vivek Srikumar, Hanna Hajishirzi, Noah A. Smith | cs.CL | 2022-09-02 |
Multi-Modal Experience Inspired AI Creation | Qian Cao, Xu Chen, Ruihua Song, Hao Jiang, Guang Yang, Zhao Cao | cs.AI | 2022-09-02 |
Unified Knowledge Prompt Pre-training for Customer Service Dialogues | Keqing He, Jingang Wang, Chaobo Sun, Wei Wu | cs.CL | 2022-08-31 |
StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing | Xuekai Zhu, Jian Guan, Minlie Huang, Juan Liu | cs.CL | 2022-08-29 |
GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers | Hsien-Chin Lin, Christian Geishauser, Shutong Feng, Nurul Lubis, Carel van Niekerk, Michael Heck, Milica Gašić | cs.CL | 2022-08-23 |
Automatic tagging of knowledge points for K12 math problems | Xiaolu Wang, Ziqi Ding, Liangyu Chen | cs.CL | 2022-08-21 |
Performance Optimization for Semantic Communications: An Attention-based Reinforcement Learning Approach | Yining Wang, Mingzhe Chen, Tao Luo, Walid Saad, Dusit Niyato, H. Vincent Poor, Shuguang Cui | cs.IT, cs.AI, math.IT | 2022-08-17 |
Understanding Attention for Vision-and-Language Tasks | Feiqi Cao, Soyeon Caren Han, Siqu Long, Changwei Xu, Josiah Poon | cs.CV, cs.CL | 2022-08-17 |
A Representation Modeling Based Language GAN with Completely Random Initialization | Da Ren, Qing Li | cs.CL | 2022-08-04 |
LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection | Zhuo Chen, Yufeng Huang, Jiaoyan Chen, Yuxia Geng, Yin Fang, Jeff Pan, Ningyu Zhang, Wen Zhang | cs.CV, cs.AI | 2022-07-26 |
Leveraging Natural Supervision for Language Representation Learning and Generation | Mingda Chen | cs.CL | 2022-07-21 |
Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency | Jin Liu, Chongfeng Fan, Fengyu Zhou, Huijuan Xu | cs.AI | 2022-07-02 |
Is neural language acquisition similar to natural? A chronological probing study | Ekaterina Voloshina, Oleg Serikov, Tatiana Shavrina | cs.CL | 2022-07-01 |
Evaluation of Semantic Answer Similarity Metrics | Farida Mustafazade, Peter F. Ebbinghaus | cs.CL, cs.AI, cs.LG | 2022-06-25 |
BenchCLAMP: A Benchmark for Evaluating Language Models on Semantic Parsing | Subhro Roy, Sam Thomson, Tongfei Chen, Richard Shin, Adam Pauls, Jason Eisner, Benjamin Van Durme | cs.CL | 2022-06-21 |
Interpretable AMR-Based Question Decomposition for Multi-hop Question Answering | Zhenyun Deng, Yonghua Zhu, Yang Chen, Michael Witbrock, Patricia Riddle | cs.CL | 2022-06-16 |
An Exploration of Post-Editing Effectiveness in Text Summarization | Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel Tetreault, Alejandro Jaimes | cs.CL, cs.AI, cs.HC | 2022-06-13 |
Plot Writing From Pre-Trained Language Models | Yiping Jin, Vishakha Kadam, Dittaya Wanvarie | cs.CL | 2022-06-07 |
Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval | Yanmeng Wang, Jun Bai, Ye Wang, Jianfei Zhang, Wenge Rong, Zongcheng Ji, Shaojun Wang, Jing Xiao | cs.CL, cs.AI | 2022-06-07 |
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech | Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye | eess.AS, cs.CL, cs.SD | 2022-06-05 |
Beyond Opinion Mining: Summarizing Opinions of Customer Reviews | Reinald Kim Amplayo, Arthur Bražinskas, Yoshi Suhara, Xiaolan Wang, Bing Liu | cs.CL, cs.AI, cs.LG | 2022-06-03 |
Revisiting Generative Commonsense Reasoning: A Pre-Ordering Approach | Chao Zhao, Faeze Brahman, Tenghao Huang, Snigdha Chaturvedi | cs.CL | 2022-05-26 |
Automatic question generation based on sentence structure analysis using machine learning approach | Miroslav Blšták, Viera Rozinajová | cs.CL, cs.AI | 2022-05-25 |
PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation | Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, Dongmei Zhang | cs.CL | 2022-05-25 |
Exploring industrial safety knowledge via Zipf law | Zhenhua Wang, Ming Ren, Dong Gao, Zhuang Li | cs.CL | 2022-05-25 |
RSTGen: Imbuing Fine-Grained Interpretable Control into Long-FormText Generators | Rilwan A. Adewoyin, Ritabrata Dutta, Yulan He | cs.CL | 2022-05-25 |
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation – through the Lens of Semantic Similarity Rating | Laura Zeidler, Juri Opitz, Anette Frank | cs.CL | 2022-05-24 |
What Makes Data-to-Text Generation Hard for Pretrained Language Models? | Moniba Keymanesh, Adrian Benton, Mark Dredze | cs.CL, cs.AI, cs.IR, cs.LG | 2022-05-23 |
A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation | Weizhen Qi, Yeyun Gong, Yelong Shen, Jian Jiao, Yu Yan, Houqiang Li, Ruofei Zhang, Weizhu Chen, Nan Duan | cs.CL | 2022-05-23 |
CORAL: Contextual Response Retrievability Loss Function for Training Dialog Generation Models | Bishal Santra, Ravi Ghadia, Arpit Dwivedi, Manish Gupta, Pawan Goyal | cs.CL | 2022-05-21 |
Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs | Ghazi Felhi, Joseph Le Roux, Djamé Seddah | cs.CL, cs.LG | 2022-05-12 |
Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning | Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen | cs.CL, cs.AI, cs.LG | 2022-05-08 |
Language Models Can See: Plugging Visual Controls in Text Generation | Yixuan Su, Tian Lan, Yahui Liu, Fangyu Liu, Dani Yogatama, Yan Wang, Lingpeng Kong, Nigel Collier | cs.CV, cs.CL | 2022-05-05 |
Efficient Few-Shot Fine-Tuning for Opinion Summarization | Arthur Bražinskas, Ramesh Nallapati, Mohit Bansal, Markus Dreyer | cs.CL, cs.AI, cs.LG | 2022-05-04 |
Faithful to the Document or to the World? Mitigating Hallucinations via Entity-linked Knowledge in Abstractive Summarization | Yue Dong, John Wieting, Pat Verga | cs.CL | 2022-04-28 |
Recovering Patient Journeys: A Corpus of Biomedical Entities and Relations on Twitter (BEAR) | Amelie Wührl, Roman Klinger | cs.CL, cs.IR | 2022-04-21 |
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond | Yisheng Xiao, Lijun Wu, Junliang Guo, Juntao Li, Min Zhang, Tao Qin, Tie-yan Liu | cs.CL, cs.LG | 2022-04-20 |
Rows from Many Sources: Enriching row completions from Wikidata with a pre-trained Language Model | Carina Negreanu, Alperen Karaoglu, Jack Williams, Shuang Chen, Daniel Fabian, Andrew Gordon, Chin-Yew Lin | cs.CL, cs.AI | 2022-04-14 |
GAP: A Graph-aware Language Model Framework for Knowledge Graph-to-Text Generation | Anthony Colas, Mehrdad Alvandipour, Daisy Zhe Wang | cs.CL | 2022-04-13 |
Explanation Graph Generation via Pre-trained Language Models: An Empirical Study with Contrastive Learning | Swarnadeep Saha, Prateek Yadav, Mohit Bansal | cs.CL, cs.AI, cs.LG | 2022-04-11 |
Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention | Yu Yang, Seungbae Kim, Jungseock Joo | cs.CV, cs.AI, cs.LG | 2022-04-10 |
Knowledge Infused Decoding | Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah | cs.CL, cs.AI, cs.LG | 2022-04-06 |
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations | Leonard Salewski, A. Sophia Koepke, Hendrik P. A. Lensch, Zeynep Akata | cs.CV, cs.CL | 2022-04-05 |
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos | Muheng Li, Lei Chen, Yueqi Duan, Zhilan Hu, Jianjiang Feng, Jie Zhou, Jiwen Lu | cs.CV, cs.AI, cs.LG | 2022-03-26 |
A Roadmap for Big Model | Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, Huawei Shen, Hui Zhang, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan Yao, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, Liwei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang | cs.LG, cs.AI, cs.CL | 2022-03-26 |
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal | Umang Gupta, Jwala Dhamala, Varun Kumar, Apurv Verma, Yada Pruksachatkun, Satyapriya Krishna, Rahul Gupta, Kai-Wei Chang, Greg Ver Steeg, Aram Galstyan | cs.CL, cs.LG | 2022-03-23 |
An Empirical Study on Learning and Improving the Search Objective for Unsupervised Paraphrasing | Weikai Steven Lu | cs.CL, cs.AI, cs.LG | 2022-03-23 |
Perturbations in the Wild: Leveraging Human-Written Text Perturbations for Realistic Adversarial Attack and Defense | Thai Le, Jooyoung Lee, Kevin Yen, Yifan Hu, Dongwon Lee | cs.LG, cs.CL, cs.CR | 2022-03-19 |
Dependency-based Mixture Language Models | Zhixian Yang, Xiaojun Wan | cs.CL | 2022-03-19 |
RoMe: A Robust Metric for Evaluating Natural Language Generation | Md Rashad Al Hasan Rony, Liubov Kovriguina, Debanjan Chaudhuri, Ricardo Usbeck, Jens Lehmann | cs.CL, cs.AI | 2022-03-17 |
PLANET: Dynamic Content Planning in Autoregressive Transformers for Long-form Text Generation | Zhe Hu, Hou Pong Chan, Jiachen Liu, Xinyan Xiao, Hua Wu, Lifu Huang | cs.CL | 2022-03-17 |
TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge | Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang | cs.CL, cs.AI | 2022-03-16 |
Graph Pre-training for AMR Parsing and Generation | Xuefeng Bai, Yulong Chen, Yue Zhang | cs.CL | 2022-03-15 |
Pruned Graph Neural Network for Short Story Ordering | Melika Golestani, Zeinab Borhanifard, Farnaz Tahmasebian, Heshaam Faili | cs.CL | 2022-03-13 |
IndicNLG Suite: Multilingual Datasets for Diverse NLG Tasks in Indic Languages | Aman Kumar, Himani Shrotriya, Prachi Sahu, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Amogh Mishra, Mitesh M. Khapra, Pratyush Kumar | cs.CL, cs.AI | 2022-03-10 |
Recent Advances in Neural Text Generation: A Task-Agnostic Survey | Chen Tang, Frank Guerin, Yucheng Li, Chenghua Lin | cs.CL, cs.AI | 2022-03-06 |
Deep Latent-Variable Models for Text Generation | Xiaoyu Shen | cs.CL | 2022-03-03 |