written on 2024-11-11
title | authors | categories | displaydate |
---|---|---|---|
The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization | Luka Borec, Philipp Sadler, David Schlangen | cs.CL | 2024-08-29 |
Spatio-Temporal Context Prompting for Zero-Shot Action Detection | Wei-Jhe Huang, Min-Hung Chen, Shang-Hong Lai | cs.CV, cs.AI | 2024-08-28 |
DIAGen: Diverse Image Augmentation with Generative Models | Tobias Lingenberg, Markus Reuter, Gopika Sudhakaran, Dominik Gojny, Stefan Roth, Simone Schaub-Meyer | cs.CV, cs.AI | 2024-08-26 |
Predictability and Causality in Spanish and English Natural Language Generation | Andrea Busto-Castiñeira, Francisco J. González-Castaño, Silvia García-Méndez, Francisco de Arriba-Pérez | cs.CL | 2024-08-26 |
Biomedical Large Languages Models Seem not to be Superior to Generalist Models on Unseen Medical Data | Felix J. Dorfner, Amin Dada, Felix Busch, Marcus R. Makowski, Tianyu Han, Daniel Truhn, Jens Kleesiek, Madhumita Sushil, Jacqueline Lammert, Lisa C. Adams, Keno K. Bressem | cs.CL | 2024-08-25 |
DHP Benchmark: Are LLMs Good NLG Evaluators? | Yicheng Wang, Jiayi Yuan, Yu-Neng Chuang, Zhuoer Wang, Yingchi Liu, Mark Cusick, Param Kulkarni, Zhengping Ji, Yasser Ibrahim, Xia Hu | cs.CL, cs.AI | 2024-08-25 |
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models | Yige Li, Hanxun Huang, Yunhan Zhao, Xingjun Ma, Jun Sun | cs.AI | 2024-08-23 |
GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models | Kunsheng Tang, Wenbo Zhou, Jie Zhang, Aishan Liu, Gelei Deng, Shuai Li, Peigui Qi, Weiming Zhang, Tianwei Zhang, Nenghai Yu | cs.CL, cs.AI | 2024-08-22 |
Preference-Guided Reflective Sampling for Aligning Language Models | Hai Ye, Hwee Tou Ng | cs.CL | 2024-08-22 |
Xinyu: An Efficient LLM-based System for Commentary Generation | Yiquan Wu, Bo Tang, Chenyang Xi, Yu Yu, Pengyu Wang, Yifei Liu, Kun Kuang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Jie Hu, Peng Cheng, Zhonghao Wang, Yi Wang, Yi Luo, Mingchuan Yang | cs.CL, cs.AI, I.2.7 | 2024-08-21 |
Tracing Privacy Leakage of Language Models to Training Data via Adjusted Influence Functions | Jinxin Liu, Zao Yang | cs.LG, cs.CL, cs.CR | 2024-08-20 |
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang, Li Shen, Yong Luo, Shuai Xie, Han Hu, Lefei Zhang, Bo Du, Dacheng Tao | cs.LG, cs.AI | 2024-08-19 |
LLMs’ Understanding of Natural Language Revealed | Walid S. Saba | cs.AI | 2024-07-29 |
The Power of Combining Data and Knowledge: GPT-4o is an Effective Interpreter of Machine Learning Models in Predicting Lymph Node Metastasis of Lung Cancer | Danqing Hu, Bing Liu, Xiaofeng Zhu, Nan Wu | cs.CL, cs.LG | 2024-07-25 |
Are Large Language Models Possible to Conduct Cognitive Behavioral Therapy? | Hao Shen, Zihan Li, Minqiang Yang, Minghui Ni, Yongfeng Tao, Zhengyang Yu, Weihao Zheng, Chen Xu, Bin Hu | cs.CL | 2024-07-25 |
IgnitionInnovators at “Discharge Me!”: Chain-of-Thought Instruction Finetuning Large Language Models for Discharge Summaries | An Quang Tang, Xiuzhen Zhang, Minh Ngoc Dinh | cs.CL | 2024-07-24 |
A Survey of Text Style Transfer: Applications and Ethical Implications | Sourabrata Mukherjee, Mateusz Lango, Zdenek Kasner, Ondrej Dušek | cs.CL | 2024-07-23 |
FairFlow: An Automated Approach to Model-based Counterfactual Data Augmentation For NLP | Ewoenam Kwaku Tokpo, Toon Calders | cs.CL | 2024-07-23 |
Robust Privacy Amidst Innovation with Large Language Models Through a Critical Assessment of the Risks | Yao-Shun Chuang, Atiquer Rahman Sarkar, Noman Mohammed, Xiaoqian Jiang | cs.CL | 2024-07-23 |
Finetuning Generative Large Language Models with Discrimination Instructions for Knowledge Graph Completion | Yang Liu, Xiaobin Tian, Zequn Sun, Wei Hu | cs.CL, cs.AI | 2024-07-23 |
When Do Universal Image Jailbreaks Transfer Between Vision-Language Models? | Rylan Schaeffer, Dan Valentine, Luke Bailey, James Chua, Cristóbal Eyzaguirre, Zane Durante, Joe Benton, Brando Miranda, Henry Sleight, John Hughes, Rajashree Agrawal, Mrinank Sharma, Scott Emmons, Sanmi Koyejo, Ethan Perez | cs.CL, cs.AI, cs.CR, cs.CV, cs.LG | 2024-07-21 |
Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Dylan J. Foster, Adam Block, Dipendra Misra | cs.LG, cs.AI, math.ST, stat.ML, stat.TH | 2024-07-20 |
Check-Eval: A Checklist-based Approach for Evaluating Text Quality | Jayr Pereira, Roberto Lotufo | cs.CL, cs.AI | 2024-07-19 |
From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards | Nicole Sultanum, Vidya Setlur | cs.HC | 2024-07-19 |
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization | Md Sultan Al Nahian, Ramakanth Kavuluru | cs.IR, cs.CL, cs.LG | 2024-07-19 |
Do LLMs have Consistent Values? | Naama Rozen, Gal Elidan, Amir Globerson, Ella Daniel | cs.CL, cs.AI | 2024-07-16 |
Are you still on track!? Catching LLM Task Drift with Activations | Sahar Abdelnabi, Aideen Fay, Giovanni Cherubin, Ahmed Salem, Mario Fritz, Andrew Paverd | cs.CR, cs.CL, cs.CY | 2024-06-02 |
Evaluating Large Language Model Biases in Persona-Steered Generation | Andy Liu, Mona Diab, Daniel Fried | cs.CL | 2024-05-30 |
Context Injection Attacks on Large Language Models | Cheng’an Wei, Kai Chen, Yue Zhao, Yujia Gong, Lu Xiang, Shenchen Zhu | cs.AI | 2024-05-30 |
Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation | Yi Liu, Xiangyu Liu, Xiangrong Zhu, Wei Hu | cs.CL, cs.AI | 2024-05-30 |
WRDScore: New Metric for Evaluation of Natural Language Generation Models | Ravil Mussabayev | cs.CL, cs.AI | 2024-05-29 |
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification | Laura Fieback, Jakob Spiegelberg, Hanno Gottschalk | cs.CV, cs.CL, cs.LG, I.4 | 2024-05-29 |
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities | Vicky Zayats, Peter Chen, Melissa Merrari, Dirk Padfield | cs.LG, cs.AI, cs.CL, eess.AS | 2024-05-29 |
Augmenting Textual Generation via Topology Aware Retrieval | Yu Wang, Nedim Lipka, Ruiyi Zhang, Alexa Siu, Yuying Zhao, Bo Ni, Xin Wang, Ryan Rossi, Tyler Derr | cs.IR | 2024-05-27 |
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting | Tong Ye, Yangkai Du, Tengfei Ma, Lingfei Wu, Xuhong Zhang, Shouling Ji, Wenhai Wang | cs.SE, cs.AI | 2024-05-25 |
Effective Unsupervised Constrained Text Generation based on Perturbed Masking | Yingwen Fu, Wenjie Ou, Zhou Yu, Yue Lin | cs.CL | 2024-04-24 |
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration | Dimitrios Michael Manias, Ali Chouman, Abdallah Shami | cs.NI, cs.AI | 2024-04-24 |
Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang, Donghyun Kim, Zihang Meng, Dat Huynh, Ser-Nam Lim | cs.CV, cs.AI | 2024-04-23 |
Identifying Fairness Issues in Automatically Generated Testing Content | Kevin Stowe, Benny Longwill, Alyssa Francis, Tatsuya Aoyama, Debanjan Ghosh, Swapna Somasundaran | cs.CL, I.2.7 | 2024-04-23 |
Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models | Yukyung Lee, Soonwon Ka, Bokyung Son, Pilsung Kang, Jaewook Kang | cs.CL, cs.AI, cs.HC | 2024-04-22 |
Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation | Lasal Jayawardena, Prasan Yapa | cs.CL, cs.AI, cs.LG | 2024-04-19 |
Sampling-based Pseudo-Likelihood for Membership Inference Attacks | Masahiro Kaneko, Youmi Ma, Yuki Wata, Naoaki Okazaki | cs.CL | 2024-04-17 |
Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng, Jingbo Shang | cs.CL | 2024-04-16 |
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu Sun | cs.AI, cs.CL, cs.CV | 2024-04-16 |
Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks | Xiao Zhang, Chunliu Wang, Rik van Noord, Johan Bos | cs.CL | 2024-04-12 |
Language Models for Text Classification: Is In-Context Learning Enough? | Aleksandra Edwards, Jose Camacho-Collados | cs.CL, cs.AI | 2024-03-26 |
Outcome-Constrained Large Language Models for Countering Hate Speech | Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song | cs.CL | 2024-03-25 |
UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Foundation Model for Urban Indicator Prediction | Xixuan Hao, Wei Chen, Yibo Yan, Siru Zhong, Kun Wang, Qingsong Wen, Yuxuan Liang | cs.CV, cs.AI | 2024-03-25 |
Visually Guided Generative Text-Layout Pre-training for Document Intelligence | Zhiming Mao, Haoli Bai, Lu Hou, Jiansheng Wei, Xin Jiang, Qun Liu, Kam-Fai Wong | cs.CL, cs.CV | 2024-03-25 |
The Frontier of Data Erasure: Machine Unlearning for Large Language Models | Youyang Qu, Ming Ding, Nan Sun, Kanchana Thilakarathna, Tianqing Zhu, Dusit Niyato | cs.AI | 2024-03-23 |
EAGLE: A Domain Generalization Framework for AI-generated Text Detection | Amrita Bhattacharjee, Raha Moraffah, Joshua Garland, Huan Liu | cs.CL, cs.AI, cs.LG | 2024-03-23 |
InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection | Thales Bertaglia, Lily Heisig, Rishabh Kaushal, Adriana Iamnitchi | cs.CY, cs.CL, cs.SI | 2024-03-22 |
ProSwitch: Knowledge-Guided Language Model Fine-Tuning to Generate Professional and Non-Professional Styled Text | Chang Zong, Yuyan Chen, Weiming Lu, Jian Shao, Yueting Zhuang | cs.CL, cs.AI, 68T50, I.2.7 | 2024-03-14 |
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes | Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki | cs.CL | 2024-03-12 |
generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation | Thilo Spinner, Rebecca Kehlbeck, Rita Sevastjanova, Tobias Stähle, Daniel A. Keim, Oliver Deussen, Mennatallah El-Assady | cs.HC, cs.LG, I.2.7; H.5.2 | 2024-03-12 |
Triples-to-isiXhosa (T2X): Addressing the Challenges of Low-Resource Agglutinative Data-to-Text Generation | Francois Meyer, Jan Buys | cs.CL | 2024-03-12 |
Matrix-Transformation Based Low-Rank Adaptation (MTLoRA): A Brain-Inspired Method for Parameter-Efficient Fine-Tuning | Yao Liang, Yuwei Wang, Yi Zeng | cs.CL, cs.AI | 2024-03-12 |
Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs | Tianqing Fang, Zeming Chen, Yangqiu Song, Antoine Bosselut | cs.CL, cs.AI | 2024-03-12 |
Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning | Mark D. McDonnell, Dong Gong, Ehsan Abbasnejad, Anton van den Hengel | cs.CV, cs.LG | 2024-03-12 |
One Category One Prompt: Dataset Distillation using Diffusion Models | Ali Abbasi, Ashkan Shahbazi, Hamed Pirsiavash, Soheil Kolouri | cs.CV, cs.CL, cs.LG | 2024-03-11 |
Narrating Causal Graphs with Large Language Models | Atharva Phatak, Vijay K. Mago, Ameeta Agrawal, Aravind Inbasekaran, Philippe J. Giabbanelli | cs.CL | 2024-03-11 |
LSTM-Based Text Generation: A Study on Historical Datasets | Mustafa Abbas Hussein Hussein, Serkan Savaş | cs.CL, cs.AI | 2024-03-11 |
Evolving Knowledge Distillation with Large Language Models and Active Learning | Chengyuan Liu, Yangyang Kang, Fubang Zhao, Kun Kuang, Zhuoren Jiang, Changlong Sun, Fei Wu | cs.CL | 2024-03-11 |
Defending Against Unforeseen Failure Modes with Latent Adversarial Training | Stephen Casper, Lennart Schulze, Oam Patel, Dylan Hadfield-Menell | cs.CR, cs.AI, cs.LG | 2024-03-08 |
Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs | Raghavv Goel, Mukul Gagrani, Wonseok Jeon, Junyoung Park, Mingu Lee, Christopher Lott | cs.LG, cs.AI, cs.CL | 2024-02-29 |
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples | Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee | cs.CV, cs.AI, cs.CL, cs.LG | 2024-02-20 |
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models models via Romanization | Jaavid Aktar Husain, Raj Dabre, Aswanth Kumar, Jay Gala, Thanmay Jayakumar, Ratish Puduppully, Anoop Kunchukuttan | cs.CL, cs.AI | 2024-01-25 |
Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction | Qi Sun, Kun Huang, Xiaocui Yang, Rong Tong, Kun Zhang, Soujanya Poria | cs.CL | 2024-01-24 |
IndiText Boost: Text Augmentation for Low Resource India Languages | Onkar Litake, Niraj Yagnik, Shreyas Labhsetwar | cs.CL, cs.AI, cs.LG | 2024-01-23 |
Unsupervised Learning of Graph from Recipes | Aissatou Diallo, Antonis Bikakis, Luke Dickens, Anthony Hunter, Rob Miller | cs.CL | 2024-01-22 |
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text | Abhimanyu Hans, Avi Schwarzschild, Valeriia Cherepanova, Hamid Kazemi, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein | cs.CL, cs.AI, cs.LG | 2024-01-22 |
Self-training from Self-memory in Data-to-text Generation | Hoang-Thang Ta | cs.CL | 2024-01-19 |
Large Language Models for Scientific Information Extraction: An Empirical Study for Virology | Mahsa Shamsabadi, Jennifer D’Souza, Sören Auer | cs.CL, cs.AI, cs.DL, cs.IT, math.IT | 2024-01-18 |
Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Xingyu Wu, Sheng-hao Wu, Jibin Wu, Liang Feng, Kay Chen Tan | cs.NE, cs.AI, cs.CL | 2024-01-18 |
Aligning Large Language Models with Counterfactual DPO | Bradley Butcher | cs.CL, cs.AI | 2024-01-17 |
Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Models | Tassilo Klein, Moin Nabi | cs.CL, cs.LG | 2024-01-16 |
Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration | Simone Balloccu, Ehud Reiter, Vivek Kumar, Diego Reforgiato Recupero, Daniele Riboni | cs.CL | 2024-01-16 |
Fine-grained Hallucination Detection and Editing for Language Models | Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi | cs.CL | 2024-01-12 |
Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers | Yuling Shi, Hongyu Zhang, Chengcheng Wan, Xiaodong Gu | cs.SE, cs.AI, cs.CL | 2024-01-12 |
Automating Knowledge Acquisition for Content-Centric Cognitive Agents Using LLMs | Sanjay Oruganti, Sergei Nirenburg, Jesse English, Marjorie McShane | cs.CL, cs.AI | 2023-12-27 |
Zur Darstellung eines mehrstufigen Prototypbegriffs in der multilingualen automatischen Sprachgenerierung: vom Korpus über word embeddings bis hin zum automatischen Wörterbuch | María José Domínguez Vázquez | cs.CL | 2023-12-26 |
PersianLLaMA: Towards Building First Persian Large Language Model | Mohammad Amin Abbasi, Arash Ghafouri, Mahdi Firouzmandi, Hassan Naderi, Behrouz Minaei Bidgoli | cs.CL, cs.AI | 2023-12-25 |
Balancing the Style-Content Trade-Off in Sentiment Transfer Using Polarity-Aware Denoising | Sourabrata Mukherjee, Zdeněk Kasner, Ondřej Dušek | cs.CL | 2023-12-22 |
Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors | Yi-Fan Zhang, Zhang Zhang, Liang Wang, Tieniu Tan, Rong Jin | cs.CL | 2023-12-20 |
Instruct-SCTG: Guiding Sequential Controlled Text Generation through Instructions | Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier | cs.CL, cs.AI | 2023-12-19 |
External Knowledge Augmented Polyphone Disambiguation Using Large Language Model | Chen Li | cs.CL | 2023-12-19 |
Compositional Generalization for Multi-label Text Classification: A Data-Augmentation Approach | Yuyang Chai, Zhuang Li, Jiahui Liu, Lei Chen, Fei Li, Donghong Ji, Chong Teng | cs.CL | 2023-12-18 |
Deep dive into language traits of AI-generated Abstracts | Vikas Kumar, Amisha Bharti, Devanshu Verma, Vasudha Bhatnagar | cs.CL, cs.LG | 2023-12-17 |
A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment Analysis | Jingyi Zhou, Jie Zhou, Jiabao Zhao, Siyin Wang, Haijun Shan, Gui Tao, Qi Zhang, Xuanjing Huang | cs.CL | 2023-12-16 |
Continuous Diffusion for Mixed-Type Tabular Data | Markus Mueller, Kathrin Gruber, Dennis Fok | cs.LG, stat.ML | 2023-12-16 |
GSQA: An End-to-End Model for Generative Spoken Question Answering | Min-Han Shih, Ho-Lam Chung, Yu-Chi Pai, Ming-Hao Hsu, Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee | cs.CL, cs.AI | 2023-12-15 |
Fast Sampling via De-randomization for Discrete Diffusion Models | Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu | cs.LG, cs.AI, stat.ML | 2023-12-14 |
ToViLaG: Your Visual-Language Generative Model is Also An Evildoer | Xinpeng Wang, Xiaoyuan Yi, Han Jiang, Shanlin Zhou, Zhihua Wei, Xing Xie | cs.CL, cs.AI | 2023-12-13 |
A Survey of Text Watermarking in the Era of Large Language Models | Aiwei Liu, Leyi Pan, Yijian Lu, Jingjing Li, Xuming Hu, Lijie Wen, Irwin King, Philip S. Yu | cs.CL, 68T50, I.2.7 | 2023-12-13 |
On Diverse Preferences for Large Language Model Alignment | Dun Zeng, Yong Dai, Pengyu Cheng, Tianhao Hu, Wanshun Chen, Nan Du, Zenglin Xu | cs.AI | 2023-12-12 |
Multilingual large language models leak human stereotypes across language boundaries | Yang Trista Cao, Anna Sotnikova, Jieyu Zhao, Linda X. Zou, Rachel Rudinger, Hal Daume III | cs.CL | 2023-12-12 |
Astrocyte-Enabled Advancements in Spiking Neural Networks for Large Language Modeling | Guobin Shen, Dongcheng Zhao, Yiting Dong, Yang Li, Jindong Li, Yi Zeng | cs.NE, cs.AI | 2023-12-12 |
Generative AI for Hate Speech Detection: Evaluation and Findings | Sagi Pendzel, Tomer Wullach, Amir Adler, Einat Minkov | cs.CL, cs.AI | 2023-11-16 |
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text | Yanzhu Guo, Guokan Shang, Michalis Vazirgiannis, Chloé Clavel | cs.CL | 2023-11-16 |
Aligning Neural Machine Translation Models: Human Feedback in Training and Inference | Miguel Moura Ramos, Patrick Fernandes, António Farinhas, André F. T. Martins | cs.CL | 2023-11-15 |
HELLaMA: LLaMA-based Table to Text Generation by Highlighting the Important Evidence | Junyi Bian, Xiaolei Qin, Wuhe Zou, Mengzuo Huang, Weidong Zhang | cs.CL | 2023-11-15 |
MAP’s not dead yet: Uncovering true language model modes by conditioning away degeneracy | Davis Yoshida, Kartik Goyal, Kevin Gimpel | cs.CL, cs.AI, cs.LG | 2023-11-15 |
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects | Minqian Liu, Ying Shen, Zhiyang Xu, Yixin Cao, Eunah Cho, Vaibhav Kumar, Reza Ghanadan, Lifu Huang | cs.CL, cs.AI, cs.LG | 2023-11-15 |
TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer | Huashan Sun, Yixiao Wu, Yinghao Li, Jiawei Li, Yizhe Yang, Yang Gao | cs.CL, cs.AI, I.2.7 | 2023-11-14 |
Artificial Text Boundary Detection with Topological Data Analysis and Sliding Window Techniques | Laida Kushnareva, Tatiana Gaintseva, German Magai, Serguei Barannikov, Dmitry Abulkhanov, Kristian Kuznetsov, Irina Piontkovskaya, Sergey Nikolenko | cs.CL | 2023-11-14 |
Learning Globally Optimized Language Structure via Adversarial Training | Xuwang Yin | cs.CL, cs.AI | 2023-11-12 |
Synthetic Speaking Children – Why We Need Them and How to Make Them | Muhammad Ali Farooq, Dan Bigioi, Rishabh Jain, Wang Yao, Mariam Yiwere, Peter Corcoran | cs.HC, cs.AI, cs.SD, eess.AS | 2023-11-08 |
Aspects of human memory and Large Language Models | Romuald A. Janik | cs.CL, cs.AI, cs.LG, q-bio.NC | 2023-11-07 |
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection | Harika Abburi, Kalyani Roy, Michael Suesserman, Nirmala Pudota, Balaji Veeramani, Edward Bowen, Sanmitra Bhattacharya | cs.CL, cs.AI | 2023-11-06 |
An Ensemble Method Based on the Combination of Transformers with Convolutional Neural Networks to Detect Artificially Generated Text | Vijini Liyanage, Davide Buscaldi | cs.CL | 2023-10-26 |
Automatic Logical Forms improve fidelity in Table-to-Text generation | Iñigo Alonso, Eneko Agirre | cs.CL | 2023-10-26 |
Beyond MLE: Convex Learning for Text Generation | Chenze Shao, Zhengrui Ma, Min Zhang, Yang Feng | cs.CL, cs.AI, cs.LG | 2023-10-26 |
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation | Mateusz Lango, Ondřej Dušek | cs.CL, I.2.7 | 2023-10-25 |
A Comprehensive Evaluation of Constrained Text Generation for Large Language Models | Xiang Chen, Xiaojun Wan | cs.CL | 2023-10-25 |
Octopus: A Multitask Model and Toolkit for Arabic Natural Language Generation | AbdelRahim Elmadany, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed | cs.CL | 2023-10-24 |
Woodpecker: Hallucination Correction for Multimodal Large Language Models | Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen | cs.CV, cs.AI, cs.CL, cs.LG | 2023-10-24 |
Let the Pretrained Language Models “Imagine” for Short Texts Topic Modeling | Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang | cs.CL | 2023-10-24 |
Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study | Injy Hamed, Nizar Habash, Ngoc Thang Vu | cs.CL | 2023-10-23 |
Statistical Depth for Ranking and Characterizing Transformer-Based Text Embeddings | Parker Seegmiller, Sarah Masud Preum | cs.CL | 2023-10-23 |
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models | Matthieu Meeus, Shubham Jain, Marek Rei, Yves-Alexandre de Montjoye | cs.CL, cs.CR, cs.LG | 2023-10-23 |
A Survey on LLM-generated Text Detection: Necessity, Methods, and Future Directions | Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Derek F. Wong, Lidia S. Chao | cs.CL, cs.AI | 2023-10-23 |
Text generation for dataset augmentation in security classification tasks | Alexander P. Welsh, Matthew Edwards | cs.CR, cs.CL | 2023-10-22 |
Towards Understanding Sycophancy in Language Models | Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Newton Cheng, Esin Durmus, Zac Hatfield-Dodds, Scott R. Johnston, Shauna Kravec, Timothy Maxwell, Sam McCandlish, Kamal Ndousse, Oliver Rausch, Nicholas Schiefer, Da Yan, Miranda Zhang, Ethan Perez | cs.CL, cs.AI, cs.LG, stat.ML, I.2.6 | 2023-10-20 |
GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Yuanchun Shen, Ruotong Liao, Zhen Han, Yunpu Ma, Volker Tresp | cs.CL | 2023-10-12 |
DistillSpec: Improving Speculative Decoding via Knowledge Distillation | Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat, Aditya Krishna Menon, Afshin Rostamizadeh, Sanjiv Kumar, Jean-François Kagy, Rishabh Agarwal | cs.CL, cs.AI, cs.LG | 2023-10-12 |
CP-KGC: Constrained-Prompt Knowledge Graph Completion with Large Language Models | Rui Yang, Li Fang, Yi Zhou | cs.CL, cs.AI | 2023-10-12 |
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models | Luiza Pozzobon, Beyza Ermis, Patrick Lewis, Sara Hooker | cs.AI | 2023-10-11 |
Multimodal Graph Learning for Generative Tasks | Minji Yoon, Jing Yu Koh, Bryan Hooi, Ruslan Salakhutdinov | cs.AI | 2023-10-11 |
MatChat: A Large Language Model and Application Service Platform for Materials Science | Ziyi Chen, Fankai Xie, Meng Wan, Yang Yuan, Miao Liu, Zongguo Wang, Sheng Meng, Yangang Wang | cond-mat.mtrl-sci, cs.AI | 2023-10-11 |
The Temporal Structure of Language Processing in the Human Brain Corresponds to The Layered Hierarchy of Deep Language Models | Ariel Goldstein, Eric Ham, Mariano Schain, Samuel Nastase, Zaid Zada, Avigail Dabush, Bobbi Aubrey, Harshvardhan Gazula, Amir Feder, Werner K Doyle, Sasha Devore, Patricia Dugan, Daniel Friedman, Roi Reichart, Michael Brenner, Avinatan Hassidim, Orrin Devinsky, Adeen Flinker, Omer Levy, Uri Hasson | cs.CL, cs.AI, cs.LG, q-bio.NC | 2023-10-11 |
A Semantic Invariant Robust Watermark for Large Language Models | Aiwei Liu, Leyi Pan, Xuming Hu, Shiao Meng, Lijie Wen | cs.CR, cs.CL, 68T50, I.2.7 | 2023-10-10 |
Generative quantum machine learning via denoising diffusion probabilistic models | Bingzhi Zhang, Peng Xu, Xiaohui Chen, Quntao Zhuang | quant-ph, cs.AI, cs.LG | 2023-10-09 |
RAUCG: Retrieval-Augmented Unsupervised Counter Narrative Generation for Hate Speech | Shuyu Jiang, Wenyi Tang, Xingshu Chen, Rui Tanga, Haizhou Wang, Wenxian Wang | cs.CL | 2023-10-09 |
On the Zero-Shot Generalization of Machine-Generated Text Detectors | Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, Tianxing He | cs.CL | 2023-10-08 |
Learning Personalized Story Evaluation | Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian | cs.CL | 2023-10-05 |
LPML: LLM-Prompting Markup Language for Mathematical Reasoning | Ryutaro Yamauchi, Sho Sonoda, Akiyoshi Sannai, Wataru Kumagai | cs.AI, cs.LG, cs.PL | 2023-09-21 |
MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods | Mara Finkelstein, Subhajit Naskar, Mehdi Mirzazadeh, Apurva Shah, Markus Freitag | cs.CL | 2023-09-19 |
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration | Rachneet Sachdeva, Martin Tutek, Iryna Gurevych | cs.CL | 2023-09-14 |
Semantic reconstruction of continuous language from MEG signals | Bo Wang, Xiran Xu, Longxiang Zhang, Boda Xiao, Xihong Wu, Jing Chen | cs.HC, eess.SP, q-bio.NC | 2023-09-14 |
Auto-Regressive Next-Token Predictors are Universal Learners | Eran Malach | cs.LG, cs.CL | 2023-09-13 |
Scaled Prompt-Tuning for Few-Shot Natural Language Generation | Ting Hu, Christoph Meinel, Haojin Yang | cs.CL | 2023-09-13 |
Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity | Joseph Gatto, Omar Sharif, Parker Seegmiller, Philip Bohlman, Sarah Masud Preum | cs.CL | 2023-09-12 |
Neural Latent Geometry Search: Product Manifold Inference via Gromov-Hausdorff-Informed Bayesian Optimization | Haitz Saez de Ocariz Borde, Alvaro Arroyo, Ismael Morales, Ingmar Posner, Xiaowen Dong | cs.LG, stat.ML | 2023-09-09 |
EPA: Easy Prompt Augmentation on Large Language Models via Multiple Sources and Multiple Targets | Hongyuan Lu, Wai Lam | cs.CL | 2023-09-09 |
ConDA: Contrastive Domain Adaptation for AI-generated Text Detection | Amrita Bhattacharjee, Tharindu Kumarage, Raha Moraffah, Huan Liu | cs.CL, cs.AI, cs.LG | 2023-09-07 |
Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation | Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz | cs.CL, cs.MM, cs.SD | 2023-09-06 |
Persona-aware Generative Model for Code-mixed Language | Ayan Sengupta, Md Shad Akhtar, Tanmoy Chakraborty | cs.CL, cs.LG | 2023-09-06 |
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning | Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, Luke Zettlemoyer, Armen Aghajanyan | cs.LG, cs.CL, cs.CV | 2023-09-05 |
PromptTTS 2: Describing and Generating Voices with Text Prompt | Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian | eess.AS, cs.CL, cs.LG, cs.SD | 2023-09-05 |
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks | Sarthak Anand | cs.CL, cs.AI | 2023-09-02 |
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing | Chen Wang, Minpeng Liao, Zhongqiang Huang, Jinliang Lu, Junhong Wu, Yuchen Liu, Chengqing Zong, Jiajun Zhang | cs.CL, cs.SD, eess.AS | 2023-09-02 |
Bias and Fairness in Large Language Models: A Survey | Isabel O. Gallegos, Ryan A. Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, Nesreen K. Ahmed | cs.CL, cs.AI, cs.CY, cs.LG | 2023-09-02 |
Reinforcement Learning for Generative AI: A Survey | Yuanjiang Cao, Quan Z. Sheng, Julian McAuley, Lina Yao | cs.LG, cs.AI | 2023-08-28 |
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records | Scott L. Fleming, Alejandro Lozano, William J. Haberkorn, Jenelle A. Jindal, Eduardo P. Reis, Rahul Thapa, Louis Blankemeier, Julian Z. Genkins, Ethan Steinberg, Ashwin Nayak, Birju S. Patel, Chia-Chun Chiang, Alison Callahan, Zepeng Huo, Sergios Gatidis, Scott J. Adams, Oluseyi Fayanju, Shreya J. Shah, Thomas Savage, Ethan Goh, Akshay S. Chaudhari, Nima Aghaeepour, Christopher Sharp, Michael A. Pfeffer, Percy Liang, Jonathan H. Chen, Keith E. Morse, Emma P. Brunskill, Jason A. Fries, Nigam H. Shah | cs.CL, cs.AI, cs.LG | 2023-08-27 |
Planning with Logical Graph-based Language Model for Instruction Generation | Fan Zhang, Kebing Jin, Hankz Hankui Zhuo | cs.CL, cs.AI | 2023-08-26 |
1.5 million materials narratives generated by chatbots | Yang Jeong Park, Sung Eun Jerng, Jin-Sung Park, Choah Kwon, Chia-Wei Hsu, Zhichu Ren, Sungroh Yoon, Ju Li | cond-mat.mtrl-sci, cs.CL | 2023-08-25 |
ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection | Yihao Fang, Xianzhi Li, Stephen W. Thomas, Xiaodan Zhu | cs.CL, cs.AI | 2023-08-25 |
GeoExplainer: A Visual Analytics Framework for Spatial Modeling Contextualization and Report Generation | Fan Lei, Yuxin Ma, Stewart Fotheringham, Elizabeth Mack, Ziqi Li, Mehak Sachdeva, Sarah Bardin, Ross Maciejewski | cs.HC, cs.LG | 2023-08-25 |
Random Word Data Augmentation with CLIP for Zero-Shot Anomaly Detection | Masato Tamura | cs.CV, cs.LG | 2023-08-22 |
Data-to-text Generation for Severely Under-Resourced Languages with GPT-3.5: A Bit of Help Needed from Google Translate | Michela Lorandi, Anya Belz | cs.CL, cs.AI | 2023-08-19 |
Mirror Diffusion Models | Jaesung Tae | cs.LG | 2023-08-11 |
Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning | Alexander Hanbo Li, Mingyue Shang, Evangelia Spiliopoulou, Jie Ma, Patrick Ng, Zhiguo Wang, Bonan Min, William Wang, Kathleen McKeown, Vittorio Castelli, Dan Roth, Bing Xiang | cs.CL | 2023-08-10 |
Emotion-Conditioned Text Generation through Automatic Prompt Optimization | Yarik Menchaca Resendiz, Roman Klinger | cs.CL | 2023-08-09 |
DataTales: Investigating the use of Large Language Models for Authoring Data-Driven Articles | Nicole Sultanum, Arjun Srinivasan | cs.HC, cs.CL | 2023-08-08 |
Generative Forests | Richard Nock, Mathieu Guillame-Bert | cs.LG, I.2.6 | 2023-08-07 |
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao | cs.CL, cs.AI | 2023-08-07 |
Towards Multiple References Era – Addressing Data Leakage and Limited Reference Diversity in NLG Evaluation | Xianfeng Zeng, Yijin Liu, Fandong Meng, Jie Zhou | cs.CL | 2023-08-06 |
Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text? | Amrita Bhattacharjee, Huan Liu | cs.CL, cs.AI | 2023-08-02 |
Feature-aware conditional GAN for category text generation | Xinze Li, Kezhi Mao, Fanfan Lin, Zijian Feng | cs.CL, cs.AI | 2023-08-02 |
CoSMo: A constructor specification language for Abstract Wikipedia’s content selection process | Kutz Arrieta, Pablo R. Fillottrani, C. Maria Keet | cs.CL, I.2.4; H.2.3 | 2023-08-01 |
Tackling Hallucinations in Neural Chart Summarization | Saad Obaid ul Islam, Iza Škrjanec, Ondřej Dušek, Vera Demberg | cs.CL, cs.LG | 2023-08-01 |
Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures | Kun Yuan, Vinkle Srivastav, Tong Yu, Joel Lavanchy, Pietro Mascagni, Nassir Navab, Nicolas Padoy | cs.CV, cs.AI | 2023-07-27 |
Evaluating Generative Models for Graph-to-Text Generation | Shuzhou Yuan, Michael Färber | cs.CL, cs.AI | 2023-07-27 |
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts | Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O’Connor | cs.CV, cs.AI, cs.CL, cs.LG | 2023-07-21 |
OxfordTVG-HIC: Can Machine Make Humorous Captions from Images? | Runjia Li, Shuyang Sun, Mohamed Elhoseiny, Philip Torr | cs.CV, cs.CL | 2023-07-21 |
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models | Michael Günther, Louis Milliken, Jonathan Geuter, Georgios Mastrapas, Bo Wang, Han Xiao | cs.CL, cs.AI, cs.IR, cs.LG, 68T50, H.3.1; H.3.3; I.2.7; I.5.4 | 2023-07-20 |
Visual Flow-based Programming Plugin for Brain Computer Interface in Computer-Aided Design | Tong Bill Xu, Saleh Kalantari | cs.HC, cs.SE | 2023-07-20 |
Generative Language Models on Nucleotide Sequences of Human Genes | Musa Nuri Ihtiyar, Arzucan Ozgur | q-bio.GN, cs.CL, cs.LG | 2023-07-20 |
FinGPT: Democratizing Internet-scale Data for Financial Large Language Models | Xiao-Yang Liu, Guoxuan Wang, Daochen Zha | cs.CL, cs.LG, q-fin.GN | 2023-07-19 |
Unveiling Gender Bias in Terms of Profession Across LLMs: Analyzing and Addressing Sociological Implications | Vishesh Thakur | cs.CL | 2023-07-18 |
COLLIE: Systematic Construction of Constrained Text Generation Tasks | Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik Narasimhan | cs.CL, cs.AI, cs.LG | 2023-07-17 |
Fast Quantum Algorithm for Attention Computation | Yeqi Gao, Zhao Song, Xin Yang, Ruizhe Zhang | quant-ph, cs.LG | 2023-07-16 |
Using Large Language Models for Zero-Shot Natural Language Generation from Knowledge Graphs | Agnes Axelsson, Gabriel Skantze | cs.CL, 68T50, I.2.7; I.2.4 | 2023-07-14 |
Generating Efficient Training Data via LLM-based Attribute Manipulation | Letian Peng, Yuwei Zhang, Jingbo Shang | cs.CL | 2023-07-14 |
DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations | Bo-Ru Lu, Nikita Haduong, Chia-Hsuan Lee, Zeqiu Wu, Hao Cheng, Paul Koester, Jean Utke, Tao Yu, Noah A. Smith, Mari Ostendorf | cs.CL | 2023-07-13 |
Reading Radiology Imaging Like The Radiologist | Yuhao Wang | cs.CV, cs.AI | 2023-07-12 |
PatternGPT :A Pattern-Driven Framework for Large Language Model Text Generation | Le Xiao, Xin Shan | cs.CL, cs.AI | 2023-07-02 |
Benchmarking Large Language Model Capabilities for Conditional Generation | Joshua Maynez, Priyanka Agrawal, Sebastian Gehrmann | cs.CL | 2023-06-29 |
Joint Level Generation and Translation Using Gameplay Videos | Negar Mirgati, Matthew Guzdial | cs.CV, cs.LG | 2023-06-29 |
You Can Generate It Again: Data-to-text Generation with Verification and Correction Prompting | Xuan Ren, Lingqiao Liu | cs.CL, cs.AI, cs.LG | 2023-06-28 |
Knowledge Graph-Augmented Korean Generative Commonsense Reasoning | Dahyun Jung, Jaehyung Seo, Jaewook Lee, Chanjun Park, Heuiseok Lim | cs.CL, cs.AI | 2023-06-26 |
AudioPaLM: A Large Language Model That Can Speak and Listen | Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Frank | cs.CL, cs.AI, cs.SD, eess.AS, stat.ML | 2023-06-22 |
Open-Domain Text Evaluation via Meta Distribution Modeling | Sidi Lu, Asli Celikyilmaz, Tianlu Wang, Nanyun Peng | cs.CL | 2023-06-20 |
ChatGPT is not Enough: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling | Linyao Yang, Hongyang Chen, Zhao Li, Xiao Ding, Xindong Wu | cs.CL, cs.AI | 2023-06-20 |
Explicit Syntactic Guidance for Neural Text Generation | Yafu Li, Leyang Cui, Jianhao Yan, Yongjing Yin, Wei Bi, Shuming Shi, Yue Zhang | cs.CL | 2023-06-20 |
Semi-supervised Relation Extraction via Data Augmentation and Consistency-training | Komal K. Teru | cs.CL, cs.IR | 2023-06-16 |
Building blocks for complex tasks: Robust generative event extraction for radiology reports under domain shifts | Sitong Zhou, Meliha Yetisgen, Mari Ostendorf | cs.CL | 2023-06-15 |
Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health | Shubo Tian, Qiao Jin, Lana Yeganova, Po-Ting Lai, Qingqing Zhu, Xiuying Chen, Yifan Yang, Qingyu Chen, Won Kim, Donald C. Comeau, Rezarta Islamaj, Aadit Kapoor, Xin Gao, Zhiyong Lu | cs.CY, cs.AI, cs.CL, q-bio.QM | 2023-06-15 |
DiffuDetox: A Mixed Diffusion Model for Text Detoxification | Griffin Floto, Mohammad Mahdi Abdollah Pour, Parsa Farinneya, Zhenwei Tang, Ali Pesaranghader, Manasa Bharadwaj, Scott Sanner | cs.CL, cs.LG | 2023-06-14 |
Unifying Large Language Models and Knowledge Graphs: A Roadmap | Shirui Pan, Linhao Luo, Yufei Wang, Chen Chen, Jiapu Wang, Xindong Wu | cs.CL, cs.AI | 2023-06-14 |
Large Language Models Sometimes Generate Purely Negatively-Reinforced Text | Fabien Roger | cs.LG, cs.CL | 2023-06-13 |
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking | Chris Cundy, Stefano Ermon | cs.LG, cs.AI | 2023-06-08 |
Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions | John Joon Young Chung, Ece Kamar, Saleema Amershi | cs.CL | 2023-06-07 |
Structured Voronoi Sampling | Afra Amini, Li Du, Ryan Cotterell | cs.CL, cs.AI | 2023-06-05 |
Adaptive and Personalized Exercise Generation for Online Language Learning | Peng Cui, Mrinmaya Sachan | cs.CL, cs.AI | 2023-06-04 |
Exposing Bias in Online Communities through Large-Scale Language Models | Celine Wald, Lukas Pfahler | cs.CL, cs.CY, cs.LG | 2023-06-04 |
Exploring semantic information in disease: Simple Data Augmentation Techniques for Chinese Disease Normalization | Wenqian Cui, Shaohui Liu, Xiangling Fu, Xien Liu, Ji Wu | cs.CL, cs.AI | 2023-06-02 |
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Zeqiu Wu, Yushi Hu, Weijia Shi, Nouha Dziri, Alane Suhr, Prithviraj Ammanabrolu, Noah A. Smith, Mari Ostendorf, Hannaneh Hajishirzi | cs.CL | 2023-06-02 |
Preference-grounded Token-level Guidance for Language Model Fine-tuning | Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou | cs.CL | 2023-06-01 |
Learning to Imagine: Visually-Augmented Natural Language Generation | Tianyi Tang, Yushuo Chen, Yifan Du, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen | cs.CL | 2023-05-26 |
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting | Lei Shu, Liangchen Luo, Jayakumar Hoskere, Yun Zhu, Canoee Liu, Simon Tong, Jindong Chen, Lei Meng | cs.CL, cs.AI | 2023-05-25 |
Balancing Effect of Training Dataset Distribution of Multiple Styles for Multi-Style Text Transfer | Debarati Das, David Ma, Dongyeop Kang | cs.CL | 2023-05-24 |
Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering | Avi Caciularu, Matthew E. Peters, Jacob Goldberger, Ido Dagan, Arman Cohan | cs.CL, cs.AI | 2023-05-24 |
Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing | Tianyi Tang, Hongyuan Lu, Yuchen Eleanor Jiang, Haoyang Huang, Dongdong Zhang, Wayne Xin Zhao, Furu Wei | cs.CL | 2023-05-24 |
Ghostbuster: Detecting Text Ghostwritten by Large Language Models | Vivek Verma, Eve Fleisig, Nicholas Tomlin, Dan Klein | cs.CL, cs.AI | 2023-05-24 |
Active Learning for Natural Language Generation | Yotam Perlitz, Ariel Gera, Michal Shmueli-Scheuer, Dafna Sheinwald, Noam Slonim, Liat Ein-Dor | cs.CL | 2023-05-24 |
LLMDet: A Large Language Models Detection Tool | Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng, Tat-Seng Chua | cs.CL | 2023-05-24 |
The ACL OCL Corpus: advancing Open science in Computational Linguistics | Shaurya Rohatgi, Yanxia Qin, Benjamin Aw, Niranjana Unnithan, Min-Yen Kan | cs.CL, cs.DL | 2023-05-24 |
Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers | Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, Arman Cohan | cs.CL | 2023-05-24 |
Universal Self-adaptive Prompting | Xingchen Wan, Ruoxi Sun, Hootan Nakhost, Hanjun Dai, Julian Martin Eisenschlos, Sercan O. Arik, Tomas Pfister | cs.CL, cs.AI, cs.LG | 2023-05-24 |
Faithful Low-Resource Data-to-Text Generation through Cycle Training | Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi, Oleg Rokhlenko | cs.CL | 2023-05-24 |
In-Context Demonstration Selection with Cross Entropy Difference | Dan Iter, Reid Pryzant, Ruochen Xu, Shuohang Wang, Yang Liu, Yichong Xu, Chenguang Zhu | cs.CL, cs.AI | 2023-05-24 |
Diffusion Models in NLP: A Survey | Hao Zou, Zae Myung Kim, Dongyeop Kang | cs.CL | 2023-05-24 |
QTSumm: A New Benchmark for Query-Focused Table Summarization | Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Xiangru Tang, Yumo Xu, Arman Cohan, Dragomir Radev | cs.CL | 2023-05-23 |
INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback | Wenda Xu, Danqing Wang, Liangming Pan, Zhenqiao Song, Markus Freitag, William Yang Wang, Lei Li | cs.CL, cs.AI | 2023-05-23 |
Process-To-Text: A Framework for the Quantitative Description of Processes in Natural Language | Yago Fontenla-Seco, Alberto Bugarín-Diz, Manuel Lama | cs.CL | 2023-05-23 |
STOAT: Structured Data to Analytical Text With Controls | Deepanway Ghosal, Preksha Nema, Aravindan Raghuveer | cs.CL, cs.AI | 2023-05-19 |
Generating Visual Spatial Description via Holistic 3D Scene Understanding | Yu Zhao, Hao Fei, Wei Ji, Jianguo Wei, Meishan Zhang, Min Zhang, Tat-Seng Chua | cs.CV, cs.CL | 2023-05-19 |
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability | Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, Barbara Plank | cs.CL, cs.AI, cs.LG | 2023-05-19 |
Exploiting Biased Models to De-bias Text: A Gender-Fair Rewriting Model | Chantal Amrhein, Florian Schottmann, Rico Sennrich, Samuel Läubli | cs.CL, I.2.7 | 2023-05-18 |
Cross-modality Data Augmentation for End-to-End Sign Language Translation | Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Hui Xiong | cs.CL | 2023-05-18 |
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval | Yue Yu, Yuchen Zhuang, Rongzhi Zhang, Yu Meng, Jiaming Shen, Chao Zhang | cs.CL, cs.IR, cs.LG | 2023-05-18 |
Equivariant Few-Shot Learning from Pretrained Models | Sourya Basu, Pulkit Katdare, Prasanna Sattigeri, Vijil Chenthamarakshan, Katherine Driggs-Campbell, Payel Das, Lav R. Varshney | cs.LG, cs.AI, cs.CL, cs.CV | 2023-05-17 |
Smaller Language Models are Better Black-box Machine-Generated Text Detectors | Fatemehsadat Mireshghallah, Justus Mattern, Sicun Gao, Reza Shokri, Taylor Berg-Kirkpatrick | cs.CL, cs.LG | 2023-05-17 |
Boosting Event Extraction with Denoised Structure-to-Text Augmentation | bo wang, Heyan Huang, Xiaochi Wei, Ge Shi, Xiao Liu, Chong Feng, Tong Zhou, Shuaiqiang Wang, Dawei Yin | cs.CL | 2023-05-16 |
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback | Shang-Ling Hsu, Raj Sanjay Shah, Prathik Senthil, Zahra Ashktorab, Casey Dugan, Werner Geyer, Diyi Yang | cs.HC, cs.CL | 2023-05-15 |
Creative Data Generation: A Review Focusing on Text and Poetry | Mohamad Elzohbi, Richard Zhao | cs.CL | 2023-05-15 |
Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages | Chunlan Ma, Ayyoob ImaniGooghari, Haotian Ye, Ehsaneddin Asgari, Hinrich Schütze | cs.CL | 2023-05-15 |
MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling | Yu Song, Santiago Miret, Bang Liu | cs.CL, cond-mat.mtrl-sci, cs.AI | 2023-05-14 |
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation | Kun Zhou, Yifan Li, Wayne Xin Zhao, Ji-Rong Wen | cs.CL | 2023-05-06 |
Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain | Liqiang Jing, Xuemeng Song, Xuming Lin, Zhongzhou Zhao, Wei Zhou, Liqiang Nie | cs.CL | 2023-05-05 |
VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation | Xilun Chen, Lili Yu, Wenhan Xiong, Barlas Oğuz, Yashar Mehdad, Wen-tau Yih | cs.CV, cs.CL | 2023-05-04 |
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs | Jinyang Li, Binyuan Hui, Ge Qu, Binhua Li, Jiaxi Yang, Bowen Li, Bailin Wang, Bowen Qin, Rongyu Cao, Ruiying Geng, Nan Huo, Chenhao Ma, Kevin C. C. Chang, Fei Huang, Reynold Cheng, Yongbin Li | cs.CL | 2023-05-04 |
How to Choose Pretrained Handwriting Recognition Models for Single Writer Fine-Tuning | Vittorio Pippi, Silvia Cascianelli, Christopher Kermorvant, Rita Cucchiara | cs.CV, cs.DL | 2023-05-04 |
Governance of the AI, by the AI, and for the AI | Andrew W. Torrance, Bill Tomlinson | cs.CY, cs.AI | 2023-05-04 |
Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System | Namo Bang, Jeehyun Lee, Myoung-Wan Koo | cs.CL | 2023-05-04 |
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training | Nitay Calderon, Subhabrata Mukherjee, Roi Reichart, Amir Kantor | cs.CL, cs.AI | 2023-05-03 |
Towards Summarizing Multiple Documents with Hierarchical Relationships | Miao Li, Eduard Hovy, Jey Han Lau | cs.CL, cs.AI | 2023-05-02 |
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation | Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins | cs.CL, cs.AI, cs.LG | 2023-05-01 |
A Comprehensive AI Policy Education Framework for University Teaching and Learning | Cecilia Ka Yuk Chan | cs.CY, cs.AI | 2023-04-29 |
CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants | Albert Yu Sun, Varun Nair, Elliot Schumacher, Anitha Kannan | cs.CL, cs.AI, cs.LG | 2023-04-27 |
Controlled Text Generation with Natural Language Instructions | Wangchunshu Zhou, Yuchen Eleanor Jiang, Ethan Wilcox, Ryan Cotterell, Mrinmaya Sachan | cs.CL, cs.AI, cs.LG | 2023-04-27 |
SweCTRL-Mini: a data-transparent Transformer-based large language model for controllable text generation in Swedish | Dmytro Kalpakchi, Johan Boye | cs.CL | 2023-04-27 |
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond | Jingfeng Yang, Hongye Jin, Ruixiang Tang, Xiaotian Han, Qizhang Feng, Haoming Jiang, Bing Yin, Xia Hu | cs.CL, cs.AI, cs.LG | 2023-04-26 |
Evaluating Inter-Bilingual Semantic Parsing for Indian Languages | Divyanshu Aggarwal, Vivek Gupta, Anoop Kunchukuttan | cs.CL | 2023-04-25 |
Which Factors Predict the Chat Experience of a Natural Language Generation Dialogue Service? | Eason Chen | cs.CL, cs.HC | 2023-04-21 |
Multi-aspect Repetition Suppression and Content Moderation of Large Language Models | Minghui Zhang, Alex Sokolov, Weixin Cai, Si-Qing Chen | cs.CL, cs.LG | 2023-04-20 |
GPT-NER: Named Entity Recognition via Large Language Models | Shuhe Wang, Xiaofei Sun, Xiaoya Li, Rongbin Ouyang, Fei Wu, Tianwei Zhang, Jiwei Li, Guoyin Wang | cs.CL | 2023-04-20 |
Towards Zero-Shot Personalized Table-to-Text Generation with Contrastive Persona Distillation | Haolan Zhan, Xuming Lin, Shaobo Cui, Zhongzhou Zhao, Wei Zhou, Haiqing Chen | cs.CL | 2023-04-18 |
LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction | Abdullatif Köksal, Timo Schick, Anna Korhonen, Hinrich Schütze | cs.CL, cs.AI, cs.LG | 2023-04-17 |
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset | Sihan Chen, Xingjian He, Longteng Guo, Xinxin Zhu, Weining Wang, Jinhui Tang, Jing Liu | cs.LG, cs.CL, cs.CV, cs.MM, eess.AS | 2023-04-17 |
VISAR: A Human-AI Argumentative Writing Assistant with Visual Programming and Rapid Draft Prototyping | Zheng Zhang, Jie Gao, Ranjodh Singh Dhaliwal, Toby Jia-Jun Li | cs.HC, cs.AI, cs.CL, cs.LG | 2023-04-16 |
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models | Yikang Liu, Ziyin Zhang, Wanyang Zhang, Shisen Yue, Xiaojing Zhao, Xinyuan Cheng, Yiwen Zhang, Hai Hu | cs.CL | 2023-04-16 |
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning | Viet Dac Lai, Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen | cs.CL, cs.AI | 2023-04-12 |
Automated Reading Passage Generation with OpenAI’s Large Language Model | Ummugul Bezirhan, Matthias von Davier | cs.CL, cs.AI | 2023-04-10 |
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder | Zihao Fu, Wai Lam, Qian Yu, Anthony Man-Cho So, Shengding Hu, Zhiyuan Liu, Nigel Collier | cs.CL, cs.AI, cs.LG | 2023-04-08 |
Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic Data | Boris van Breugel, Mihaela van der Schaar | cs.LG | 2023-04-07 |
Measuring and Manipulating Knowledge Representations in Language Models | Evan Hernandez, Belinda Z. Li, Jacob Andreas | cs.CL | 2023-04-03 |
Decoding the End-to-end Writing Trajectory in Scholarly Manuscripts | Ryan Koo, Anna Martin, Linghe Wang, Dongyeop Kang | cs.CL, cs.HC | 2023-03-31 |
Assessing Language Model Deployment with Risk Cards | Leon Derczynski, Hannah Rose Kirk, Vidhisha Balachandran, Sachin Kumar, Yulia Tsvetkov, M. R. Leiser, Saif Mohammad | cs.CL | 2023-03-31 |
Prefix tuning for automated audio captioning | Minkyu Kim, Kim Sung-Bin, Tae-Hyun Oh | eess.AS, cs.MM, cs.SD | 2023-03-30 |
Humans in Humans Out: On GPT Converging Toward Common Sense in both Success and Failure | Philipp Koralus, Vincent Wang-Maścianica | cs.AI, cs.CL, cs.HC, cs.LG, 00, 68, I.2.0; I.2.6 | 2023-03-30 |
Foundation Models and Fair Use | Peter Henderson, Xuechen Li, Dan Jurafsky, Tatsunori Hashimoto, Mark A. Lemley, Percy Liang | cs.CY, cs.AI, cs.LG | 2023-03-28 |
GPT is becoming a Turing machine: Here are some ways to program it | Ana Jojic, Zhen Wang, Nebojsa Jojic | cs.CL | 2023-03-25 |
CoBIT: A Contrastive Bi-directional Image-Text Generation Model | Haoxuan You, Mandy Guo, Zhecan Wang, Kai-Wei Chang, Jason Baldridge, Jiahui Yu | cs.CV, cs.CL | 2023-03-23 |
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense | Kalpesh Krishna, Yixiao Song, Marzena Karpinska, John Wieting, Mohit Iyyer | cs.CL, cs.CR, cs.LG | 2023-03-23 |
Compositional Zero-Shot Domain Transfer with Text-to-Text Models | Fangyu Liu, Qianchu Liu, Shruthi Bannur, Fernando Pérez-García, Naoto Usuyama, Sheng Zhang, Tristan Naumann, Aditya Nori, Hoifung Poon, Javier Alvarez-Valle, Ozan Oktay, Stephanie L. Hyland | cs.CL, cs.LG | 2023-03-23 |
JaCoText: A Pretrained Model for Java Code-Text Generation | Jessica López Espejel, Mahaman Sanoussi Yahaya Alassan, Walid Dahhane, El Hassane Ettifouri | cs.CL | 2023-03-22 |
Chinese Intermediate English Learners outdid ChatGPT in deep cohesion: Evidence from English narrative writing | Tongquan Zhou, Siyi Cao, Siruo Zhou, Yao Zhang, Aijing He | cs.CL | 2023-03-21 |
Code-Switching Text Generation and Injection in Mandarin-English ASR | Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng | eess.AS, cs.CL, cs.SD | 2023-03-20 |
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models | Vithursan Thangarasa, Abhay Gupta, William Marshall, Tianda Li, Kevin Leong, Dennis DeCoste, Sean Lie, Shreyas Saxena | cs.LG, cs.CL | 2023-03-18 |
HIVE: Harnessing Human Feedback for Instructional Visual Editing | Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu | cs.CV, cs.AI, cs.CL, cs.HC, cs.LG | 2023-03-16 |
Input-length-shortening and text generation via attention values | Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock | cs.CL | 2023-03-14 |
Diffusion Models for Non-autoregressive Text Generation: A Survey | Yifan Li, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen | cs.CL | 2023-03-12 |
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation | Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton | cs.CL, cs.AI, cs.CV | 2023-03-11 |
Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation | Bing He, Mustaque Ahamad, Srijan Kumar | cs.SI, cs.LG | 2023-03-11 |
An Overview on Language Models: Recent Developments and Outlook | Chengwei Wei, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo | cs.CL | 2023-03-10 |
Is ChatGPT a Good NLG Evaluator? A Preliminary Study | Jiaan Wang, Yunlong Liang, Fandong Meng, Haoxiang Shi, Zhixu Li, Jinan Xu, Jianfeng Qu, Jie Zhou | cs.CL, cs.AI | 2023-03-07 |
Large Language Models as Zero-Shot Human Models for Human-Robot Interaction | Bowen Zhang, Harold Soh | cs.RO, cs.CL, cs.HC, cs.LG | 2023-03-06 |
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training | Wei Li, Linchao Zhu, Longyin Wen, Yi Yang | cs.CV, cs.AI, cs.CL | 2023-03-06 |
UZH_CLyp at SemEval-2023 Task 9: Head-First Fine-Tuning and ChatGPT Data Generation for Cross-Lingual Learning in Tweet Intimacy Prediction | Andrianos Michail, Stefanos Konstantinou, Simon Clematide | cs.CL, cs.AI, 68T50 | 2023-03-02 |
A Universal Question-Answering Platform for Knowledge Graphs | Reham Omar, Ishika Dhall, Panos Kalnis, Essam Mansour | cs.AI, cs.CL, cs.DB | 2023-03-01 |
TabGenie: A Toolkit for Table-to-Text Generation | Zdeněk Kasner, Ekaterina Garanina, Ondřej Plátek, Ondřej Dušek | cs.CL | 2023-02-27 |
Tailoring Language Generation Models under Total Variation Distance | Haozhe Ji, Pei Ke, Zhipeng Hu, Rongsheng Zhang, Minlie Huang | cs.CL | 2023-02-26 |
Few-Shot Table-to-Text Generation with Prompt-based Adapter | Zhixin Guo, Minyxuan Yan, Jiexing Qi, Jianping Zhou, Ziwei He, Zhouhan Lin, Guanjie Zheng, Xinbing Wang | cs.CL | 2023-02-24 |
Improved Training of Mixture-of-Experts Language GANs | Yekun Chai, Qiyue Yin, Junge Zhang | cs.CL | 2023-02-23 |
Improving User Controlled Table-To-Text Generation Robustness | Hanxu Hu, Yunqing Liu, Zhongyi Yu, Laura Perez-Beltrachini | cs.CL | 2023-02-20 |
Do We Still Need Clinical Language Models? | Eric Lehman, Evan Hernandez, Diwakar Mahajan, Jonas Wulff, Micah J. Smith, Zachary Ziegler, Daniel Nadler, Peter Szolovits, Alistair Johnson, Emily Alsentzer | cs.CL | 2023-02-16 |
Tree-Based Representation and Generation of Natural and Mathematical Language | Alexander Scarlatos, Andrew Lan | cs.CL | 2023-02-15 |
AutoBiasTest: Controllable Sentence Generation for Automated and Open-Ended Social Bias Testing in Language Models | Rafal Kocielnik, Shrimai Prabhumoye, Vivian Zhang, R. Michael Alvarez, Anima Anandkumar | cs.CL, cs.CY, 68T50, I.2.7; J.5; K.4.1 | 2023-02-14 |
Large Scale Multi-Lingual Multi-Modal Summarization Dataset | Yash Verma, Anubhav Jangra, Raghvendra Kumar, Sriparna Saha | cs.CL, cs.MM | 2023-02-13 |
Combined Location Online Weather Data: Easy-to-use Targeted Weather Analysis for Agriculture | Darren Yates, Christopher Blanchard, Allister Clarke, Sabih-Ur Rehman, Md Zahidul Islam, Russell Ford, Rob Walsh | cs.SI, J.2 | 2023-02-13 |
Investigating the Effect of Relative Positional Embeddings on AMR-to-Text Generation with Structural Adapters | Sebastien Montella, Alexis Nasr, Johannes Heinecke, Frederic Bechet, Lina M. Rojas-Barahona | cs.CL | 2023-02-12 |
Plan-then-Seam: Towards Efficient Table-to-Text Generation | Liang Li, Ruiying Geng, Chengyang Fang, Bing Li, Can Ma, Binhua Li, Yongbin Li | cs.CL | 2023-02-10 |
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning | Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Ming-Yu Liu, Yuke Zhu, Mohammad Shoeybi, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar | cs.CV, cs.AI, cs.CL, cs.IR, cs.LG | 2023-02-09 |
Lightweight Transformers for Clinical Natural Language Processing | Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson, David A. Clifton | cs.CL, cs.AI, cs.LG, 68T50, I.2.7 | 2023-02-09 |
Auto-Learning: An Adversarial Process of Two Pre-trained Models for Natural Language Generation | Zhengqing Yuan, Yuelin Lu, Chao Zhang, Huiwen Xue | cs.CL | 2023-02-08 |
What Matters In The Structured Pruning of Generative Language Models? | Michael Santacroce, Zixin Wen, Yelong Shen, Yuanzhi Li | cs.CL, cs.LG | 2023-02-07 |
Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing | Han He, Jinho D. Choi | cs.CL | 2023-02-05 |
Grounding Language Models to Images for Multimodal Generation | Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried | cs.CL, cs.AI, cs.CV, cs.LG | 2023-01-31 |
Semi-Parametric Video-Grounded Text Generation | Sungdong Kim, Jin-Hwa Kim, Jiyoung Lee, Minjoon Seo | cs.CV, cs.CL, cs.LG | 2023-01-27 |
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature | Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D. Manning, Chelsea Finn | cs.CL, cs.AI | 2023-01-26 |
Distilling Text into Circuits | Vincent Wang-Mascianica, Jonathon Liu, Bob Coecke | cs.CL, cs.AI, cs.LO, math.CT | 2023-01-25 |
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER | Xiang Chen, Lei Li, Qiaoshuo Fei, Ningyu Zhang, Chuanqi Tan, Yong Jiang, Fei Huang, Huajun Chen | cs.CL, cs.AI, cs.DB, cs.IR, cs.LG | 2023-01-25 |
Audience-Centric Natural Language Generation via Style Infusion | Samraj Moorjani, Adit Krishnan, Hari Sundaram, Ewa Maslowska, Aravind Sankar | cs.CL, cs.LG | 2023-01-24 |
ExClaim: Explainable Neural Claim Verification Using Rationalization | Sai Gurrapu, Lifu Huang, Feras A. Batarseh | cs.CL | 2023-01-21 |
Regeneration Learning: A Learning Paradigm for Data Generation | Xu Tan, Tao Qin, Jiang Bian, Tie-Yan Liu, Yoshua Bengio | cs.LG, cs.AI, cs.CL, cs.CV, eess.AS | 2023-01-21 |
UserSimCRS: A User Simulation Toolkit for Evaluating Conversational Recommender Systems | Jafar Afzali, Aleksander Mark Drzewiecki, Krisztian Balog, Shuo Zhang | cs.IR | 2023-01-13 |
Universal Multimodal Representation for Language Understanding | Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao | cs.CL, cs.AI, cs.CV | 2023-01-09 |
Sequentially Controlled Text Generation | Alexander Spangher, Xinyu Hua, Yao Ming, Nanyun Peng | cs.CL, cs.AI, cs.LG | 2023-01-05 |
Towards Table-to-Text Generation with Pretrained Language Model: A Table Structure Understanding and Text Deliberating Approach | Miao Chen, Xinjiang Lu, Tong Xu, Yanyan Li, Jingbo Zhou, Dejing Dou, Hui Xiong | cs.CL, cs.AI | 2023-01-05 |
eVAE: Evolutionary Variational Autoencoder | Zhangkai Wu, Longbing Cao, Lei Qi | cs.NE, cs.LG | 2023-01-01 |
MAUVE Scores for Generative Models: Theory and Practice | Krishna Pillutla, Lang Liu, John Thickstun, Sean Welleck, Swabha Swayamdipta, Rowan Zellers, Sewoong Oh, Yejin Choi, Zaid Harchaoui | cs.LG, cs.AI, cs.CL | 2022-12-30 |
TegFormer: Topic-to-Essay Generation with Good Topic Coverage and High Text Coherence | Wang Qi, Rui Liu, Yuan Zuo, Yong Chen, Dell Zhang | cs.CL | 2022-12-27 |
TextBox 2.0: A Text Generation Library with Pre-trained Language Models | Tianyi Tang, Junyi Li, Zhipeng Chen, Yiwen Hu, Zhuohao Yu, Wenxun Dai, Zican Dong, Xiaoxue Cheng, Yuhao Wang, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen | cs.CL | 2022-12-26 |
CORRPUS: Detecting Story Inconsistencies via Codex-Bootstrapped Neurosymbolic Reasoning | Yijiang River Dong, Lara J. Martin, Chris Callison-Burch | cs.CL | 2022-12-21 |
Tracing and Removing Data Errors in Natural Language Generation Datasets | Faisal Ladhak, Esin Durmus, Tatsunori Hashimoto | cs.CL | 2022-12-21 |
SimpleStyle: An Adaptable Style Transfer Approach | Elron Bandel, Yoav Katz, Noam Slonim, Liat Ein-Dor | cs.CL | 2022-12-20 |
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning | Xiaoming Liu, Zhaohan Zhang, Yichen Wang, Yu Lan, Chao Shen | cs.CL | 2022-12-20 |
Toward Human-Like Evaluation for Natural Language Generation with Error Analysis | Qingyu Lu, Liang Ding, Liping Xie, Kanjian Zhang, Derek F. Wong, Dacheng Tao | cs.CL | 2022-12-20 |
WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning | Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Sujian Li, Yajuan Lv | cs.CL | 2022-12-20 |
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation | Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James Glass, Yulia Tsvetkov | cs.CL | 2022-12-20 |
One Embedder, Any Task: Instruction-Finetuned Text Embeddings | Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu | cs.CL | 2022-12-19 |
Difformer: Empowering Diffusion Model on Embedding Space for Text Generation | Zhujin Gao, Junliang Guo, Xu Tan, Yongxin Zhu, Fang Zhang, Jiang Bian, Linli Xu | cs.CL, cs.AI, cs.LG | 2022-12-19 |
SEScore2: Retrieval Augmented Pretraining for Text Generation Evaluation | Wenda Xu, Xian Qian, Mingxuan Wang, Lei Li, William Yang Wang | cs.CL | 2022-12-19 |
Synthesis and Evaluation of a Domain-specific Large Data Set for Dungeons & Dragons | Akila Peiris, Nisansa de Silva | cs.CL, cs.LG | 2022-12-18 |
RISE: Leveraging Retrieval Techniques for Summarization Evaluation | David Uthus, Jianmo Ni | cs.CL | 2022-12-17 |
DuNST: Dual Noisy Self Training for Semi-Supervised Controllable Text Generation | Yuxi Feng, Xiaoyuan Yi, Xiting Wang, Laks V. S. Lakshmanan, Xing Xie | cs.CL | 2022-12-16 |
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation | Swarnadeep Saha, Xinyan Velocity Yu, Mohit Bansal, Ramakanth Pasunuru, Asli Celikyilmaz | cs.CL, cs.AI, cs.LG | 2022-12-16 |
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Yekun Chai, Shuohuan Wang, Chao Pang, Yu Sun, Hao Tian, Hua Wu | cs.CL, cs.LG, cs.PL, cs.SE | 2022-12-13 |
Collaborating Heterogeneous Natural Language Processing Tasks via Federated Learning | Chenhe Dong, Yuexiang Xie, Bolin Ding, Ying Shen, Yaliang Li | cs.CL | 2022-12-12 |
T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics | Yiwei Qin, Weizhe Yuan, Graham Neubig, Pengfei Liu | cs.CL | 2022-12-12 |
The Role of AI in Drug Discovery: Challenges, Opportunities, and Strategies | Alexandre Blanco-Gonzalez, Alfonso Cabezon, Alejandro Seco-Gonzalez, Daniel Conde-Torres, Paula Antelo-Riveiro, Angel Pineiro, Rebeca Garcia-Fandino | cs.CL, cs.AI, cs.CY | 2022-12-08 |
Controlled Language Generation for Language Learning Items | Kevin Stowe, Debanjan Ghosh, Mengxuan Zhao | cs.CL, I.2.7 | 2022-11-28 |
CodeExp: Explanatory Code Document Generation | Haotian Cui, Chenglong Wang, Junjie Huang, Jeevana Priya Inala, Todd Mytkowicz, Bo Wang, Jianfeng Gao, Nan Duan | cs.CL, cs.LG, I.2.2; I.2.7 | 2022-11-25 |
MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts | Xiangyu Xi, Jianwei Lv, Shuaipeng Liu, Wei Ye, Fan Yang, Guanglu Wan | cs.CL | 2022-11-25 |
Retrieval-Augmented Multimodal Language Modeling | Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih | cs.CV, cs.CL, cs.LG | 2022-11-22 |
How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation | Jie Ruan, Yue Wu, Xiaojun Wan, Yuesheng Zhu | cs.CV, cs.CL | 2022-11-20 |
GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation | Biyang Guo, Yeyun Gong, Yelong Shen, Songqiao Han, Hailiang Huang, Nan Duan, Weizhu Chen | cs.CL | 2022-11-18 |
Towards Computationally Verifiable Semantic Grounding for Language Models | Chris Alberti, Kuzman Ganchev, Michael Collins, Sebastian Gehrmann, Ciprian Chelba | cs.CL | 2022-11-16 |
Reward Gaming in Conditional Text Generation | Richard Yuanzhe Pang, Vishakh Padmakumar, Thibault Sellam, Ankur P. Parikh, He He | cs.CL, cs.AI, cs.LG | 2022-11-16 |
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding | Mirac Suzgun, Luke Melas-Kyriazi, Dan Jurafsky | cs.CL, cs.LG | 2022-11-14 |
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention | Wenhao Li, Xiaoyuan Yi, Jinyi Hu, Maosong Sun, Xing Xie | cs.CL | 2022-11-14 |
Controllable Citation Text Generation | Nianlong Gu, Richard H. R. Hahnloser | cs.CL | 2022-11-14 |
Self-conditioned Embedding Diffusion for Text Generation | Robin Strudel, Corentin Tallec, Florent Altché, Yilun Du, Yaroslav Ganin, Arthur Mensch, Will Grathwohl, Nikolay Savinov, Sander Dieleman, Laurent Sifre, Rémi Leblond | cs.CL, cs.LG | 2022-11-08 |
Generative Transformers for Design Concept Generation | Qihao Zhu, Jianxi Luo | cs.CL | 2022-11-07 |
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering | Helena Bonaldi, Sara Dellantonio, Serra Sinem Tekiroglu, Marco Guerini | cs.CL, cs.CY | 2022-11-07 |
Time-aware Prompting for Text Generation | Shuyang Cao, Lu Wang | cs.CL | 2022-11-03 |
TaTa: A Multilingual Table-to-Text Dataset for African Languages | Sebastian Gehrmann, Sebastian Ruder, Vitaly Nikolaev, Jan A. Botha, Michael Chavinda, Ankur Parikh, Clara Rivera | cs.CL, cs.LG | 2022-10-31 |
DiffusER: Discrete Diffusion via Edit-based Reconstruction | Machel Reid, Vincent J. Hellendoorn, Graham Neubig | cs.CL, cs.LG | 2022-10-30 |
Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers | Abhijeet Awasthi, Ashutosh Sathe, Sunita Sarawagi | cs.CL, cs.AI, cs.LG | 2022-10-29 |
Nearest Neighbor Language Models for Stylistic Controllable Generation | Severino Trotta, Lucie Flek, Charles Welch | cs.CL | 2022-10-27 |
Categorical SDEs with Simplex Diffusion | Pierre H. Richemond, Sander Dieleman, Arnaud Doucet | cs.LG | 2022-10-26 |
SentBS: Sentence-level Beam Search for Controllable Summarization | Chenhui Shen, Liying Cheng, Lidong Bing, Yang You, Luo Si | cs.CL | 2022-10-26 |
On the Effectiveness of Automated Metrics for Text Generation Systems | Pius von Däniken, Jan Deriu, Don Tuggener, Mark Cieliebak | cs.CL, cs.AI | 2022-10-24 |
Finding Memo: Extractive Memorization in Constrained Sequence Generation Tasks | Vikas Raunak, Arul Menezes | cs.CL, cs.AI, cs.LG | 2022-10-24 |
Mapping Process for the Task: Wikidata Statements to Text as Wikipedia Sentences | Hoang Thang Ta, Alexander Gelbukha, Grigori Sidorov | cs.CL, cs.AI | 2022-10-23 |
Hard Gate Knowledge Distillation – Leverage Calibration for Robust and Reliable Language Model | Dongkyu Lee, Zhiliang Tian, Yingxiu Zhao, Ka Chun Cheung, Nevin L. Zhang | cs.CL, cs.AI | 2022-10-22 |
Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation | Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie | cs.CL | 2022-10-22 |
ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples | Yilun Zhao, Linyong Nan, Zhenting Qi, Rui Zhang, Dragomir Radev | cs.CL | 2022-10-22 |
The University of Edinburgh’s Submission to the WMT22 Code-Mixing Shared Task (MixMT) | Faheem Kirefu, Vivek Iyer, Pinzhen Chen, Laurie Burchell | cs.CL | 2022-10-20 |
Image Semantic Relation Generation | Mingzhe Du | cs.CV, cs.CL | 2022-10-19 |
NGEP: A Graph-based Event Planning Framework for Story Generation | Chen Tang, Zhihao Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin | cs.CL, cs.AI | 2022-10-19 |
Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective | Adaku Uchendu, Thai Le, Dongwon Lee | cs.CL, cs.LG | 2022-10-19 |
Team Flow at DRC2022: Pipeline System for Travel Destination Recommendation Task in Spoken Dialogue | Ryu Hirai, Atsumoto Ohashi, Ao Guo, Hideki Shiroma, Xulin Zhou, Yukihiko Tone, Shinya Iizuka, Ryuichiro Higashinaka | cs.CL, cs.AI, cs.RO | 2022-10-18 |
Table-To-Text generation and pre-training with TabT5 | Ewa Andrejczuk, Julian Martin Eisenschlos, Francesco Piccinno, Syrine Krichene, Yasemin Altun | cs.CL, cs.LG | 2022-10-17 |
Model Criticism for Long-Form Text Generation | Yuntian Deng, Volodymyr Kuleshov, Alexander M. Rush | cs.CL, cs.LG, stat.ML | 2022-10-16 |
LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue | Anthony Sicilia, Malihe Alikhani | cs.CL, cs.LG | 2022-10-14 |
Towards a Unified Multi-Dimensional Evaluator for Text Generation | Ming Zhong, Yang Liu, Da Yin, Yuning Mao, Yizhu Jiao, Pengfei Liu, Chenguang Zhu, Heng Ji, Jiawei Han | cs.CL | 2022-10-13 |
Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation | Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu | cs.CL, cs.AI | 2022-10-13 |
Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis | Siddharth Varia, Shuai Wang, Kishaloy Halder, Robert Vacareanu, Miguel Ballesteros, Yassine Benajiba, Neha Anna John, Rishita Anubhai, Smaranda Muresan, Dan Roth | cs.CL | 2022-10-12 |
DATScore: Evaluating Translation with Data Augmented Translations | Moussa Kamal Eddine, Guokan Shang, Michalis Vazirgiannis | cs.CL | 2022-10-12 |
ASDOT: Any-Shot Data-to-Text Generation with Pretrained Language Models | Jiannan Xiang, Zhengzhong Liu, Yucheng Zhou, Eric P. Xing, Zhiting Hu | cs.CL | 2022-10-09 |
FAST: Improving Controllability for Text Generation with Feedback Aware Self-Training | Junyi Chai, Reid Pryzant, Victor Ye Dong, Konstantin Golobokov, Chenguang Zhu, Yi Liu | cs.CL | 2022-10-06 |
A Distributional Lens for Multi-Aspect Controllable Text Generation | Yuxuan Gu, Xiaocheng Feng, Sicheng Ma, Lingyuan Zhang, Heng Gong, Bing Qin | cs.CL | 2022-10-06 |
Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics | Zihao Wang, Jiaheng Dou, Yong Zhang | cs.CL | 2022-10-05 |
CodeDSI: Differentiable Code Search | Usama Nadeem, Noah Ziems, Shaoen Wu | cs.SE, cs.IR | 2022-10-01 |
Calibrating Sequence likelihood Improves Conditional Language Generation | Yao Zhao, Misha Khalman, Rishabh Joshi, Shashi Narayan, Mohammad Saleh, Peter J. Liu | cs.CL | 2022-09-30 |
Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals | Piotr Mirowski, Kory W. Mathewson, Jaylen Pittman, Richard Evans | cs.HC, cs.CL | 2022-09-29 |
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing | Yanjun Gao, Dmitriy Dligach, Timothy Miller, John Caskey, Brihat Sharma, Matthew M Churpek, Majid Afshar | cs.CL, cs.AI | 2022-09-29 |
Informative Text Generation from Knowledge Triples | Zihao Fu, Yijiang River Dong, Lidong Bing, Wai Lam | cs.CL | 2022-09-26 |
Controllable Text Generation for Open-Domain Creativity and Fairness | Nanyun Peng | cs.CL, cs.AI | 2022-09-24 |
XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages | Shivprasad Sagare, Tushar Abhishek, Bhavyajeet Singh, Anubhav Sharma, Manish Gupta, Vasudeva Varma | cs.CL | 2022-09-22 |
Selective Token Generation for Few-shot Natural Language Generation | Daejin Jo, Taehwan Kwon, Eun-Sol Kim, Sungwoong Kim | cs.CL, cs.LG | 2022-09-17 |
Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning | Atsumoto Ohashi, Ryuichiro Higashinaka | cs.CL, cs.AI | 2022-09-16 |
TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations | Xinyang Zhang, Yury Malkov, Omar Florez, Serim Park, Brian McWilliams, Jiawei Han, Ahmed El-Kishky | cs.CL | 2022-09-15 |
Distribution Aware Metrics for Conditional Natural Language Generation | David M Chan, Yiming Ni, Austin Myers, Sudheendra Vijayanarasimhan, David A Ross, John Canny | cs.CL, cs.AI, cs.CV, cs.LG | 2022-09-15 |
vec2text with Round-Trip Translations | Geoffrey Cideron, Sertan Girgin, Anton Raichuk, Olivier Pietquin, Olivier Bachem, Léonard Hussenot | cs.CL, cs.LG | 2022-09-14 |
LibertyMFD: A Lexicon to Assess the Moral Foundation of Liberty | Oscar Araque, Lorenzo Gatti, Kyriaki Kalimeri | cs.CL | 2022-09-14 |
OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue | Zhi Chen, Yuncong Liu, Lu Chen, Su Zhu, Mengyue Wu, Kai Yu | cs.CL | 2022-09-10 |
Layer or Representation Space: What makes BERT-based Evaluation Metrics Robust? | Doan Nam Long Vu, Nafise Sadat Moosavi, Steffen Eger | cs.CL | 2022-09-06 |
Every picture tells a story: Image-grounded controllable stylistic story generation | Holy Lovenia, Bryan Wilie, Romain Barraud, Samuel Cahyawijaya, Willy Chung, Pascale Fung | cs.CL | 2022-09-04 |
Multi-Modal Experience Inspired AI Creation | Qian Cao, Xu Chen, Ruihua Song, Hao Jiang, Guang Yang, Zhao Cao | cs.AI | 2022-09-02 |
A Spanish dataset for Targeted Sentiment Analysis of political headlines | Tomás Alves Salgueiro, Emilio Recart Zapata, Damián Furman, Juan Manuel Pérez, Pablo Nicolás Fernández Larrosa | cs.CL | 2022-08-30 |
StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing | Xuekai Zhu, Jian Guan, Minlie Huang, Juan Liu | cs.CL | 2022-08-29 |
Nearest Neighbor Non-autoregressive Text Generation | Ayana Niwa, Sho Takase, Naoaki Okazaki | cs.CL | 2022-08-26 |
GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers | Hsien-Chin Lin, Christian Geishauser, Shutong Feng, Nurul Lubis, Carel van Niekerk, Michael Heck, Milica Gašić | cs.CL | 2022-08-23 |
Few-Shot Table-to-Text Generation with Prefix-Controlled Generator | Yutao Luo, Menghua Lu, Gongshen Liu, Shilin Wang | cs.CL | 2022-08-23 |
Automatic tagging of knowledge points for K12 math problems | Xiaolu Wang, Ziqi Ding, Liangyu Chen | cs.CL | 2022-08-21 |
Beyond Text Generation: Supporting Writers with Continuous Automatic Text Summaries | Hai Dang, Karim Benharrak, Florian Lehmann, Daniel Buschek | cs.HC, cs.CL, H.5.2; I.2.7 | 2022-08-19 |
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding | Zhaoye Fei, Yu Tian, Yongkang Wu, Xinyu Zhang, Yutao Zhu, Zheng Liu, Jiawen Wu, Dejiang Kong, Ruofei Lai, Zhao Cao, Zhicheng Dou, Xipeng Qiu | cs.CL | 2022-08-19 |
Performance Optimization for Semantic Communications: An Attention-based Reinforcement Learning Approach | Yining Wang, Mingzhe Chen, Tao Luo, Walid Saad, Dusit Niyato, H. Vincent Poor, Shuguang Cui | cs.IT, cs.AI, math.IT | 2022-08-17 |
High Recall Data-to-text Generation with Progressive Edit | Choonghan Kim, Gary Geunbae Lee | cs.CL, cs.AI | 2022-08-09 |
Suggestion Lists vs. Continuous Generation: Interaction Design for Writing with Generative Models on Mobile Devices Affect Text Length, Wording and Perceived Authorship | Florian Lehmann, Niklas Markert, Hai Dang, Daniel Buschek | cs.HC, cs.AI | 2022-08-01 |
LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection | Zhuo Chen, Yufeng Huang, Jiaoyan Chen, Yuxia Geng, Yin Fang, Jeff Pan, Ningyu Zhang, Wen Zhang | cs.CV, cs.AI | 2022-07-26 |
Innovations in Neural Data-to-text Generation | Mandar Sharma, Ajay Gogineni, Naren Ramakrishnan | cs.CL | 2022-07-25 |
Leveraging Natural Supervision for Language Representation Learning and Generation | Mingda Chen | cs.CL | 2022-07-21 |
Neural Data-to-Text Generation Based on Small Datasets: Comparing the Added Value of Two Semi-Supervised Learning Approaches on Top of a Large Language Model | Chris van der Lee, Thiago Castro Ferreira, Chris Emmery, Travis Wiltshire, Emiel Krahmer | cs.CL | 2022-07-14 |
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation | Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie | cs.CL | 2022-07-13 |
Towards Multimodal Vision-Language Models Generating Non-Generic Text | Wes Robbins, Zanyar Zohourianshahzadi, Jugal Kalita | cs.CV, cs.AI | 2022-07-09 |
TalkToModel: Understanding Machine Learning Models With Open Ended Dialogues | Dylan Slack, Satyapriya Krishna, Himabindu Lakkaraju, Sameer Singh | cs.LG, cs.AI, cs.CL | 2022-07-08 |
Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk | Benyou Wang, Xiangbo Wu, Xiaokang Liu, Jianquan Li, Prayag Tiwari, Qianqian Xie | cs.CL | 2022-07-02 |
Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency | Jin Liu, Chongfeng Fan, Fengyu Zhou, Huijuan Xu | cs.AI | 2022-07-02 |
Mapping the Design Space of Human-AI Interaction in Text Summarization | Ruijia Cheng, Alison Smith-Renner, Ke Zhang, Joel R. Tetreault, Alejandro Jaimes | cs.HC | 2022-06-29 |
Joint Generator-Ranker Learning for Natural Language Generation | Weizhou Shen, Yeyun Gong, Yelong Shen, Song Wang, Xiaojun Quan, Nan Duan, Weizhu Chen | cs.CL | 2022-06-28 |
Megapixel Image Generation with Step-Unrolled Denoising Autoencoders | Alex F. McKinney, Chris G. Willcocks | cs.CV, cs.LG | 2022-06-24 |
MVP: Multi-task Supervised Pre-training for Natural Language Generation | Tianyi Tang, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen | cs.CL | 2022-06-24 |
Comparing informativeness of an NLG chatbot vs graphical app in diet-information domain | Simone Balloccu, Ehud Reiter | cs.CL, cs.AI | 2022-06-23 |
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code | Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, Genta Indra Winata, Hendrik Strobelt, Hiroaki Hayashi, Jekaterina Novikova, Jenna Kanerva, Jenny Chim, Jiawei Zhou, Jordan Clive, Joshua Maynez, João Sedoc, Juraj Juraska, Kaustubh Dhole, Khyathi Raghavi Chandu, Laura Perez-Beltrachini, Leonardo F. R. Ribeiro, Lewis Tunstall, Li Zhang, Mahima Pushkarna, Mathias Creutz, Michael White, Mihir Sanjay Kale, Moussa Kamal Eddine, Nico Daheim, Nishant Subramani, Ondrej Dusek, Paul Pu Liang, Pawan Sasanka Ammanamanchi, Qi Zhu, Ratish Puduppully, Reno Kriz, Rifat Shahriyar, Ronald Cardenas, Saad Mahamood, Salomey Osei, Samuel Cahyawijaya, Sanja Štajner, Sebastien Montella, Shailza, Shailza Jolly, Simon Mille, Tahmid Hasan, Tianhao Shen, Tosin Adewumi, Vikas Raunak, Vipul Raheja, Vitaly Nikolaev, Vivian Tsai, Yacine Jernite, Ying Xu, Yisi Sang, Yixin Liu, Yufang Hou | cs.CL, cs.AI, cs.LG | 2022-06-22 |
BenchCLAMP: A Benchmark for Evaluating Language Models on Semantic Parsing | Subhro Roy, Sam Thomson, Tongfei Chen, Richard Shin, Adam Pauls, Jason Eisner, Benjamin Van Durme | cs.CL | 2022-06-21 |
Prefix Language Models are Unified Modal Learners | Shizhe Diao, Wangchunshu Zhou, Xinsong Zhang, Jiawei Wang | cs.CV, cs.CL, cs.LG | 2022-06-15 |
A Benchmark for Federated Hetero-Task Learning | Liuyi Yao, Dawei Gao, Zhen Wang, Yuexiang Xie, Weirui Kuang, Daoyuan Chen, Haohui Wang, Chenhe Dong, Bolin Ding, Yaliang Li | cs.LG | 2022-06-07 |
DeepCAVE: An Interactive Analysis Tool for Automated Machine Learning | René Sass, Eddie Bergman, André Biedenkapp, Frank Hutter, Marius Lindauer | cs.LG | 2022-06-07 |
Plot Writing From Pre-Trained Language Models | Yiping Jin, Vishakha Kadam, Dittaya Wanvarie | cs.CL | 2022-06-07 |
Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation | Pei Ke, Haozhe Ji, Zhenyu Yang, Yi Huang, Junlan Feng, Xiaoyan Zhu, Minlie Huang | cs.CL | 2022-06-06 |
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation | Jin Xu, Xiaojiang Liu, Jianhao Yan, Deng Cai, Huayang Li, Jian Li | cs.CL | 2022-06-06 |
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech | Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye | eess.AS, cs.CL, cs.SD | 2022-06-05 |
CoNT: Contrastive Neural Text Generation | Chenxin An, Jiangtao Feng, Kai Lv, Lingpeng Kong, Xipeng Qiu, Xuanjing Huang | cs.CL | 2022-05-29 |
Controllable Text Generation with Neurally-Decomposed Oracle | Tao Meng, Sidi Lu, Nanyun Peng, Kai-Wei Chang | cs.CL | 2022-05-27 |
Diffusion-LM Improves Controllable Text Generation | Xiang Lisa Li, John Thickstun, Ishaan Gulrajani, Percy Liang, Tatsunori B. Hashimoto | cs.CL, cs.AI, cs.LG | 2022-05-27 |
Revisiting Generative Commonsense Reasoning: A Pre-Ordering Approach | Chao Zhao, Faeze Brahman, Tenghao Huang, Snigdha Chaturvedi | cs.CL | 2022-05-26 |
Automatic question generation based on sentence structure analysis using machine learning approach | Miroslav Blšták, Viera Rozinajová | cs.CL, cs.AI | 2022-05-25 |
PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation | Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, Dongmei Zhang | cs.CL | 2022-05-25 |