written on 2024-11-28
title | authors | categories | displaydate |
---|---|---|---|
Can LLMs be Good Graph Judger for Knowledge Graph Construction? | Haoyu Huang, Chong Chen, Conghui He, Yang Li, Jiawei Jiang, Wentao Zhang | cs.CL, cs.AI | 2024-11-26 |
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models | Yao Fu, Yin Yu, Xiaotian Han, Runchao Li, Xianxuan Long, Haotian Yu, Pan Li | cs.CL | 2024-11-25 |
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model | Yifan Wu, Min Zeng, Yang Li, Yang Zhang, Min Li | cs.ET, cs.CL | 2024-11-23 |
Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains | Yurii Paniv, Artur Kiulian, Dmytro Chaplynskyi, Mykola Khandoga, Anton Polishko, Tetiana Bas, Guillermo Gabrielli | cs.CL | 2024-11-22 |
Knowledge Graphs, Large Language Models, and Hallucinations: An NLP Perspective | Ernests Lavrinovics, Russa Biswas, Johannes Bjerva, Katja Hose | cs.CL, cs.AI, 68-02, I.2.7 | 2024-11-21 |
Robust Detection of Watermarks for Large Language Models Under Human Edits | Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J. Su | cs.LG, cs.CL, math.ST, stat.ME, stat.ML, stat.TH | 2024-11-21 |
From Text to Pose to Image: Improving Diffusion Model Control and Quality | Clément Bonnet, Ariel N. Lee, Franck Wertel, Antoine Tamano, Tanguy Cizain, Pablo Ducru | cs.CV, cs.AI, cs.LG | 2024-11-19 |
A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation | Jiajing Chen, Shuo Wang, Zhen Qi, Zhenhong Zhang, Chihang Wang, Hongye Zheng | cs.CL | 2024-11-19 |
FedCoLLM: A Parameter-Efficient Federated Co-tuning Framework for Large and Small Language Models | Tao Fan, Yan Kang, Guoqiang Ma, Lixin Fan, Kai Chen, Qiang Yang | cs.CL, cs.AI | 2024-11-18 |
CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization | Nay Myat Min, Long H. Pham, Yige Li, Jun Sun | cs.CL, cs.AI, cs.LG | 2024-11-18 |
SEFD: Semantic-Enhanced Framework for Detecting LLM-Generated Text | Weiqing He, Bojian Hou, Tianqi Shang, Davoud Ataee Tarzanagh, Qi Long, Li Shen | cs.CL, cs.AI, cs.IR | 2024-11-17 |
LLM Hallucination Reasoning with Zero-shot Knowledge Test | Seongmin Lee, Hsiang Hsu, Chun-Fu Chen | cs.AI, cs.CL | 2024-11-14 |
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models | Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler, Xiaohui Zeng | cs.LG, cs.AI, cs.CL, cs.CV, 68T05, I.3.5; I.2.10; I.2.6 | 2024-11-14 |
Asterisk*: Keep it Simple | Andrew Semenov | cs.CL, cs.AI | 2024-11-08 |
Ev2R: Evaluating Evidence Retrieval in Automated Fact-Checking | Mubashara Akhtar, Michael Schlichtkrull, Andreas Vlachos | cs.CL, cs.AI, cs.IR, cs.LG | 2024-11-08 |
PadChest-GR: A Bilingual Chest X-ray Dataset for Grounded Radiology Report Generation | Daniel C. Castro, Aurelia Bustos, Shruthi Bannur, Stephanie L. Hyland, Kenza Bouzid, Maria Teodora Wetscherek, Maria Dolores Sánchez-Valverde, Lara Jaques-Pérez, Lourdes Pérez-Rodríguez, Kenji Takeda, José María Salinas, Javier Alvarez-Valle, Joaquín Galant Herrero, Antonio Pertusa | cs.AI, cs.CL, cs.CV | 2024-11-07 |
GASE: Generatively Augmented Sentence Encoding | Manuel Frank, Haithem Afli | cs.CL | 2024-11-07 |
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models | Ming Cheng, Jiaying Gong, Chenhan Yuan, William A. Ingram, Edward Fox, Hoda Eldardiry | cs.CL, cs.DL, cs.LG | 2024-11-07 |
RAGulator: Lightweight Out-of-Context Detectors for Grounded Text Generation | Ian Poey, Jiajun Liu, Qishuai Zhong, Adrien Chenailler | cs.CL | 2024-11-06 |
Advanced RAG Models with Graph Structures: Optimizing Complex Knowledge Reasoning and Text Generation | Yuxin Dong, Shuo Wang, Hongye Zheng, Jiajing Chen, Zhenhong Zhang, Chihang Wang | cs.IR | 2024-11-06 |
Graph-DPEP: Decomposed Plug and Ensemble Play for Few-Shot Document Relation Extraction with Graph-of-Thoughts Reasoning | Tao Zhang, Ning Yan, Masood Mortazavi, Hoang H. Nguyen, Zhongfen Deng, Philip S. Yu | cs.CL, cs.AI, cs.IR | 2024-11-05 |
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness | Fali Wang, Zhiwei Zhang, Xianren Zhang, Zongyu Wu, Tzuhao Mo, Qiuhao Lu, Wanjing Wang, Rui Li, Junjie Xu, Xianfeng Tang, Qi He, Yao Ma, Ming Huang, Suhang Wang | cs.CL, cs.AI, cs.LG, 68T50 (Primary) 68T07 (Secondary), I.2.7 | 2024-11-04 |
E2E-AFG: An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation | Yun Jiang, Zilong Xie, Wei Zhang, Yun Fang, Shuai Pan | cs.CL, cs.AI | 2024-11-01 |
Enhancing Authorship Attribution through Embedding Fusion: A Novel Approach with Masked and Encoder-Decoder Language Models | Arjun Ramesh Kaushik, Sunil Rufus R P, Nalini Ratha | cs.CL, cs.LG | 2024-11-01 |
Language Models can Self-Lengthen to Generate Long Texts | Shanghaoran Quan, Tianyi Tang, Bowen Yu, An Yang, Dayiheng Liu, Bofei Gao, Jianhong Tu, Yichang Zhang, Jingren Zhou, Junyang Lin | cs.CL | 2024-10-31 |
Mind the Gap: A Generalized Approach for Cross-Modal Embedding Alignment | Arihan Yadav, Alan McMillan | cs.LG, cs.CL, cs.IR, H.3.3; I.2.7; I.2.6 | 2024-10-30 |
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies | Luping Wang, Sheng Chen, Linnan Jiang, Shu Pan, Runze Cai, Sen Yang, Fei Yang | cs.CL, cs.AI, cs.LG | 2024-10-24 |
Dynamic and Textual Graph Generation Via Large-Scale LLM-based Agent Simulation | Jiarui Ji, Runlin Lei, Jialing Bi, Zhewei Wei, Yankai Lin, Xuchen Pan, Yaliang Li, Bolin Ding | cs.CL | 2024-10-13 |
Enhancing elusive clues in knowledge learning by contrasting attention of language models | Jian Gao, Xiao Zhang, Ji Wu, Miao Li | cs.AI | 2024-09-26 |
Zero- and Few-shot Named Entity Recognition and Text Expansion in Medication Prescriptions using ChatGPT | Natthanaphop Isaradech, Andrea Riedel, Wachiranun Sirikul, Markus Kreuzthaler, Stefan Schulz | cs.CL, cs.AI | 2024-09-26 |
Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness | Shixuan Ma, Quan Wang | cs.CL | 2024-09-25 |
Probing Omissions and Distortions in Transformer-based RDF-to-Text Models | Juliette Faille, Albert Gatt, Claire Gardent | cs.CL | 2024-09-25 |
ESPERANTO: Evaluating Synthesized Phrases to Enhance Robustness in AI Detection for Text Origination | Navid Ayoobi, Lily Knab, Wen Cheng, David Pantoja, Hamidreza Alikhani, Sylvain Flamant, Jin Kim, Arjun Mukherjee | cs.CL, cs.AI | 2024-09-22 |
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning | Daniele Rege Cambrin, Giuseppe Gallipoli, Irene Benedetto, Luca Cagliero, Paolo Garza | cs.CL, cs.CV | 2024-09-20 |
Leveraging Knowledge Graphs and LLMs to Support and Monitor Legislative Systems | Andrea Colombo | cs.DB, cs.AI | 2024-09-20 |
A Multiple-Fill-in-the-Blank Exam Approach for Enhancing Zero-Resource Hallucination Detection in Large Language Models | Satoshi Munakata, Taku Fukui, Takao Mohri | cs.CL, cs.AI, 68T50, F.2.2; I.2.7 | 2024-09-20 |
THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Mengfei Liang, Archish Arun, Zekun Wu, Cristian Munoz, Jonathan Lutch, Emre Kazim, Adriano Koshiyama, Philip Treleaven | cs.CL | 2024-09-17 |
Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling | Xinyue Fang, Zhen Huang, Zhiliang Tian, Minghui Fang, Ziyi Pan, Quntian Fang, Zhihua Wen, Hengyue Pan, Dongsheng Li | cs.CL, cs.AI | 2024-09-17 |
MGSA: Multi-Granularity Graph Structure Attention for Knowledge Graph-to-Text Generation | Shanshan Wang, Chun Zhang, Ning Zhang | cs.CL, cs.AI | 2024-09-16 |
Spatio-Temporal Context Prompting for Zero-Shot Action Detection | Wei-Jhe Huang, Min-Hung Chen, Shang-Hong Lai | cs.CV, cs.AI | 2024-08-28 |
DIAGen: Diverse Image Augmentation with Generative Models | Tobias Lingenberg, Markus Reuter, Gopika Sudhakaran, Dominik Gojny, Stefan Roth, Simone Schaub-Meyer | cs.CV, cs.AI | 2024-08-26 |
Biomedical Large Languages Models Seem not to be Superior to Generalist Models on Unseen Medical Data | Felix J. Dorfner, Amin Dada, Felix Busch, Marcus R. Makowski, Tianyu Han, Daniel Truhn, Jens Kleesiek, Madhumita Sushil, Jacqueline Lammert, Lisa C. Adams, Keno K. Bressem | cs.CL | 2024-08-25 |
MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents | Congchi Yin, Feng Li, Shu Zhang, Zike Wang, Jun Shao, Piji Li, Jianhua Chen, Xun Jiang | cs.CL, cs.AI | 2024-08-22 |
Analysis of Plan-based Retrieval for Grounded Text Generation | Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer | cs.CL, cs.IR | 2024-08-20 |
DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models | Yuanhao Zeng, Fei Ren, Xinpeng Zhou, Yihang Wang, Yingxia Shao | cs.AI, cs.CL | 2024-08-19 |
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang, Li Shen, Yong Luo, Shuai Xie, Han Hu, Lefei Zhang, Bo Du, Dacheng Tao | cs.LG, cs.AI | 2024-08-19 |
Rhyme-aware Chinese lyric generator based on GPT | Yixiao Yuan, Yangchen Huang, Yu Ma, Xinjin Li, Zhenglin Li, Yiming Shi, Huapeng Zhou | cs.CL, cs.AI | 2024-08-19 |
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Yizhang Jin, Jian Li, Jiangning Zhang, Jianlong Hu, Zhenye Gan, Xin Tan, Yong Liu, Yabiao Wang, Chengjie Wang, Lizhuang Ma | cs.CV, cs.AI | 2024-08-09 |
Cool-Fusion: Fuse Large Language Models without Training | Cong Liu, Xiaojun Quan, Yan Pan, Liang Lin, Weigang Wu, Xu Chen | cs.CL | 2024-07-29 |
Positive Text Reframing under Multi-strategy Optimization | Shutong Jia, Biwei Cao, Qingqing Gao, Jiuxin Cao, Bo Liu | cs.CL, cs.AI | 2024-07-25 |
The Power of Combining Data and Knowledge: GPT-4o is an Effective Interpreter of Machine Learning Models in Predicting Lymph Node Metastasis of Lung Cancer | Danqing Hu, Bing Liu, Xiaofeng Zhu, Nan Wu | cs.CL, cs.LG | 2024-07-25 |
Are Large Language Models Possible to Conduct Cognitive Behavioral Therapy? | Hao Shen, Zihan Li, Minqiang Yang, Minghui Ni, Yongfeng Tao, Zhengyang Yu, Weihao Zheng, Chen Xu, Bin Hu | cs.CL | 2024-07-25 |
Generative artificial intelligence in dentistry: Current approaches and future challenges | Fabián Villena, Claudia Véliz, Rosario García-Huidobro, Sebastián Aguayo | cs.CL | 2024-07-24 |
Robust Privacy Amidst Innovation with Large Language Models Through a Critical Assessment of the Risks | Yao-Shun Chuang, Atiquer Rahman Sarkar, Noman Mohammed, Xiaoqian Jiang | cs.CL | 2024-07-23 |
Finetuning Generative Large Language Models with Discrimination Instructions for Knowledge Graph Completion | Yang Liu, Xiaobin Tian, Zequn Sun, Wei Hu | cs.CL, cs.AI | 2024-07-23 |
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation | Yuhang Bai, Zichuan Huang, Wenshuo Gao, Shuai Yang, Jiaying Liu | cs.CV, cs.AI, cs.GR | 2024-07-20 |
Check-Eval: A Checklist-based Approach for Evaluating Text Quality | Jayr Pereira, Roberto Lotufo | cs.CL, cs.AI | 2024-07-19 |
From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards | Nicole Sultanum, Vidya Setlur | cs.HC | 2024-07-19 |
KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models | Kemou Jiang, Xuan Cai, Zhiyong Cui, Aoyong Li, Yilong Ren, Haiyang Yu, Hao Yang, Daocheng Fu, Licheng Wen, Pinlong Cai | cs.AI | 2024-07-19 |
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization | Md Sultan Al Nahian, Ramakanth Kavuluru | cs.IR, cs.CL, cs.LG | 2024-07-19 |
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Zhiting Fan, Ruizhe Chen, Ruiling Xu, Zuozhu Liu | cs.CL | 2024-07-14 |
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text | Lucio La Cava, Davide Costa, Andrea Tagarelli | cs.CL, cs.AI, cs.CY, cs.HC, physics.soc-ph | 2024-07-12 |
Who Writes the Review, Human or AI? | Panagiotis C. Theocharopoulos, Spiros V. Georgakopoulos, Sotiris K. Tasoulis, Vassilis P. Plagianakos | cs.CL | 2024-05-30 |
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities | Alexander Nikitin, Jannik Kossen, Yarin Gal, Pekka Marttinen | cs.LG, cs.AI, cs.CL | 2024-05-30 |
Large Language Model Watermark Stealing With Mixed Integer Programming | Zhaoxi Zhang, Xiaomei Zhang, Yanjun Zhang, Leo Yu Zhang, Chao Chen, Shengshan Hu, Asif Gill, Shirui Pan | cs.CR, cs.AI | 2024-05-30 |
WRDScore: New Metric for Evaluation of Natural Language Generation Models | Ravil Mussabayev | cs.CL, cs.AI | 2024-05-29 |
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification | Laura Fieback, Jakob Spiegelberg, Hanno Gottschalk | cs.CV, cs.CL, cs.LG, I.4 | 2024-05-29 |
Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation | Omar Moured, Shahid Ali Farooqui, Karin Muller, Sharifeh Fadaeijouybari, Thorsten Schwarz, Mohammed Javed, Rainer Stiefelhagen | cs.CV, cs.HC | 2024-05-29 |
LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models | Qin Yang, Meisam Mohammad, Han Wang, Ali Payani, Ashish Kundu, Kai Shu, Yan Yan, Yuan Hong | cs.CR, cs.CL, cs.LG | 2024-05-29 |
Augmenting Textual Generation via Topology Aware Retrieval | Yu Wang, Nedim Lipka, Ruiyi Zhang, Alexa Siu, Yuying Zhao, Bo Ni, Xin Wang, Ryan Rossi, Tyler Derr | cs.IR | 2024-05-27 |
A Library for Automatic Natural Language Generation of Spanish Texts | Silvia García-Méndez, Milagros Fernández-Gavilanes, Enrique Costa-Montenegro, Jonathan Juncal-Martínez, F. Javier González-Castaño | cs.CL | 2024-05-27 |
On the Noise Robustness of In-Context Learning for Text Generation | Hongfu Gao, Feipeng Zhang, Wenyu Jiang, Jun Shu, Feng Zheng, Hongxin Wei | cs.CL, cs.LG | 2024-05-27 |
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text | Han Yu, Peikun Guo, Akane Sano | eess.SP, cs.AI | 2024-05-26 |
Certifiably Robust RAG against Retrieval Corruption | Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal | cs.LG, cs.CL, cs.CR | 2024-05-24 |
Linearly Controlled Language Generation with Performative Guarantees | Emily Cheng, Marco Baroni, Carmen Amo Alonso | cs.CL, cs.SY, eess.SY | 2024-05-24 |
Evaluating Consistency and Reasoning Capabilities of Large Language Models | Yash Saxena, Sarthak Chopra, Arunendra Mani Tripathi | cs.CL, cs.AI | 2024-04-25 |
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration | Dimitrios Michael Manias, Ali Chouman, Abdallah Shami | cs.NI, cs.AI | 2024-04-24 |
Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang, Donghyun Kim, Zihang Meng, Dat Huynh, Ser-Nam Lim | cs.CV, cs.AI | 2024-04-23 |
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots | Dongge Han, Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Peter Bell, Amos Storkey | cs.RO, cs.AI | 2024-04-22 |
Context-Enhanced Language Models for Generating Multi-Paper Citations | Avinash Anand, Kritarth Prasad, Ujjwal Goel, Mohit Gupta, Naman Lal, Astha Verma, Rajiv Ratn Shah | cs.CL | 2024-04-22 |
Filtered Direct Preference Optimization | Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu | cs.LG, cs.AI, cs.CL | 2024-04-22 |
Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation | Lasal Jayawardena, Prasan Yapa | cs.CL, cs.AI, cs.LG | 2024-04-19 |
Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng, Jingbo Shang | cs.CL | 2024-04-16 |
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu Sun | cs.AI, cs.CL, cs.CV | 2024-04-16 |
Generative Text Steganography with Large Language Model | Jiaxuan Wu, Zhengxian Wu, Yiming Xue, Juan Wen, Wanli Peng | cs.CL | 2024-04-16 |
Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks | Xiao Zhang, Chunliu Wang, Rik van Noord, Johan Bos | cs.CL | 2024-04-12 |
Adapting LLMs for Efficient Context Processing through Soft Prompt Compression | Cangqing Wang, Yutian Yang, Ruisi Li, Dan Sun, Ruicong Cai, Yuzhu Zhang, Chengqian Fu, Lillian Floyd | cs.LG, cs.AI, cs.CL | 2024-04-07 |
ToXCL: A Unified Framework for Toxic Speech Detection and Explanation | Nhat M. Hoang, Xuan Long Do, Duc Anh Do, Duc Anh Vu, Luu Anh Tuan | cs.CL, cs.CY | 2024-03-25 |
Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation | Zhenrui Yue, Huimin Zeng, Yimeng Lu, Lanyu Shang, Yang Zhang, Dong Wang | cs.CL, cs.AI | 2024-03-22 |
A Closer Look at Claim Decomposition | Miriam Wanner, Seth Ebner, Zhengping Jiang, Mark Dredze, Benjamin Van Durme | cs.CL | 2024-03-18 |
GPT-4 as Evaluator: Evaluating Large Language Models on Pest Management in Agriculture | Shanglong Yang, Zhipeng Yuan, Shunbao Li, Ruoling Peng, Kang Liu, Po Yang | cs.CL | 2024-03-18 |
Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovič, Michael Färber | cs.CL | 2024-03-18 |
Reinforcement Learning with Token-level Feedback for Controllable Text Generation | Wendi Li, Wei Wei, Kaihe Xu, Wenfeng Xie, Dangyang Chen, Yu Cheng | cs.CL, cs.AI | 2024-03-18 |
ProSwitch: Knowledge-Guided Language Model Fine-Tuning to Generate Professional and Non-Professional Styled Text | Chang Zong, Yuyan Chen, Weiming Lu, Jian Shao, Yueting Zhuang | cs.CL, cs.AI, 68T50, I.2.7 | 2024-03-14 |
Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts | Tian Yu, Shaolei Zhang, Yang Feng | cs.CL | 2024-03-12 |
Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs | Tianqing Fang, Zeming Chen, Yangqiu Song, Antoine Bosselut | cs.CL, cs.AI | 2024-03-12 |
Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning | Mark D. McDonnell, Dong Gong, Ehsan Abbasnejad, Anton van den Hengel | cs.CV, cs.LG | 2024-03-12 |
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews | Weixin Liang, Zachary Izzo, Yaohui Zhang, Haley Lepp, Hancheng Cao, Xuandong Zhao, Lingjiao Chen, Haotian Ye, Sheng Liu, Zhi Huang, Daniel A. McFarland, James Y. Zou | cs.CL, cs.AI, cs.LG, cs.SI, I.2.7 | 2024-03-11 |
One Category One Prompt: Dataset Distillation using Diffusion Models | Ali Abbasi, Ashkan Shahbazi, Hamed Pirsiavash, Soheil Kolouri | cs.CV, cs.CL, cs.LG | 2024-03-11 |
Narrating Causal Graphs with Large Language Models | Atharva Phatak, Vijay K. Mago, Ameeta Agrawal, Aravind Inbasekaran, Philippe J. Giabbanelli | cs.CL | 2024-03-11 |
Evolving Knowledge Distillation with Large Language Models and Active Learning | Chengyuan Liu, Yangyang Kang, Fubang Zhao, Kun Kuang, Zhuoren Jiang, Changlong Sun, Fei Wu | cs.CL | 2024-03-11 |
Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the “general” audience | Tal August, Kyle Lo, Noah A. Smith, Katharina Reinecke | cs.HC | 2024-03-08 |
KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts | Adam Coscia, Alex Endert | cs.HC, cs.AI, cs.CY, cs.LG | 2024-03-07 |
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification | Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov | cs.CL, cs.AI, cs.LG | 2024-03-07 |
Enhancing Court View Generation with Knowledge Injection and Guidance | Ang Li, Yiquan Wu, Yifei Liu, Fei Wu, Ming Cai, Kun Kuang | cs.AI | 2024-03-07 |
Advancing Biomedical Text Mining with Community Challenges | Hui Zong, Rongrong Wu, Jiaxue Cha, Erman Wu, Jiakun Li, Liang Tao, Zuofeng Li, Buzhou Tang, Bairong Shen | cs.AI, cs.CL, cs.LG | 2024-03-07 |
Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models | Martin Riddell, Ansong Ni, Arman Cohan | cs.SE, cs.CL, cs.LG | 2024-03-06 |
DECIDER: A Rule-Controllable Decoding Strategy for Language Generation by Imitating Dual-System Cognitive Theory | Chen Xu, Tian Lan, Changlong Yu, Wei Wang, Jun Gao, Yu Ji, Qunxi Dong, Kun Qian, Piji Li, Wei Bi, Bin Hu | cs.CL, cs.AI, cs.LO | 2024-03-04 |
Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs | Raghavv Goel, Mukul Gagrani, Wonseok Jeon, Junyoung Park, Mingu Lee, Christopher Lott | cs.LG, cs.AI, cs.CL | 2024-02-29 |
Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models | Mingjia Huo, Sai Ashish Somayajula, Youwei Liang, Ruisi Zhang, Farinaz Koushanfar, Pengtao Xie | cs.LG, cs.CL, cs.CR | 2024-02-28 |
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples | Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee | cs.CV, cs.AI, cs.CL, cs.LG | 2024-02-20 |
Adaptive Text Watermark for Large Language Models | Yepeng Liu, Yuheng Bu | cs.CL | 2024-01-25 |
Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction | Qi Sun, Kun Huang, Xiaocui Yang, Rong Tong, Kun Zhang, Soujanya Poria | cs.CL | 2024-01-24 |
Fine-grained Contract NER using instruction based model | Hiranmai Sri Adibhatla, Pavan Baswani, Manish Shrivastava | cs.IR | 2024-01-24 |
MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, ASR Error Detection, and ASR Error Correction | Jiajun He, Xiaohan Shi, Xingfeng Li, Tomoki Toda | cs.CL, cs.MM, cs.SD, eess.AS | 2024-01-24 |
IndiText Boost: Text Augmentation for Low Resource India Languages | Onkar Litake, Niraj Yagnik, Shreyas Labhsetwar | cs.CL, cs.AI, cs.LG | 2024-01-23 |
Unsupervised Learning of Graph from Recipes | Aissatou Diallo, Antonis Bikakis, Luke Dickens, Anthony Hunter, Rob Miller | cs.CL | 2024-01-22 |
With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation | Y. Wang, D. Ma, D. Cai | cs.CL, cs.AI | 2024-01-21 |
Large Language Models for Scientific Information Extraction: An Empirical Study for Virology | Mahsa Shamsabadi, Jennifer D’Souza, Sören Auer | cs.CL, cs.AI, cs.DL, cs.IT, math.IT | 2024-01-18 |
Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Xingyu Wu, Sheng-hao Wu, Jibin Wu, Liang Feng, Kay Chen Tan | cs.NE, cs.AI, cs.CL | 2024-01-18 |
Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Models | Tassilo Klein, Moin Nabi | cs.CL, cs.LG | 2024-01-16 |
LLMs for Test Input Generation for Semantic Caches | Zafaryab Rasool, Scott Barnett, David Willie, Stefanus Kurniawan, Sherwin Balugo, Srikanth Thudumu, Mohamed Abdelrazek | cs.SE, cs.AI | 2024-01-16 |
Leveraging External Knowledge Resources to Enable Domain-Specific Comprehension | Saptarshi Sengupta, Connor Heaton, Prasenjit Mitra, Soumalya Sarkar | cs.CL | 2024-01-15 |
Automating Knowledge Acquisition for Content-Centric Cognitive Agents Using LLMs | Sanjay Oruganti, Sergei Nirenburg, Jesse English, Marjorie McShane | cs.CL, cs.AI | 2023-12-27 |
Zur Darstellung eines mehrstufigen Prototypbegriffs in der multilingualen automatischen Sprachgenerierung: vom Korpus über word embeddings bis hin zum automatischen Wörterbuch | María José Domínguez Vázquez | cs.CL | 2023-12-26 |
Making Large Language Models A Better Foundation For Dense Retrieval | Chaofan Li, Zheng Liu, Shitao Xiao, Yingxia Shao | cs.CL | 2023-12-24 |
Exploiting Novel GPT-4 APIs | Kellin Pelrine, Mohammad Taufeeque, Michał Zając, Euan McLean, Adam Gleave | cs.CR, cs.AI, cs.CL, cs.LG, I.2.7 | 2023-12-21 |
LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces | Yingji Zhang, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas | cs.CL | 2023-12-20 |
Can ChatGPT be Your Personal Medical Assistant? | Md. Rafiul Biswas, Ashhadul Islam, Zubair Shah, Wajdi Zaghouani, Samir Brahim Belhaouari | cs.CL, cs.SI | 2023-12-19 |
External Knowledge Augmented Polyphone Disambiguation Using Large Language Model | Chen Li | cs.CL | 2023-12-19 |
Deep dive into language traits of AI-generated Abstracts | Vikas Kumar, Amisha Bharti, Devanshu Verma, Vasudha Bhatnagar | cs.CL, cs.LG | 2023-12-17 |
A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment Analysis | Jingyi Zhou, Jie Zhou, Jiabao Zhao, Siyin Wang, Haijun Shan, Gui Tao, Qi Zhang, Xuanjing Huang | cs.CL | 2023-12-16 |
CoAScore: Chain-of-Aspects Prompting for NLG Evaluation | Peiyuan Gong, Jiaxin Mao | cs.CL | 2023-12-16 |
GSQA: An End-to-End Model for Generative Spoken Question Answering | Min-Han Shih, Ho-Lam Chung, Yu-Chi Pai, Ming-Hao Hsu, Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee | cs.CL, cs.AI | 2023-12-15 |
Extending Context Window of Large Language Models via Semantic Compression | Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, Wei Han | cs.CL, cs.IT, math.IT | 2023-12-15 |
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection | Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin | cs.CL | 2023-12-14 |
Towards Optimal Statistical Watermarking | Baihe Huang, Banghua Zhu, Hanlin Zhu, Jason D. Lee, Jiantao Jiao, Michael I. Jordan | cs.LG, cs.CL, cs.CR, cs.IT, math.IT, stat.ML | 2023-12-13 |
Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning | Zaber Ibn Abdul Hakim, Najibul Haque Sarker, Rahul Pratap Singh, Bishmoy Paul, Ali Dabouei, Min Xu | cs.CV, cs.LG | 2023-12-10 |
An Attention-Based Denoising Framework for Personality Detection in Social Media Texts | Qirui Tang, Wenkang Jiang, Yihua Du, Lei Lin | cs.CY, cs.CL | 2023-11-16 |
X-Mark: Towards Lossless Watermarking Through Lexical Redundancy | Liang Chen, Yatao Bian, Yang Deng, Shuaiyi Li, Bingzhe Wu, Peilin Zhao, Kam-fai Wong | cs.CL | 2023-11-16 |
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text | Yanzhu Guo, Guokan Shang, Michalis Vazirgiannis, Chloé Clavel | cs.CL | 2023-11-16 |
Improving the Generation Quality of Watermarked Large Language Models via Word Importance Scoring | Yuhang Li, Yihan Wang, Zhouxing Shi, Cho-Jui Hsieh | cs.CL, cs.CR, cs.LG | 2023-11-16 |
Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation | Yifu Qiu, Varun Embar, Shay B. Cohen, Benjamin Han | cs.CL, cs.AI | 2023-11-16 |
GRIM: GRaph-based Interactive narrative visualization for gaMes | Jorge Leandro, Sudha Rao, Michael Xu, Weijia Xu, Nebosja Jojic, Chris Brockett, Bill Dolan | cs.CL | 2023-11-15 |
REST: Retrieval-Based Speculative Decoding | Zhenyu He, Zexuan Zhong, Tianle Cai, Jason D Lee, Di He | cs.CL, cs.AI, cs.IR, cs.LG | 2023-11-14 |
RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge | Yi Liu, Lianzhe Huang, Shicheng Li, Sishuo Chen, Hao Zhou, Fandong Meng, Jie Zhou, Xu Sun | cs.CL, cs.AI | 2023-11-14 |
Insights into Classifying and Mitigating LLMs’ Hallucinations | Alessandro Bruno, Pier Luigi Mazzeo, Aladine Chetouani, Marouane Tliba, Mohamed Amine Kerkouri | cs.CL | 2023-11-14 |
LLatrieval: LLM-Verified Retrieval for Verifiable Generation | Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu | cs.CL, cs.AI, cs.IR | 2023-11-14 |
AuthentiGPT: Detecting Machine-Generated Text via Black-Box Language Models Denoising | Zhen Guo, Shangdi Yu | cs.CL, cs.AI, cs.LG | 2023-11-13 |
Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor | Sangwon Yu, Changmin Lee, Hojin Lee, Sungroh Yoon | cs.CL | 2023-11-13 |
TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs | Shuyi Xie, Wenlin Yao, Yong Dai, Shaobo Wang, Donlin Zhou, Lifeng Jin, Xinhua Feng, Pengzhi Wei, Yujie Lin, Zhichao Hu, Dong Yu, Zhengyou Zhang, Jing Nie, Yuhong Liu | cs.CL, cs.AI | 2023-11-09 |
Aspects of human memory and Large Language Models | Romuald A. Janik | cs.CL, cs.AI, cs.LG, q-bio.NC | 2023-11-07 |
Automatic Logical Forms improve fidelity in Table-to-Text generation | Iñigo Alonso, Eneko Agirre | cs.CL | 2023-10-26 |
Knowledge Editing for Large Language Models: A Survey | Song Wang, Yaochen Zhu, Haochen Liu, Zaiyi Zheng, Chen Chen, Jundong Li | cs.CL, cs.AI | 2023-10-24 |
Woodpecker: Hallucination Correction for Multimodal Large Language Models | Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen | cs.CV, cs.AI, cs.CL, cs.LG | 2023-10-24 |
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs | Tomas Goldsack, Zhihao Zhang, Chen Tang, Carolina Scarton, Chenghua Lin | cs.CL | 2023-10-24 |
Let the Pretrained Language Models “Imagine” for Short Texts Topic Modeling | Pritom Saha Akash, Jie Huang, Kevin Chen-Chuan Chang | cs.CL | 2023-10-24 |
Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey | Soumya Suvra Ghosal, Souradip Chakraborty, Jonas Geiping, Furong Huang, Dinesh Manocha, Amrit Singh Bedi | cs.CL, cs.AI | 2023-10-23 |
Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Wei-Lin Chen, Cheng-Kuang Wu, Hsin-Hsi Chen, Chung-Chi Chen | cs.CL | 2023-10-23 |
HateRephrase: Zero- and Few-Shot Reduction of Hate Intensity in Online Posts using Large Language Models | Vibhor Agarwal, Yu Chen, Nishanth Sastry | cs.CL | 2023-10-21 |
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis | Philip John Gorinski, Matthieu Zimmer, Gerasimos Lampouras, Derrick Goh Xin Deik, Ignacio Iacobacci | cs.LG, cs.AI, cs.CL, cs.PL | 2023-10-20 |
Knowledge-Augmented Language Model Verification | Jinheon Baek, Soyeong Jeong, Minki Kang, Jong C. Park, Sung Ju Hwang | cs.CL, cs.LG | 2023-10-19 |
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter | Zhiyuan Liu, Sihang Li, Yanchen Luo, Hao Fei, Yixin Cao, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua | cs.CL, cs.MM | 2023-10-19 |
GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Yuanchun Shen, Ruotong Liao, Zhen Han, Yunpu Ma, Volker Tresp | cs.CL | 2023-10-12 |
DistillSpec: Improving Speculative Decoding via Knowledge Distillation | Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat, Aditya Krishna Menon, Afshin Rostamizadeh, Sanjiv Kumar, Jean-François Kagy, Rishabh Agarwal | cs.CL, cs.AI, cs.LG | 2023-10-12 |
CP-KGC: Constrained-Prompt Knowledge Graph Completion with Large Language Models | Rui Yang, Li Fang, Yi Zhou | cs.CL, cs.AI | 2023-10-12 |
Ziya-VL: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning | Junyu Lu, Dixiang Zhang, Xiaojun Wu, Xinyu Gao, Ruyi Gan, Jiaxing Zhang, Yan Song, Pingjian Zhang | cs.CL | 2023-10-12 |
Multimodal Graph Learning for Generative Tasks | Minji Yoon, Jing Yu Koh, Bryan Hooi, Ruslan Salakhutdinov | cs.AI | 2023-10-11 |
MatChat: A Large Language Model and Application Service Platform for Materials Science | Ziyi Chen, Fankai Xie, Meng Wan, Yang Yuan, Miao Liu, Zongguo Wang, Sheng Meng, Yangang Wang | cond-mat.mtrl-sci, cs.AI | 2023-10-11 |
The Temporal Structure of Language Processing in the Human Brain Corresponds to The Layered Hierarchy of Deep Language Models | Ariel Goldstein, Eric Ham, Mariano Schain, Samuel Nastase, Zaid Zada, Avigail Dabush, Bobbi Aubrey, Harshvardhan Gazula, Amir Feder, Werner K Doyle, Sasha Devore, Patricia Dugan, Daniel Friedman, Roi Reichart, Michael Brenner, Avinatan Hassidim, Orrin Devinsky, Adeen Flinker, Omer Levy, Uri Hasson | cs.CL, cs.AI, cs.LG, q-bio.NC | 2023-10-11 |
A Semantic Invariant Robust Watermark for Large Language Models | Aiwei Liu, Leyi Pan, Xuming Hu, Shiao Meng, Lijie Wen | cs.CR, cs.CL, 68T50, I.2.7 | 2023-10-10 |
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration | Rachneet Sachdeva, Martin Tutek, Iryna Gurevych | cs.CL | 2023-09-14 |
Semantic reconstruction of continuous language from MEG signals | Bo Wang, Xiran Xu, Longxiang Zhang, Boda Xiao, Xihong Wu, Jing Chen | cs.HC, eess.SP, q-bio.NC | 2023-09-14 |
Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity | Joseph Gatto, Omar Sharif, Parker Seegmiller, Philip Bohlman, Sarah Masud Preum | cs.CL | 2023-09-12 |
Unveiling the potential of large language models in generating semantic and cross-language clones | Palash R. Roy, Ajmain I. Alam, Farouq Al-omari, Banani Roy, Chanchal K. Roy, Kevin A. Schneider | cs.SE, cs.AI, cs.LG | 2023-09-12 |
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Wei Qi Leong, Jian Gang Ngui, Yosephine Susanto, Hamsawardhini Rengarajan, Kengatharaiyer Sarveswaran, William Chandra Tjhi | cs.CL | 2023-09-12 |
Neural Latent Geometry Search: Product Manifold Inference via Gromov-Hausdorff-Informed Bayesian Optimization | Haitz Saez de Ocariz Borde, Alvaro Arroyo, Ismael Morales, Ingmar Posner, Xiaowen Dong | cs.LG, stat.ML | 2023-09-09 |
Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation | Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz | cs.CL, cs.MM, cs.SD | 2023-09-06 |
Synthetic Text Generation using Hypergraph Representations | Natraj Raman, Sameena Shah | cs.CL, cs.AI | 2023-09-06 |
Persona-aware Generative Model for Code-mixed Language | Ayan Sengupta, Md Shad Akhtar, Tanmoy Chakraborty | cs.CL, cs.LG | 2023-09-06 |
HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus | Zhenpeng Su, Xing Wu, Wei Zhou, Guangyuan Ma, Songlin Hu | cs.CL, cs.AI | 2023-09-06 |
Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts | Hongyang Du, Guangyuan Liu, Dusit Niyato, Jiayi Zhang, Jiawen Kang, Zehui Xiong, Bo Ai, Dong In Kim | eess.IV, cs.LG, cs.NI | 2023-09-05 |
Optimizing Factual Accuracy in Text Generation through Dynamic Knowledge Selection | Hongjin Qian, Zhicheng Dou, Jiejun Tan, Haonan Chen, Haoqi Gu, Ruofei Lai, Xinyu Zhang, Zhao Cao, Ji-Rong Wen | cs.CL, cs.AI | 2023-08-30 |
Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP | Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang | cs.CL, cs.AI, cs.CV | 2023-08-27 |
Planning with Logical Graph-based Language Model for Instruction Generation | Fan Zhang, Kebing Jin, Hankz Hankui Zhuo | cs.CL, cs.AI | 2023-08-26 |
GeoExplainer: A Visual Analytics Framework for Spatial Modeling Contextualization and Report Generation | Fan Lei, Yuxin Ma, Stewart Fotheringham, Elizabeth Mack, Ziqi Li, Mehak Sachdeva, Sarah Bardin, Ross Maciejewski | cs.HC, cs.LG | 2023-08-25 |
ARTIST: ARTificial Intelligence for Simplified Text | Lorenzo Corti, Jie Yang | cs.CL | 2023-08-25 |
Ceci n’est pas une pomme: Adversarial Illusions in Multi-Modal Embeddings | Eugene Bagdasaryan, Vitaly Shmatikov | cs.CR, cs.AI, cs.LG | 2023-08-22 |
Semantic Consistency for Assuring Reliability of Large Language Models | Harsh Raj, Vipul Gupta, Domenic Rosati, Subhabrata Majumdar | cs.CL, cs.AI, cs.CY | 2023-08-17 |
Can Knowledge Graphs Simplify Text? | Anthony Colas, Haodi Ma, Xuanli He, Yang Bai, Daisy Zhe Wang | cs.CL | 2023-08-14 |
Classification of Human- and AI-Generated Texts: Investigating Features for ChatGPT | Lorenz Mindner, Tim Schlippe, Kristina Schaaff | cs.CL, cs.AI | 2023-08-10 |
Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning | Alexander Hanbo Li, Mingyue Shang, Evangelia Spiliopoulou, Jie Ma, Patrick Ng, Zhiguo Wang, Bonan Min, William Wang, Kathleen McKeown, Vittorio Castelli, Dan Roth, Bing Xiang | cs.CL | 2023-08-10 |
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao | cs.CL, cs.AI | 2023-08-07 |
Evaluating Generative Models for Graph-to-Text Generation | Shuzhou Yuan, Michael Färber | cs.CL, cs.AI | 2023-07-27 |
Controllable Generation of Dialogue Acts for Dialogue Systems via Few-Shot Response Generation and Ranking | Angela Ramirez, Karik Agarwal, Juraj Juraska, Utkarsh Garg, Marilyn A. Walker | cs.CL | 2023-07-26 |
Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy | Yu Fu, Deyi Xiong, Yue Dong | cs.CL, cs.CR | 2023-07-25 |
FacTool: Factuality Detection in Generative AI – A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios | I-Chun Chern, Steffi Chern, Shiqi Chen, Weizhe Yuan, Kehua Feng, Chunting Zhou, Junxian He, Graham Neubig, Pengfei Liu | cs.CL, cs.AI | 2023-07-25 |
Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels? | Cheng-En Wu, Yu Tian, Haichao Yu, Heng Wang, Pedro Morgado, Yu Hen Hu, Linjie Yang | cs.CV, cs.AI, cs.LG | 2023-07-22 |
OxfordTVG-HIC: Can Machine Make Humorous Captions from Images? | Runjia Li, Shuyang Sun, Mohamed Elhoseiny, Philip Torr | cs.CV, cs.CL | 2023-07-21 |
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models | Michael Günther, Louis Milliken, Jonathan Geuter, Georgios Mastrapas, Bo Wang, Han Xiao | cs.CL, cs.AI, cs.IR, cs.LG, 68T50, H.3.1; H.3.3; I.2.7; I.5.4 | 2023-07-20 |
Generative Language Models on Nucleotide Sequences of Human Genes | Musa Nuri Ihtiyar, Arzucan Ozgur | q-bio.GN, cs.CL, cs.LG | 2023-07-20 |
On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large Language Models | Sarah Gao, Andrew Kean Gao | cs.DL, cs.CL, I.2.1; H.5.0 | 2023-07-19 |
Unveiling Gender Bias in Terms of Profession Across LLMs: Analyzing and Addressing Sociological Implications | Vishesh Thakur | cs.CL | 2023-07-18 |
Large Language Models Perform Diagnostic Reasoning | Cheng-Kuang Wu, Wei-Lin Chen, Hsin-Hsi Chen | cs.CL | 2023-07-18 |
COLLIE: Systematic Construction of Constrained Text Generation Tasks | Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik Narasimhan | cs.CL, cs.AI, cs.LG | 2023-07-17 |
A Dialogue System for Assessing Activities of Daily Living: Improving Consistency with Grounded Knowledge | Zhecheng Sheng, Raymond Finzel, Michael Lucke, Sheena Dufresne, Maria Gini, Serguei Pakhomov | cs.CL, cs.AI | 2023-07-15 |
Using Large Language Models for Zero-Shot Natural Language Generation from Knowledge Graphs | Agnes Axelsson, Gabriel Skantze | cs.CL, 68T50, I.2.7; I.2.4 | 2023-07-14 |
Reading Radiology Imaging Like The Radiologist | Yuhao Wang | cs.CV, cs.AI | 2023-07-12 |
SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference | Luciano Del Corro, Allie Del Giorno, Sahaj Agarwal, Bin Yu, Ahmed Awadallah, Subhabrata Mukherjee | cs.CL | 2023-07-05 |
A ChatGPT Aided Explainable Framework for Zero-Shot Medical Image Diagnosis | Jiaxiang Liu, Tianxiang Hu, Yan Zhang, Xiaotang Gai, Yang Feng, Zuozhu Liu | eess.IV, cs.CV, cs.LG | 2023-07-05 |
Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data | Guangzhi Sun, Chao Zhang, Ivan Vulić, Paweł Budzianowski, Philip C. Woodland | cs.CL | 2023-07-04 |
PatternGPT :A Pattern-Driven Framework for Large Language Model Text Generation | Le Xiao, Xin Shan | cs.CL, cs.AI | 2023-07-02 |
Concept-Oriented Deep Learning with Large Language Models | Daniel T. Chang | cs.LG, cs.CL | 2023-06-29 |
Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio | Allen Roush, Sanjay Basu, Akshay Moorthy, Dmitry Dubovoy | cs.CL, cs.AI, cs.LG | 2023-06-28 |
Knowledge Graph-Augmented Korean Generative Commonsense Reasoning | Dahyun Jung, Jaehyung Seo, Jaewook Lee, Chanjun Park, Heuiseok Lim | cs.CL, cs.AI | 2023-06-26 |
Long-range Language Modeling with Self-retrieval | Ohad Rubin, Jonathan Berant | cs.CL | 2023-06-23 |
AudioPaLM: A Large Language Model That Can Speak and Listen | Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Frank | cs.CL, cs.AI, cs.SD, eess.AS, stat.ML | 2023-06-22 |
Learning to Generate Better Than Your LLM | Jonathan D. Chang, Kiante Brantley, Rajkumar Ramamurthy, Dipendra Misra, Wen Sun | cs.LG, cs.AI, cs.CL | 2023-06-20 |
ChatGPT is not Enough: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling | Linyao Yang, Hongyang Chen, Zhao Li, Xiao Ding, Xindong Wu | cs.CL, cs.AI | 2023-06-20 |
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue | Weihao Zeng, Keqing He, Yejie Wang, Chen Zeng, Jingang Wang, Yunsen Xian, Weiran Xu | cs.CL | 2023-06-17 |
Semi-supervised Relation Extraction via Data Augmentation and Consistency-training | Komal K. Teru | cs.CL, cs.IR | 2023-06-16 |
Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models | Geon Yeong Park, Jeongsol Kim, Beomsu Kim, Sang Wan Lee, Jong Chul Ye | cs.CV, cs.AI, cs.CL, cs.LG | 2023-06-16 |
Neural models for Factual Inconsistency Classification with Explanations | Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma | cs.CL, cs.AI | 2023-06-15 |
Knowledge Distillation of Large Language Models | Yuxian Gu, Li Dong, Furu Wei, Minlie Huang | cs.CL, cs.AI | 2023-06-14 |
PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation | Zhiyuan Hu, Chumin Liu, Yue Feng, Bryan Hooi | cs.CL | 2023-06-14 |
Unifying Large Language Models and Knowledge Graphs: A Roadmap | Shirui Pan, Linhao Luo, Yufei Wang, Chen Chen, Jiapu Wang, Xindong Wu | cs.CL, cs.AI | 2023-06-14 |
GBSD: Generative Bokeh with Stage Diffusion | Jieren Deng, Xin Zhou, Hao Tian, Zhihong Pan, Derek Aguiar | cs.CV, cs.AI | 2023-06-14 |
Question Decomposition Tree for Answering Complex Questions over Knowledge Bases | Xiang Huang, Sitao Cheng, Yiheng Shu, Yuheng Bao, Yuzhong Qu | cs.CL | 2023-06-13 |
Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization | Mohamed Trabelsi, Huseyin Uzunalioglu | cs.CL, cs.LG | 2023-06-07 |
Injecting knowledge into language generation: a case study in auto-charting after-visit care instructions from medical dialogue | Maksim Eremeev, Ilya Valmianski, Xavier Amatriain, Anitha Kannan | cs.CL | 2023-06-06 |
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model | Yizhe Zhang, Jiatao Gu, Zhuofeng Wu, Shuangfei Zhai, Josh Susskind, Navdeep Jaitly | cs.CL | 2023-06-05 |
Adaptive and Personalized Exercise Generation for Online Language Learning | Peng Cui, Mrinmaya Sachan | cs.CL, cs.AI | 2023-06-04 |
Commonsense Knowledge Transfer for Pre-trained Language Models | Wangchunshu Zhou, Ronan Le Bras, Yejin Choi | cs.CL, cs.AI, cs.LG | 2023-06-04 |
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference | Wangchunshu Zhou, Ronan Le Bras, Yejin Choi | cs.CL, cs.LG | 2023-06-04 |
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization | Jung Hyun Lee, Jeonghoon Kim, Se Jung Kwon, Dongsoo Lee | cs.LG, cs.AI | 2023-06-01 |
An Invariant Learning Characterization of Controlled Text Generation | Carolina Zheng, Claudia Shi, Keyon Vafa, Amir Feder, David M. Blei | cs.CL, cs.LG | 2023-05-31 |
Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation | Chunliu Wang, Huiyuan Lai, Malvina Nissim, Johan Bos | cs.CL | 2023-05-31 |
Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency | Maxwell Crouse, Ramon Astudillo, Tahira Naseem, Subhajit Chaudhury, Pavan Kapanipathi, Salim Roukos, Alexander Gray | cs.CL, cs.AI | 2023-05-31 |
Fine-grained Text Style Transfer with Diffusion-Based Language Models | Yiwei Lyu, Tiange Luo, Jiacheng Shi, Todd C. Hollon, Honglak Lee | cs.CL, cs.AI, cs.LG | 2023-05-31 |
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | Jingyuan Selena She, Christopher Potts, Samuel R. Bowman, Atticus Geiger | cs.CL, cs.LG | 2023-05-30 |
Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models | Zhen Lin, Shubhendu Trivedi, Jimeng Sun | cs.CL, cs.LG, stat.ML | 2023-05-30 |
Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps? | Seyed Mahed Mousavi, Simone Caldarella, Giuseppe Riccardi | cs.CL | 2023-05-25 |
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation | Niels Mündler, Jingxuan He, Slobodan Jenko, Martin Vechev | cs.CL, cs.AI, cs.LG | 2023-05-25 |
Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering | Avi Caciularu, Matthew E. Peters, Jacob Goldberger, Ido Dagan, Arman Cohan | cs.CL, cs.AI | 2023-05-24 |
Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing | Tianyi Tang, Hongyuan Lu, Yuchen Eleanor Jiang, Haoyang Huang, Dongdong Zhang, Wayne Xin Zhao, Furu Wei | cs.CL | 2023-05-24 |
The ACL OCL Corpus: advancing Open science in Computational Linguistics | Shaurya Rohatgi, Yanxia Qin, Benjamin Aw, Niranjana Unnithan, Min-Yen Kan | cs.CL, cs.DL | 2023-05-24 |
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding | Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Scott Wen-tau Yih | cs.CL | 2023-05-24 |
Diffusion Models in NLP: A Survey | Hao Zou, Zae Myung Kim, Dongyeop Kang | cs.CL | 2023-05-24 |
Enhancing Generation through Summarization Duality and Explicit Outline Control | Yunzhe Li, Qian Chen, Weixiang Yan, Wen Wang, Qinglin Zhang, Hari Sundaram | cs.CL, cs.AI | 2023-05-23 |
INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback | Wenda Xu, Danqing Wang, Liangming Pan, Zhenqiao Song, Markus Freitag, William Yang Wang, Lei Li | cs.CL, cs.AI | 2023-05-23 |
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | Sewon Min, Kalpesh Krishna, Xinxi Lyu, Mike Lewis, Wen-tau Yih, Pang Wei Koh, Mohit Iyyer, Luke Zettlemoyer, Hannaneh Hajishirzi | cs.CL, cs.AI, cs.LG | 2023-05-23 |
Evaluating Factual Consistency of Texts with Semantic Role Labeling | Jing Fan, Dennis Aumiller, Michael Gertz | cs.CL | 2023-05-22 |
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language | Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu | cs.CL | 2023-05-22 |
STOAT: Structured Data to Analytical Text With Controls | Deepanway Ghosal, Preksha Nema, Aravindan Raghuveer | cs.CL, cs.AI | 2023-05-19 |
Generating Visual Spatial Description via Holistic 3D Scene Understanding | Yu Zhao, Hao Fei, Wei Ji, Jianguo Wei, Meishan Zhang, Min Zhang, Tat-Seng Chua | cs.CV, cs.CL | 2023-05-19 |
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability | Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, Barbara Plank | cs.CL, cs.AI, cs.LG | 2023-05-19 |
Cross-modality Data Augmentation for End-to-End Sign Language Translation | Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Hui Xiong | cs.CL | 2023-05-18 |
What You See is What You Read? Improving Text-Image Alignment Evaluation | Michal Yarom, Yonatan Bitton, Soravit Changpinyo, Roee Aharoni, Jonathan Herzig, Oran Lang, Eran Ofek, Idan Szpektor | cs.CL, cs.CV | 2023-05-17 |
Boosting Event Extraction with Denoised Structure-to-Text Augmentation | bo wang, Heyan Huang, Xiaochi Wei, Ge Shi, Xiao Liu, Chong Feng, Tong Zhou, Shuaiqiang Wang, Dawei Yin | cs.CL | 2023-05-16 |
Watermarking Text Generated by Black-Box Language Models | Xi Yang, Kejiang Chen, Weiming Zhang, Chang Liu, Yuang Qi, Jie Zhang, Han Fang, Nenghai Yu | cs.CL, cs.AI | 2023-05-14 |
ProKnow: Process Knowledge for Safety Constrained and Explainable Question Generation for Mental Health Diagnostic Assistance | Kaushik Roy, Manas Gaur, Misagh Soltani, Vipula Rawte, Ashwin Kalyan, Amit Sheth | cs.CL | 2023-05-13 |
Autocorrelations Decay in Texts and Applicability Limits of Language Models | Nikolay Mikhaylovskiy, Ilya Churilov | cs.CL, I.2.7 | 2023-05-11 |
Long-Tailed Question Answering in an Open World | Yi Dai, Hao Lang, Yinhe Zheng, Fei Huang, Yongbin Li | cs.CL, cs.AI, cs.LG | 2023-05-11 |
Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? An Examination on Several Typical Tasks | Xianzhi Li, Xiaodan Zhu, Zhiqiang Ma, Xiaomo Liu, Sameena Shah | cs.CL, cs.AI | 2023-05-10 |
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages | Rahul Aralikatte, Ziling Cheng, Sumanth Doddapaneni, Jackie Chi Kit Cheung | cs.CL | 2023-05-10 |
Vision-Language Models in Remote Sensing: Current Progress and Future Trends | Congcong Wen, Yuan Hu, Xiang Li, Zhenghang Yuan, Xiao Xiang Zhu | cs.CV, cs.AI | 2023-05-09 |
Dreams Are More “Predictable’’ Than You Think | Lorenzo Bertolini | cs.CL | 2023-05-08 |
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation | Kun Zhou, Yifan Li, Wayne Xin Zhao, Ji-Rong Wen | cs.CL | 2023-05-06 |
Expository Text Generation: Imitate, Retrieve, Paraphrase | Nishant Balepur, Jie Huang, Kevin Chen-Chuan Chang | cs.CL | 2023-05-05 |
Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain | Liqiang Jing, Xuemeng Song, Xuming Lin, Zhongzhou Zhao, Wei Zhou, Liqiang Nie | cs.CL | 2023-05-05 |
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs | Jinyang Li, Binyuan Hui, Ge Qu, Binhua Li, Jiaxi Yang, Bowen Li, Bailin Wang, Bowen Qin, Rongyu Cao, Ruiying Geng, Nan Huo, Chenhao Ma, Kevin C. C. Chang, Fei Huang, Reynold Cheng, Yongbin Li | cs.CL | 2023-05-04 |
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training | Nitay Calderon, Subhabrata Mukherjee, Roi Reichart, Amir Kantor | cs.CL, cs.AI | 2023-05-03 |
The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers | Ariel Gera, Roni Friedman, Ofir Arviv, Chulaka Gunasekara, Benjamin Sznajder, Noam Slonim, Eyal Shnarch | cs.CL, cs.LG | 2023-05-02 |
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond | Jingfeng Yang, Hongye Jin, Ruixiang Tang, Xiaotian Han, Qizhang Feng, Haoming Jiang, Bing Yin, Xia Hu | cs.CL, cs.AI, cs.LG | 2023-04-26 |
Evaluating Inter-Bilingual Semantic Parsing for Indian Languages | Divyanshu Aggarwal, Vivek Gupta, Anoop Kunchukuttan | cs.CL | 2023-04-25 |
Divide and Prompt: Chain of Thought Prompting for Text-to-SQL | Xiping Liu, Zhao Tan | cs.CL, cs.AI | 2023-04-23 |
Translationese Reduction using Abstract Meaning Representation | Shira Wein, Nathan Schneider | cs.CL | 2023-04-23 |
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information | Qiao Jin, Yifan Yang, Qingyu Chen, Zhiyong Lu | cs.CL, cs.AI, q-bio.GN | 2023-04-19 |
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models | Yikang Liu, Ziyin Zhang, Wanyang Zhang, Shisen Yue, Xiaojing Zhao, Xinyuan Cheng, Yiwen Zhang, Hai Hu | cs.CL | 2023-04-16 |
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study | Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro | cs.CL, cs.AI, cs.IR, cs.LG | 2023-04-13 |
Training Large Language Models Efficiently with Sparsity and Dataflow | Venkat Srinivasan, Darshan Gandhi, Urmish Thakker, Raghu Prabhakar | cs.LG | 2023-04-11 |
To ChatGPT, or not to ChatGPT: That is the question! | Alessandro Pegoraro, Kavita Kumari, Hossein Fereidooni, Ahmad-Reza Sadeghi | cs.LG, cs.AI, cs.CL | 2023-04-04 |
Measuring and Manipulating Knowledge Representations in Language Models | Evan Hernandez, Belinda Z. Li, Jacob Andreas | cs.CL | 2023-04-03 |
Assessing Language Model Deployment with Risk Cards | Leon Derczynski, Hannah Rose Kirk, Vidhisha Balachandran, Sachin Kumar, Yulia Tsvetkov, M. R. Leiser, Saif Mohammad | cs.CL | 2023-03-31 |
ChatGPT or academic scientist? Distinguishing authorship with over 99% accuracy using off-the-shelf machine learning tools | Heather Desaire, Aleesa E. Chua, Madeline Isom, Romana Jarosova, David Hua | cs.LG, cs.CL | 2023-03-28 |
On Codex Prompt Engineering for OCL Generation: An Empirical Study | Seif Abukhalaf, Mohammad Hamdaqa, Foutse Khomh | cs.SE, cs.AI | 2023-03-28 |
Large Language Models are Diverse Role-Players for Summarization Evaluation | Ning Wu, Ming Gong, Linjun Shou, Shining Liang, Daxin Jiang | cs.CL | 2023-03-27 |
Paraphrase Detection: Human vs. Machine Content | Jonas Becker, Jan Philip Wahle, Terry Ruas, Bela Gipp | cs.CL, cs.AI | 2023-03-24 |
CoBIT: A Contrastive Bi-directional Image-Text Generation Model | Haoxuan You, Mandy Guo, Zhecan Wang, Kai-Wei Chang, Jason Baldridge, Jiahui Yu | cs.CV, cs.CL | 2023-03-23 |
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense | Kalpesh Krishna, Yixiao Song, Marzena Karpinska, John Wieting, Mohit Iyyer | cs.CL, cs.CR, cs.LG | 2023-03-23 |
Compositional Zero-Shot Domain Transfer with Text-to-Text Models | Fangyu Liu, Qianchu Liu, Shruthi Bannur, Fernando Pérez-García, Naoto Usuyama, Sheng Zhang, Tristan Naumann, Aditya Nori, Hoifung Poon, Javier Alvarez-Valle, Ozan Oktay, Stephanie L. Hyland | cs.CL, cs.LG | 2023-03-23 |
Language Model Behavior: A Comprehensive Survey | Tyler A. Chang, Benjamin K. Bergen | cs.CL | 2023-03-20 |
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models | Vithursan Thangarasa, Abhay Gupta, William Marshall, Tianda Li, Kevin Leong, Dennis DeCoste, Sean Lie, Shreyas Saxena | cs.LG, cs.CL | 2023-03-18 |
Input-length-shortening and text generation via attention values | Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock | cs.CL | 2023-03-14 |
Transformer-based Planning for Symbolic Regression | Parshin Shojaee, Kazem Meidani, Amir Barati Farimani, Chandan K. Reddy | cs.LG, cs.AI | 2023-03-13 |
Large Language Models as Zero-Shot Human Models for Human-Robot Interaction | Bowen Zhang, Harold Soh | cs.RO, cs.CL, cs.HC, cs.LG | 2023-03-06 |
A Universal Question-Answering Platform for Knowledge Graphs | Reham Omar, Ishika Dhall, Panos Kalnis, Essam Mansour | cs.AI, cs.CL, cs.DB | 2023-03-01 |
Inseq: An Interpretability Toolkit for Sequence Generation Models | Gabriele Sarti, Nils Feldhus, Ludwig Sickert, Oskar van der Wal | cs.CL, cs.AI, cs.HC, cs.LG | 2023-02-27 |
Toward Fairness in Text Generation via Mutual Information Minimization based on Importance Sampling | Rui Wang, Pengyu Cheng, Ricardo Henao | cs.CL, cs.AI | 2023-02-25 |
Few-Shot Table-to-Text Generation with Prompt-based Adapter | Zhixin Guo, Minyxuan Yan, Jiexing Qi, Jianping Zhou, Ziwei He, Zhouhan Lin, Guanjie Zheng, Xinbing Wang | cs.CL | 2023-02-24 |
How Does In-Context Learning Help Prompt Tuning? | Simeng Sun, Yang Liu, Dan Iter, Chenguang Zhu, Mohit Iyyer | cs.CL | 2023-02-22 |
KILM: Knowledge Injection into Encoder-Decoder Language Models | Yan Xu, Mahdi Namazifar, Devamanyu Hazarika, Aishwarya Padmakumar, Yang Liu, Dilek Hakkani-Tür | cs.CL, cs.AI | 2023-02-17 |
Do We Still Need Clinical Language Models? | Eric Lehman, Evan Hernandez, Diwakar Mahajan, Jonas Wulff, Micah J. Smith, Zachary Ziegler, Daniel Nadler, Peter Szolovits, Alistair Johnson, Emily Alsentzer | cs.CL | 2023-02-16 |
Tree-Based Representation and Generation of Natural and Mathematical Language | Alexander Scarlatos, Andrew Lan | cs.CL | 2023-02-15 |
Investigating the Effect of Relative Positional Embeddings on AMR-to-Text Generation with Structural Adapters | Sebastien Montella, Alexis Nasr, Johannes Heinecke, Frederic Bechet, Lina M. Rojas-Barahona | cs.CL | 2023-02-12 |
TextDefense: Adversarial Text Detection based on Word Importance Entropy | Lujia Shen, Xuhong Zhang, Shouling Ji, Yuwen Pu, Chunpeng Ge, Xing Yang, Yanghe Feng | cs.CL, cs.AI, cs.CR | 2023-02-12 |
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning | Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Ming-Yu Liu, Yuke Zhu, Mohammad Shoeybi, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar | cs.CV, cs.AI, cs.CL, cs.IR, cs.LG | 2023-02-09 |
Lightweight Transformers for Clinical Natural Language Processing | Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson, David A. Clifton | cs.CL, cs.AI, cs.LG, 68T50, I.2.7 | 2023-02-09 |
Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization | Zhixin Guo, Minyxuan Yan, Jiexing Qi, Jianping Zhou, Ziwei He, Zhouhan Lin, Guanjie Zheng, Xinbing Wang | cs.CL, cs.AI | 2023-02-09 |
Auto-Learning: An Adversarial Process of Two Pre-trained Models for Natural Language Generation | Zhengqing Yuan, Yuelin Lu, Chao Zhang, Huiwen Xue | cs.CL | 2023-02-08 |
Controlling Personality Style in Dialogue with Zero-Shot Prompt-Based Learning | Angela Ramirez, Mamon Alsalihy, Kartik Aggarwal, Cecilia Li, Liren Wu, Marilyn Walker | cs.CL | 2023-02-08 |
Execution-based Code Generation using Deep Reinforcement Learning | Parshin Shojaee, Aneesh Jain, Sindhu Tipirneni, Chandan K. Reddy | cs.LG, cs.AI, cs.CL, cs.PL | 2023-01-31 |
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature | Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D. Manning, Chelsea Finn | cs.CL, cs.AI | 2023-01-26 |
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER | Xiang Chen, Lei Li, Qiaoshuo Fei, Ningyu Zhang, Chuanqi Tan, Yong Jiang, Fei Huang, Huajun Chen | cs.CL, cs.AI, cs.DB, cs.IR, cs.LG | 2023-01-25 |
Semantic-aware Contrastive Learning for Electroencephalography-to-Text Generation with Curriculum Learning | Xiachong Feng, Xiaocheng Feng, Bing Qin | cs.HC, cs.CL | 2023-01-23 |
Rationalization for Explainable NLP: A Survey | Sai Gurrapu, Ajay Kulkarni, Lifu Huang, Ismini Lourentzou, Laura Freeman, Feras A. Batarseh | cs.CL, cs.LG | 2023-01-21 |
A Multi-Modal Geographic Pre-Training Method | Ruixue Ding, Boli Chen, Pengjun Xie, Fei Huang, Xin Li, Qiang Zhang, Yao Xu | cs.CL | 2023-01-11 |
Universal Multimodal Representation for Language Understanding | Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao | cs.CL, cs.AI, cs.CV | 2023-01-09 |
TegFormer: Topic-to-Essay Generation with Good Topic Coverage and High Text Coherence | Wang Qi, Rui Liu, Yuan Zuo, Yong Chen, Dell Zhang | cs.CL | 2022-12-27 |
Do DALL-E and Flamingo Understand Each Other? | Hang Li, Jindong Gu, Rajat Koner, Sahand Sharifzadeh, Volker Tresp | cs.CV, cs.LG | 2022-12-23 |
Understanding Stereotypes in Language Models: Towards Robust Measurement and Zero-Shot Debiasing | Justus Mattern, Zhijing Jin, Mrinmaya Sachan, Rada Mihalcea, Bernhard Schölkopf | cs.CL, cs.LG | 2022-12-20 |
Controllable Text Generation with Language Constraints | Howard Chen, Huihan Li, Danqi Chen, Karthik Narasimhan | cs.CL | 2022-12-20 |
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning | Xiaoming Liu, Zhaohan Zhang, Yichen Wang, Yu Lan, Chao Shen | cs.CL | 2022-12-20 |
One Embedder, Any Task: Instruction-Finetuned Text Embeddings | Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu | cs.CL | 2022-12-19 |
Synthesis and Evaluation of a Domain-specific Large Data Set for Dungeons & Dragons | Akila Peiris, Nisansa de Silva | cs.CL, cs.LG | 2022-12-18 |
Plansformer: Generating Symbolic Plans using Transformers | Vishal Pallagani, Bharath Muppasani, Keerthiram Murugesan, Francesca Rossi, Lior Horesh, Biplav Srivastava, Francesco Fabiano, Andrea Loreggia | cs.AI | 2022-12-16 |
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation | Swarnadeep Saha, Xinyan Velocity Yu, Mohit Bansal, Ramakanth Pasunuru, Asli Celikyilmaz | cs.CL, cs.AI, cs.LG | 2022-12-16 |
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning | Olga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz | cs.CL, cs.LG | 2022-12-15 |
Collaborating Heterogeneous Natural Language Processing Tasks via Federated Learning | Chenhe Dong, Yuexiang Xie, Bolin Ding, Ying Shen, Yaliang Li | cs.CL | 2022-12-12 |
Discovering Latent Knowledge in Language Models Without Supervision | Collin Burns, Haotian Ye, Dan Klein, Jacob Steinhardt | cs.CL, cs.AI, cs.LG | 2022-12-07 |
Harnessing Knowledge and Reasoning for Human-Like Natural Language Generation: A Brief Review | Jiangjie Chen, Yanghua Xiao | cs.CL | 2022-12-07 |
Momentum Decoding: Open-ended Text Generation As Graph Exploration | Tian Lan, Yixuan Su, Shuhang Liu, Heyan Huang, Xian-Ling Mao | cs.CL | 2022-12-05 |
Learning Automata-Based Task Knowledge Representation from Large-Scale Generative Language Models | Yunhao Yang, Jean-Raphaël Gaglione, Ufuk Topcu | cs.FL, cs.CL | 2022-12-04 |
What do you MEME? Generating Explanations for Visual Semantic Role Labelling in Memes | Shivam Sharma, Siddhant Agarwal, Tharun Suresh, Preslav Nakov, Md. Shad Akhtar, Tanmoy Charkraborty | cs.CY, cs.CL | 2022-12-01 |
CliMedBERT: A Pre-trained Language Model for Climate and Health-related Text | B. Jalalzadeh Fard, S. A. Hasan, J. E. Bell | cs.CL, 68T50, I.2.7 | 2022-12-01 |
On the Importance of Image Encoding in Automated Chest X-Ray Report Generation | Otabek Nazarov, Mohammad Yaqub, Karthik Nandakumar | cs.CV, cs.AI | 2022-11-24 |
Retrieval-Augmented Multimodal Language Modeling | Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih | cs.CV, cs.CL, cs.LG | 2022-11-22 |
Towards Computationally Verifiable Semantic Grounding for Language Models | Chris Alberti, Kuzman Ganchev, Michael Collins, Sebastian Gehrmann, Ciprian Chelba | cs.CL | 2022-11-16 |
kogito: A Commonsense Knowledge Inference Toolkit | Mete Ismayilzada, Antoine Bosselut | cs.CL | 2022-11-15 |
A Survey of Knowledge-Enhanced Pre-trained Language Models | Linmei Hu, Zeyi Liu, Ziwang Zhao, Lei Hou, Liqiang Nie, Juanzi Li | cs.CL | 2022-11-11 |
CCPrompt: Counterfactual Contrastive Prompt-Tuning for Many-Class Classification | Yang Li, Canran Xu, Tao Shen, Jing Jiang, Guodong Long | cs.CL | 2022-11-11 |
Measuring Reliability of Large Language Models through Semantic Consistency | Harsh Raj, Domenic Rosati, Subhabrata Majumdar | cs.CL, cs.AI, cs.CY | 2022-11-10 |
Generative Transformers for Design Concept Generation | Qihao Zhu, Jianxi Luo | cs.CL | 2022-11-07 |
CLSE: Corpus of Linguistically Significant Entities | Aleksandr Chuklin, Justin Zhao, Mihir Kale | cs.CL | 2022-11-04 |
Dialect-robust Evaluation of Generated Text | Jiao Sun, Thibault Sellam, Elizabeth Clark, Tu Vu, Timothy Dozat, Dan Garrette, Aditya Siddhant, Jacob Eisenstein, Sebastian Gehrmann | cs.CL | 2022-11-02 |
Seq2Seq-SC: End-to-End Semantic Communication Systems with Pre-trained Language Model | Ju-Hyung Lee, Dong-Ho Lee, Eunsoo Sheen, Thomas Choi, Jay Pujara, Joongheon Kim | eess.SP, cs.CL | 2022-10-27 |
Contrastive Search Is What You Need For Neural Text Generation | Yixuan Su, Nigel Collier | cs.CL | 2022-10-25 |
Mapping Process for the Task: Wikidata Statements to Text as Wikipedia Sentences | Hoang Thang Ta, Alexander Gelbukha, Grigori Sidorov | cs.CL, cs.AI | 2022-10-23 |
Hard Gate Knowledge Distillation – Leverage Calibration for Robust and Reliable Language Model | Dongkyu Lee, Zhiliang Tian, Yingxiu Zhao, Ka Chun Cheung, Nevin L. Zhang | cs.CL, cs.AI | 2022-10-22 |
What do Large Language Models Learn beyond Language? | Avinash Madasu, Shashank Srivastava | cs.CL | 2022-10-21 |
Image Semantic Relation Generation | Mingzhe Du | cs.CV, cs.CL | 2022-10-19 |
NGEP: A Graph-based Event Planning Framework for Story Generation | Chen Tang, Zhihao Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin | cs.CL, cs.AI | 2022-10-19 |
SafeText: A Benchmark for Exploring Physical Safety in Language Models | Sharon Levy, Emily Allaway, Melanie Subbiah, Lydia Chilton, Desmond Patton, Kathleen McKeown, William Yang Wang | cs.CL, cs.AI | 2022-10-18 |
DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation | Hanqing Zhang, Dawei Song | cs.CL | 2022-10-18 |
UniTune: Text-Driven Image Editing by Fine Tuning an Image Generation Model on a Single Image | Dani Valevski, Matan Kalman, Yossi Matias, Yaniv Leviathan | cs.CV, cs.GR, cs.LG | 2022-10-17 |
Deepfake Text Detection: Limitations and Opportunities | Jiameng Pu, Zain Sarwar, Sifat Muhammad Abdullah, Abdullah Rehman, Yoonjin Kim, Parantapa Bhattacharya, Mobin Javed, Bimal Viswanath | cs.CR, cs.CL, cs.LG | 2022-10-17 |
Towards a Unified Multi-Dimensional Evaluator for Text Generation | Ming Zhong, Yang Liu, Da Yin, Yuning Mao, Yizhu Jiao, Pengfei Liu, Chenguang Zhu, Heng Ji, Jiawei Han | cs.CL | 2022-10-13 |
RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval | Xing Wu, Chaochen Gao, Zijia Lin, Zhongyuan Wang, Jizhong Han, Songlin Hu | cs.CV, cs.AI | 2022-10-13 |
Equi-Tuning: Group Equivariant Fine-Tuning of Pretrained Models | Sourya Basu, Prasanna Sattigeri, Karthikeyan Natesan Ramamurthy, Vijil Chenthamarakshan, Kush R. Varshney, Lav R. Varshney, Payel Das | cs.LG, cs.CL | 2022-10-13 |
CHARD: Clinical Health-Aware Reasoning Across Dimensions for Text Generation Models | Steven Y. Feng, Vivek Khetan, Bogdan Sacaleanu, Anatole Gershman, Eduard Hovy | cs.CL, cs.AI, cs.LG | 2022-10-09 |
BLAB Reporter: Automated journalism covering the Blue Amazon | Yan V. Sym, João Gabriel M. Campos, Fabio G. Cozman | cs.CL | 2022-10-08 |
Bird-Eye Transformers for Text Generation Models | Lei Sha, Yuhang Song, Yordan Yordanov, Tommaso Salvatori, Thomas Lukasiewicz | cs.CL | 2022-10-08 |
A Unified Encoder-Decoder Framework with Entity Memory | Zhihan Zhang, Wenhao Yu, Chenguang Zhu, Meng Jiang | cs.CL | 2022-10-07 |
Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics | Zihao Wang, Jiaheng Dou, Yong Zhang | cs.CL | 2022-10-05 |
Synonym Detection Using Syntactic Dependency And Neural Embeddings | Dongqiang Yang, Pikun Wang, Xiaodong Sun, Ning Li | cs.CL, cs.AI | 2022-09-30 |
Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals | Piotr Mirowski, Kory W. Mathewson, Jaylen Pittman, Richard Evans | cs.HC, cs.CL | 2022-09-29 |
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing | Yanjun Gao, Dmitriy Dligach, Timothy Miller, John Caskey, Brihat Sharma, Matthew M Churpek, Majid Afshar | cs.CL, cs.AI | 2022-09-29 |
FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation | Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani | cs.CL, cs.IR | 2022-09-28 |
Informative Text Generation from Knowledge Triples | Zihao Fu, Yijiang River Dong, Lidong Bing, Wai Lam | cs.CL | 2022-09-26 |
Controllable Text Generation for Open-Domain Creativity and Fairness | Nanyun Peng | cs.CL, cs.AI | 2022-09-24 |
Can we do that simpler? Simple, Efficient, High-Quality Evaluation Metrics for NLG | Jens Grünwald, Christoph Leiter, Steffen Eger | cs.CL, cs.LG | 2022-09-20 |
Selective Token Generation for Few-shot Natural Language Generation | Daejin Jo, Taehwan Kwon, Eun-Sol Kim, Sungwoong Kim | cs.CL, cs.LG | 2022-09-17 |
TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations | Xinyang Zhang, Yury Malkov, Omar Florez, Serim Park, Brian McWilliams, Jiawei Han, Ahmed El-Kishky | cs.CL | 2022-09-15 |
Distribution Aware Metrics for Conditional Natural Language Generation | David M Chan, Yiming Ni, Austin Myers, Sudheendra Vijayanarasimhan, David A Ross, John Canny | cs.CL, cs.AI, cs.CV, cs.LG | 2022-09-15 |
vec2text with Round-Trip Translations | Geoffrey Cideron, Sertan Girgin, Anton Raichuk, Olivier Pietquin, Olivier Bachem, Léonard Hussenot | cs.CL, cs.LG | 2022-09-14 |
Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows | Keisuke Shirai, Atsushi Hashimoto, Taichi Nishimura, Hirotaka Kameko, Shuhei Kurita, Yoshitaka Ushiku, Shinsuke Mori | cs.CL, cs.AI | 2022-09-13 |
Elaboration-Generating Commonsense Question Answering at Scale | Wenya Wang, Vivek Srikumar, Hanna Hajishirzi, Noah A. Smith | cs.CL | 2022-09-02 |
Multi-Modal Experience Inspired AI Creation | Qian Cao, Xu Chen, Ruihua Song, Hao Jiang, Guang Yang, Zhao Cao | cs.AI | 2022-09-02 |
Unified Knowledge Prompt Pre-training for Customer Service Dialogues | Keqing He, Jingang Wang, Chaobo Sun, Wei Wu | cs.CL | 2022-08-31 |
StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing | Xuekai Zhu, Jian Guan, Minlie Huang, Juan Liu | cs.CL | 2022-08-29 |
GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers | Hsien-Chin Lin, Christian Geishauser, Shutong Feng, Nurul Lubis, Carel van Niekerk, Michael Heck, Milica Gašić | cs.CL | 2022-08-23 |
Automatic tagging of knowledge points for K12 math problems | Xiaolu Wang, Ziqi Ding, Liangyu Chen | cs.CL | 2022-08-21 |
Performance Optimization for Semantic Communications: An Attention-based Reinforcement Learning Approach | Yining Wang, Mingzhe Chen, Tao Luo, Walid Saad, Dusit Niyato, H. Vincent Poor, Shuguang Cui | cs.IT, cs.AI, math.IT | 2022-08-17 |
Understanding Attention for Vision-and-Language Tasks | Feiqi Cao, Soyeon Caren Han, Siqu Long, Changwei Xu, Josiah Poon | cs.CV, cs.CL | 2022-08-17 |
A Representation Modeling Based Language GAN with Completely Random Initialization | Da Ren, Qing Li | cs.CL | 2022-08-04 |
LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection | Zhuo Chen, Yufeng Huang, Jiaoyan Chen, Yuxia Geng, Yin Fang, Jeff Pan, Ningyu Zhang, Wen Zhang | cs.CV, cs.AI | 2022-07-26 |
Leveraging Natural Supervision for Language Representation Learning and Generation | Mingda Chen | cs.CL | 2022-07-21 |
Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency | Jin Liu, Chongfeng Fan, Fengyu Zhou, Huijuan Xu | cs.AI | 2022-07-02 |
Is neural language acquisition similar to natural? A chronological probing study | Ekaterina Voloshina, Oleg Serikov, Tatiana Shavrina | cs.CL | 2022-07-01 |
Evaluation of Semantic Answer Similarity Metrics | Farida Mustafazade, Peter F. Ebbinghaus | cs.CL, cs.AI, cs.LG | 2022-06-25 |
BenchCLAMP: A Benchmark for Evaluating Language Models on Semantic Parsing | Subhro Roy, Sam Thomson, Tongfei Chen, Richard Shin, Adam Pauls, Jason Eisner, Benjamin Van Durme | cs.CL | 2022-06-21 |
Interpretable AMR-Based Question Decomposition for Multi-hop Question Answering | Zhenyun Deng, Yonghua Zhu, Yang Chen, Michael Witbrock, Patricia Riddle | cs.CL | 2022-06-16 |
An Exploration of Post-Editing Effectiveness in Text Summarization | Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel Tetreault, Alejandro Jaimes | cs.CL, cs.AI, cs.HC | 2022-06-13 |
Plot Writing From Pre-Trained Language Models | Yiping Jin, Vishakha Kadam, Dittaya Wanvarie | cs.CL | 2022-06-07 |
Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval | Yanmeng Wang, Jun Bai, Ye Wang, Jianfei Zhang, Wenge Rong, Zongcheng Ji, Shaojun Wang, Jing Xiao | cs.CL, cs.AI | 2022-06-07 |
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech | Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye | eess.AS, cs.CL, cs.SD | 2022-06-05 |
Beyond Opinion Mining: Summarizing Opinions of Customer Reviews | Reinald Kim Amplayo, Arthur Bražinskas, Yoshi Suhara, Xiaolan Wang, Bing Liu | cs.CL, cs.AI, cs.LG | 2022-06-03 |
Revisiting Generative Commonsense Reasoning: A Pre-Ordering Approach | Chao Zhao, Faeze Brahman, Tenghao Huang, Snigdha Chaturvedi | cs.CL | 2022-05-26 |
Automatic question generation based on sentence structure analysis using machine learning approach | Miroslav Blšták, Viera Rozinajová | cs.CL, cs.AI | 2022-05-25 |
PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation | Ao Liu, Haoyu Dong, Naoaki Okazaki, Shi Han, Dongmei Zhang | cs.CL | 2022-05-25 |
Exploring industrial safety knowledge via Zipf law | Zhenhua Wang, Ming Ren, Dong Gao, Zhuang Li | cs.CL | 2022-05-25 |
RSTGen: Imbuing Fine-Grained Interpretable Control into Long-FormText Generators | Rilwan A. Adewoyin, Ritabrata Dutta, Yulan He | cs.CL | 2022-05-25 |
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation – through the Lens of Semantic Similarity Rating | Laura Zeidler, Juri Opitz, Anette Frank | cs.CL | 2022-05-24 |
What Makes Data-to-Text Generation Hard for Pretrained Language Models? | Moniba Keymanesh, Adrian Benton, Mark Dredze | cs.CL, cs.AI, cs.IR, cs.LG | 2022-05-23 |
A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation | Weizhen Qi, Yeyun Gong, Yelong Shen, Jian Jiao, Yu Yan, Houqiang Li, Ruofei Zhang, Weizhu Chen, Nan Duan | cs.CL | 2022-05-23 |
CORAL: Contextual Response Retrievability Loss Function for Training Dialog Generation Models | Bishal Santra, Ravi Ghadia, Arpit Dwivedi, Manish Gupta, Pawan Goyal | cs.CL | 2022-05-21 |
Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs | Ghazi Felhi, Joseph Le Roux, Djamé Seddah | cs.CL, cs.LG | 2022-05-12 |
Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning | Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen | cs.CL, cs.AI, cs.LG | 2022-05-08 |
Language Models Can See: Plugging Visual Controls in Text Generation | Yixuan Su, Tian Lan, Yahui Liu, Fangyu Liu, Dani Yogatama, Yan Wang, Lingpeng Kong, Nigel Collier | cs.CV, cs.CL | 2022-05-05 |
Efficient Few-Shot Fine-Tuning for Opinion Summarization | Arthur Bražinskas, Ramesh Nallapati, Mohit Bansal, Markus Dreyer | cs.CL, cs.AI, cs.LG | 2022-05-04 |
Faithful to the Document or to the World? Mitigating Hallucinations via Entity-linked Knowledge in Abstractive Summarization | Yue Dong, John Wieting, Pat Verga | cs.CL | 2022-04-28 |
Recovering Patient Journeys: A Corpus of Biomedical Entities and Relations on Twitter (BEAR) | Amelie Wührl, Roman Klinger | cs.CL, cs.IR | 2022-04-21 |
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond | Yisheng Xiao, Lijun Wu, Junliang Guo, Juntao Li, Min Zhang, Tao Qin, Tie-yan Liu | cs.CL, cs.LG | 2022-04-20 |
Rows from Many Sources: Enriching row completions from Wikidata with a pre-trained Language Model | Carina Negreanu, Alperen Karaoglu, Jack Williams, Shuang Chen, Daniel Fabian, Andrew Gordon, Chin-Yew Lin | cs.CL, cs.AI | 2022-04-14 |
GAP: A Graph-aware Language Model Framework for Knowledge Graph-to-Text Generation | Anthony Colas, Mehrdad Alvandipour, Daisy Zhe Wang | cs.CL | 2022-04-13 |
Explanation Graph Generation via Pre-trained Language Models: An Empirical Study with Contrastive Learning | Swarnadeep Saha, Prateek Yadav, Mohit Bansal | cs.CL, cs.AI, cs.LG | 2022-04-11 |
Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention | Yu Yang, Seungbae Kim, Jungseock Joo | cs.CV, cs.AI, cs.LG | 2022-04-10 |
Knowledge Infused Decoding | Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah | cs.CL, cs.AI, cs.LG | 2022-04-06 |
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations | Leonard Salewski, A. Sophia Koepke, Hendrik P. A. Lensch, Zeynep Akata | cs.CV, cs.CL | 2022-04-05 |
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos | Muheng Li, Lei Chen, Yueqi Duan, Zhilan Hu, Jianjiang Feng, Jie Zhou, Jiwen Lu | cs.CV, cs.AI, cs.LG | 2022-03-26 |
A Roadmap for Big Model | Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, Huawei Shen, Hui Zhang, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan Yao, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, Liwei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang | cs.LG, cs.AI, cs.CL | 2022-03-26 |
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal | Umang Gupta, Jwala Dhamala, Varun Kumar, Apurv Verma, Yada Pruksachatkun, Satyapriya Krishna, Rahul Gupta, Kai-Wei Chang, Greg Ver Steeg, Aram Galstyan | cs.CL, cs.LG | 2022-03-23 |
An Empirical Study on Learning and Improving the Search Objective for Unsupervised Paraphrasing | Weikai Steven Lu | cs.CL, cs.AI, cs.LG | 2022-03-23 |
Perturbations in the Wild: Leveraging Human-Written Text Perturbations for Realistic Adversarial Attack and Defense | Thai Le, Jooyoung Lee, Kevin Yen, Yifan Hu, Dongwon Lee | cs.LG, cs.CL, cs.CR | 2022-03-19 |
Dependency-based Mixture Language Models | Zhixian Yang, Xiaojun Wan | cs.CL | 2022-03-19 |
RoMe: A Robust Metric for Evaluating Natural Language Generation | Md Rashad Al Hasan Rony, Liubov Kovriguina, Debanjan Chaudhuri, Ricardo Usbeck, Jens Lehmann | cs.CL, cs.AI | 2022-03-17 |
PLANET: Dynamic Content Planning in Autoregressive Transformers for Long-form Text Generation | Zhe Hu, Hou Pong Chan, Jiachen Liu, Xinyan Xiao, Hua Wu, Lifu Huang | cs.CL | 2022-03-17 |
TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge | Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang | cs.CL, cs.AI | 2022-03-16 |
Graph Pre-training for AMR Parsing and Generation | Xuefeng Bai, Yulong Chen, Yue Zhang | cs.CL | 2022-03-15 |
Pruned Graph Neural Network for Short Story Ordering | Melika Golestani, Zeinab Borhanifard, Farnaz Tahmasebian, Heshaam Faili | cs.CL | 2022-03-13 |
IndicNLG Suite: Multilingual Datasets for Diverse NLG Tasks in Indic Languages | Aman Kumar, Himani Shrotriya, Prachi Sahu, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Amogh Mishra, Mitesh M. Khapra, Pratyush Kumar | cs.CL, cs.AI | 2022-03-10 |
Recent Advances in Neural Text Generation: A Task-Agnostic Survey | Chen Tang, Frank Guerin, Yucheng Li, Chenghua Lin | cs.CL, cs.AI | 2022-03-06 |
Deep Latent-Variable Models for Text Generation | Xiaoyu Shen | cs.CL | 2022-03-03 |