written on 2025-12-31
| title | authors | categories | displaydate |
|---|---|---|---|
| Web World Models | Jichen Feng, Yifan Zhang, Chenggong Zhang, Yifu Lu, Shilong Liu, Mengdi Wang | cs.AI, cs.CL, cs.CV | 2025-12-29 |
| Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation | Manh Hung Nguyen, Adish Singla | cs.AI | 2025-12-29 |
| Anka: A Domain-Specific Language for Reliable LLM Code Generation | Saif Khalfan Saif Al Mazrouei | cs.CL, cs.LG, cs.PL, cs.SE | 2025-12-29 |
| Scoring, Reasoning, and Selecting the Best! Ensembling Large Language Models via a Peer-Review Process | Zhijun Chen, Zeyu Ji, Qianren Mao, Junhang Cheng, Bangjie Qin, Hao Wu, Zhuoran Li, Jingzheng Li, Kai Sun, Zizhe Wang, Yikun Ban, Zhu Sun, Xiangyang Ji, Hailong Sun | cs.CL, cs.AI | 2025-12-29 |
| Not too long do read: Evaluating LLM-generated extreme scientific summaries | Zhuoqi Lyu, Qing Ke | cs.CL, cs.AI | 2025-12-29 |
| BeHGAN: Bengali Handwritten Word Generation from Plain Text Using Generative Adversarial Networks | Md. Rakibul Islam, Md. Kamrozzaman Bhuiyan, Safwan Muntasir, Arifur Rahman Jawad, Most. Sharmin Sultana Samu | cs.CV, cs.AI | 2025-12-25 |
| Quadrupped-Legged Robot Movement Plan Generation using Large Language Model | Muhtadin, Vincentius Gusti Putu A. B. M., Ahmad Zaini, Mauridhi Hery Purnomo, I Ketut Eddy Purnama, Chastine Fatichah | cs.RO, cs.HC | 2025-12-24 |
| Emotion Diffusion in Real and Simulated Social Graphs: Structural Limits of LLM-Based Social Simulation | Qiqi Qiang | cs.SI | 2025-12-24 |
| NVIDIA Nemotron 3: Efficient and Open Intelligence | NVIDIA, Aaron Blakeman, Aaron Grattafiori, Aarti Basant, Abhibha Gupta, Abhinav Khattar, Adi Renduchintala, Aditya Vavre, Akanksha Shukla, Akhiad Bercovich, Aleksander Ficek, Aleksandr Shaposhnikov, Alex Kondratenko, Alexander Bukharin, Alexandre Milesi, Ali Taghibakhshi, Alisa Liu, Amelia Barton, Ameya Sunil Mahabaleshwarkar, Amir Klein, Amit Zuker, Amnon Geifman, Amy Shen, Anahita Bhiwandiwalla, Andrew Tao, Anjulie Agrusa, Ankur Verma, Ann Guan, Anubhav Mandarwal, Arham Mehta, Ashwath Aithal, Ashwin Poojary, Asif Ahamed, Asit Mishra, Asma Kuriparambil Thekkumpate, Ayush Dattagupta, Banghua Zhu, Bardiya Sadeghi, Barnaby Simkin, Ben Lanir, Benedikt Schifferer, Besmira Nushi, Bilal Kartal, Bita Darvish Rouhani, Boris Ginsburg, Brandon Norick, Brandon Soubasis, Branislav Kisacanin, Brian Yu, Bryan Catanzaro, Carlo del Mundo, Chantal Hwang, Charles Wang, Cheng-Ping Hsieh, Chenghao Zhang, Chenhan Yu, Chetan Mungekar, Chintan Patel, Chris Alexiuk, Christopher Parisien, Collin Neale, Cyril Meurillon, Damon Mosk-Aoyama, Dan Su, Dane Corneil, Daniel Afrimi, Daniel Lo, Daniel Rohrer, Daniel Serebrenik, Daria Gitman, Daria Levy, Darko Stosic, David Mosallanezhad, Deepak Narayanan, Dhruv Nathawani, Dima Rekesh, Dina Yared, Divyanshu Kakwani, Dong Ahn, Duncan Riach, Dusan Stosic, Edgar Minasyan, Edward Lin, Eileen Long, Eileen Peters Long, Elad Segal, Elena Lantz, Ellie Evans, Elliott Ning, Eric Chung, Eric Harper, Eric Tramel, Erick Galinkin, Erik Pounds, Evan Briones, Evelina Bakhturina, Evgeny Tsykunov, Faisal Ladhak, Fay Wang, Fei Jia, Felipe Soares, Feng Chen, Ferenc Galko, Frank Sun, Frankie Siino, Gal Hubara Agam, Ganesh Ajjanagadde, Gantavya Bhatt, Gargi Prasad, George Armstrong, Gerald Shen, Gorkem Batmaz, Grigor Nalbandyan, Haifeng Qian, Harsh Sharma, Hayley Ross, Helen Ngo, Herbert Hum, Herman Sahota, Hexin Wang, Himanshu Soni, Hiren Upadhyay, Huizi Mao, Huy C Nguyen, Huy Q Nguyen, Iain Cunningham, Ido Galil, Ido Shahaf, Igor Gitman, Ilya Loshchilov, Itamar Schen, Itay Levy, Ivan Moshkov, Izik Golan, Izzy Putterman, Jan Kautz, Jane Polak Scowcroft, Jared Casper, Jatin Mitra, Jeffrey Glick, Jenny Chen, Jesse Oliver, Jian Zhang, Jiaqi Zeng, Jie Lou, Jimmy Zhang, Jinhang Choi, Jining Huang, Joey Conway, Joey Guman, John Kamalu, Johnny Greco, Jonathan Cohen, Joseph Jennings, Joyjit Daw, Julien Veron Vialard, Junkeun Yi, Jupinder Parmar, Kai Xu, Kan Zhu, Kari Briski, Katherine Cheung, Katherine Luna, Keith Wyss, Keshav Santhanam, Kevin Shih, Kezhi Kong, Khushi Bhardwaj, Kirthi Shankar, Krishna C. Puvvada, Krzysztof Pawelec, Kumar Anik, Lawrence McAfee, Laya Sleiman, Leon Derczynski, Li Ding, Lizzie Wei, Lucas Liebenwein, Luis Vega, Maanu Grover, Maarten Van Segbroeck, Maer Rodrigues de Melo, Mahdi Nazemi, Makesh Narsimhan Sreedhar, Manoj Kilaru, Maor Ashkenazi, Marc Romeijn, Marcin Chochowski, Mark Cai, Markus Kliegl, Maryam Moosaei, Matt Kulka, Matvei Novikov, Mehrzad Samadi, Melissa Corpuz, Mengru Wang, Meredith Price, Michael Andersch, Michael Boone, Michael Evans, Miguel Martinez, Mikail Khona, Mike Chrzanowski, Minseok Lee, Mohammad Dabbah, Mohammad Shoeybi, Mostofa Patwary, Nabin Mulepati, Najeeb Nabwani, Natalie Hereth, Nave Assaf, Negar Habibi, Neta Zmora, Netanel Haber, Nicola Sessions, Nidhi Bhatia, Nikhil Jukar, Nikki Pope, Nikolai Ludwig, Nima Tajbakhsh, Nir Ailon, Nirmal Juluru, Nishant Sharma, Oleksii Hrinchuk, Oleksii Kuchaiev, Olivier Delalleau, Oluwatobi Olabiyi, Omer Ullman Argov, Omri Puny, Oren Tropp, Ouye Xie, Parth Chadha, Pasha Shamis, Paul Gibbons, Pavlo Molchanov, Pawel Morkisz, Peter Dykas, Peter Jin, Pinky Xu, Piotr Januszewski, Pranav Prashant Thombre, Prasoon Varshney, Pritam Gundecha, Przemek Tredak, Qing Miao, Qiyu Wan, Rabeeh Karimi Mahabadi, Rachit Garg, Ran El-Yaniv, Ran Zilberstein, Rasoul Shafipour, Rich Harang, Rick Izzo, Rima Shahbazyan, Rishabh Garg, Ritika Borkar, Ritu Gala, Riyad Islam, Robert Hesse, Roger Waleffe, Rohit Watve, Roi Koren, Ruoxi Zhang, Russell Hewett, Russell J. Hewett, Ryan Prenger, Ryan Timbrook, Sadegh Mahdavi, Sahil Modi, Samuel Kriman, Sangkug Lim, Sanjay Kariyappa, Sanjeev Satheesh, Saori Kaji, Satish Pasumarthi, Saurav Muralidharan, Sean Narentharen, Sean Narenthiran, Seonmyeong Bak, Sergey Kashirsky, Seth Poulos, Shahar Mor, Shanmugam Ramasamy, Shantanu Acharya, Shaona Ghosh, Sharath Turuvekere Sreenivas, Shelby Thomas, Shiqing Fan, Shreya Gopal, Shrimai Prabhumoye, Shubham Pachori, Shubham Toshniwal, Shuoyang Ding, Siddharth Singh, Simeng Sun, Smita Ithape, Somshubra Majumdar, Soumye Singhal, Stas Sergienko, Stefania Alborghetti, Stephen Ge, Sugam Dipak Devare, Sumeet Kumar Barua, Suseella Panguluri, Suyog Gupta, Sweta Priyadarshi, Syeda Nahida Akter, Tan Bui, Teodor-Dumitru Ene, Terry Kong, Thanh Do, Tijmen Blankevoort, Tim Moon, Tom Balough, Tomer Asida, Tomer Bar Natan, Tomer Ronen, Tugrul Konuk, Twinkle Vashishth, Udi Karpas, Ushnish De, Vahid Noorozi, Vahid Noroozi, Venkat Srinivasan, Venmugil Elango, Victor Cui, Vijay Korthikanti, Vinay Rao, Vitaly Kurin, Vitaly Lavrukhin, Vladimir Anisimov, Wanli Jiang, Wasi Uddin Ahmad, Wei Du, Wei Ping, Wenfei Zhou, Will Jennings, William Zhang, Wojciech Prazuch, Xiaowei Ren, Yashaswi Karnati, Yejin Choi, Yev Meyer, Yi-Fu Wu, Yian Zhang, Yigong Qin, Ying Lin, Yonatan Geifman, Yonggan Fu, Yoshi Subara, Yoshi Suhara, Yubo Gao, Zach Moshe, Zhen Dong, Zhongbo Zhu, Zihan Liu, Zijia Chen, Zijie Yan | cs.CL, cs.AI, cs.LG | 2025-12-24 |
| AXIOM: Benchmarking LLM-as-a-Judge for Code via Rule-Based Perturbation and Multisource Quality Calibration | Ruiqi Wang, Xinchen Wang, Cuiyun Gao, Chun Yong Chong, Xin Xia, Qing Liao | cs.SE, cs.AI | 2025-12-23 |
| CodeSimpleQA: Scaling Factuality in Code Large Language Models | Jian Yang, Wei Zhang, Yizhi Li, Shawn Guo, Haowen Wang, Aishan Liu, Ge Zhang, Zili Wang, Zhoujun Li, Xianglong Liu, Weifeng Lv | cs.CL | 2025-12-22 |
| VIGOR+: Iterative Confounder Generation and Validation via LLM-CEVAE Feedback Loop | JiaWei Zhu, ZiHeng Liu | cs.AI, cs.LG | 2025-12-22 |
| Identifying Features Associated with Bias Against 93 Stigmatized Groups in Language Models and Guardrail Model Safety Mitigation | Anna-Maria Gueorguieva, Aylin Caliskan | cs.CL, cs.AI, cs.LG | 2025-12-22 |
| Watch Closely: Mitigating Object Hallucinations in Large Vision-Language Models with Disentangled Decoding | Ruiqi Ma, Yu Yan, Chunhong Zhang, Minghao Yin, XinChao Liu, Zhihong Jin, Zheng Hu | cs.CV, cs.CL | 2025-12-22 |
| MemEvolve: Meta-Evolution of Agent Memory Systems | Guibin Zhang, Haotian Ren, Chong Zhan, Zhenhong Zhou, Junhao Wang, He Zhu, Wangchunshu Zhou, Shuicheng Yan | cs.CL, cs.MA | 2025-12-21 |
| LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators | Mateusz Lango, Ondřej Dušek | cs.CL, cs.AI | 2025-12-20 |
| Inflation Attitudes of Large Language Models | Nikoleta Anesti, Edward Hill, Andreas Joseph | cs.CL, econ.EM | 2025-12-16 |
| Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance | Mohammadreza Molavi, Mohammad Moein, Mohammadreza Tavakoli, Abdolali Faraji, Stefan T. Mol, Gábor Kismihók | cs.CY, cs.AI | 2025-12-15 |
| MineTheGap: Automatic Mining of Biases in Text-to-Image Models | Noa Cohen, Nurit Spingarn-Eliezer, Inbar Huberman-Spiegelglas, Tomer Michaeli | cs.CV, cs.LG | 2025-12-15 |
| Pre-review to Peer review: Pitfalls of Automating Reviews using Large Language Models | Akhil Pandey Akella, Harish Varma Siravuri, Shaurya Rohatgi | cs.DL, cs.AI, cs.CY | 2025-12-14 |
| HyperEdit: Unlocking Instruction-based Text Editing in LLMs via Hypernetworks | Yiming Zeng, Jinghan Cao, Zexin Li, Wanhao Yu, Zhankai Ye, Dawei Xiang, Ting Hua, Xin Liu, Shangqian Gao, Tingting Yu | cs.CL, cs.LG | 2025-12-14 |
| Beyond the Black Box: Identifiable Interpretation and Control in Generative Models via Causal Minimality | Lingjing Kong, Shaoan Xie, Guangyi Chen, Yuewen Sun, Xiangchen Song, Eric P. Xing, Kun Zhang | cs.LG | 2025-12-11 |
| LLM-Auction: Generative Auction towards LLM-Native Advertising | Chujie Zhao, Qun Hu, Shiping Song, Dagui Chen, Han Zhu, Jian Xu, Bo Zheng | cs.GT, cs.AI, cs.LG | 2025-12-11 |
| Semantic Reconstruction of Adversarial Plagiarism: A Context-Aware Framework for Detecting and Restoring “Tortured Phrases” in Scientific Literature | Agniva Maiti, Prajwal Panth, Suresh Chandra Satapathy | cs.CL | 2025-12-11 |
| INFORM-CT: INtegrating LLMs and VLMs FOR Incidental Findings Management in Abdominal CT | Idan Tankel, Nir Mazor, Rafi Brada, Christina LeBedis, Guy ben-Yosef | cs.LG, cs.AI, cs.CV, eess.IV | 2025-12-10 |
| PARAN: Persona-Augmented Review ANswering system on Food Delivery Review Dataset | Moonsoo Park, Jeongseok Yun, Bohyung Kim | cs.CL, cs.AI | 2025-12-10 |
| Generate-Then-Validate: A Novel Question Generation Approach Using Small Language Models | Yumou Wei, John Stamper, Paulo F. Carvalho | cs.CL, cs.HC | 2025-12-10 |
| Local LLM Ensembles for Zero-shot Portuguese Named Entity Recognition | João Lucas Luz Lima Sarcinelli, Diego Furtado Silva | cs.LG | 2025-12-10 |
| Can LLMs Evaluate What They Cannot Annotate? Revisiting LLM Reliability in Hate Speech Detection | Paloma Piot, David Otero, Patricia Martín-Rodilla, Javier Parapar | cs.CL, cs.AI | 2025-12-10 |
| ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation | Boyin Yang, Puming Jiang, Per Ola Kristensson | cs.HC, cs.AI, cs.CV | 2025-12-10 |
| Large Language Models for Education and Research: An Empirical and User Survey-based Analysis | Md Mostafizer Rahman, Ariful Islam Shiplu, Md Faizul Ibne Amin, Yutaka Watanobe, Lu Peng | cs.AI | 2025-12-08 |
| MINES: Explainable Anomaly Detection through Web API Invariant Inference | Wenjie Zhang, Yun Lin, Chun Fung Amos Kwok, Xiwen Teoh, Xiaofei Xie, Frank Liauw, Hongyu Zhang, Jin Song Dong | cs.SE, cs.CR, cs.DB, cs.LG | 2025-12-07 |
| LLM as a Neural Architect: Controlled Generation of Image Captioning Models Under Strict API Contracts | Krunal Jesani, Dmitry Ignatov, Radu Timofte | cs.LG, cs.AI, cs.CL, cs.CV | 2025-12-07 |
| Faithfulness metric fusion: Improving the evaluation of LLM trustworthiness across domains | Ben Malin, Tatiana Kalganova, Nikolaos Boulgouris | cs.CL, cs.AI | 2025-12-05 |
| Decoding the Black Box: Discerning AI Rhetorics About and Through Poetic Prompting | P. D. Edgar, Alia Hall | cs.CL, cs.CY | 2025-12-04 |
| Automatic Attack Discovery for Few-Shot Class-Incremental Learning via Large Language Models | Haidong Kang, Wei Wu, Hanling Wang | cs.LG | 2025-12-03 |
| LLM-Generated Ads: From Personalization Parity to Persuasion Superiority | Elyas Meguellati, Stefano Civelli, Lei Han, Abraham Bernstein, Shazia Sadiq, Gianluca Demartini | cs.CY, cs.CL | 2025-12-03 |
| ASCIIBench: Evaluating Language-Model-Based Understanding of Visually-Oriented Text | Kerry Luo, Michael Fu, Joshua Peguero, Husnain Malik, Anvay Patil, Joyce Lin, Megan Van Overborg, Ryan Sarmiento, Kevin Zhu | cs.LG | 2025-12-02 |
| PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models | Robert Belanec, Ivan Srba, Maria Bielikova | cs.CL | 2025-12-02 |
| DialogGuard: Multi-Agent Psychosocial Safety Evaluation of Sensitive LLM Responses | Han Luo, Guy Laban | cs.AI, cs.HC, cs.MA | 2025-12-01 |
| InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages | Mamadou K. Keita, Sebastien Diarra, Christopher Homan, Seydou Diallo | cs.LG | 2025-12-01 |
| First, do NOHARM: towards clinically safe large language models | David Wu, Fateme Nateghi Haredasht, Saloni Kumar Maharaj, Priyank Jain, Jessica Tran, Matthew Gwiazdon, Arjun Rustagi, Jenelle Jindal, Jacob M. Koshy, Vinay Kadiyala, Anup Agarwal, Bassman Tappuni, Brianna French, Sirus Jesudasen, Christopher V. Cosgriff, Rebanta Chakraborty, Jillian Caldwell, Susan Ziolkowski, David J. Iberri, Robert Diep, Rahul S. Dalal, Kira L. Newman, Kristin Galetta, J. Carl Pallais, Nancy Wei, Kathleen M. Buchheit, David I. Hong, Ernest Y. Lee, Allen Shih, Vartan Pahalyants, Tamara B. Kaplan, Vishnu Ravi, Sarita Khemani, April S. Liang, Daniel Shirvani, Advait Patil, Nicholas Marshall, Kanav Chopra, Joel Koh, Adi Badhwar, Liam G. McCoy, David J. H. Wu, Yingjie Weng, Sumant Ranji, Kevin Schulman, Nigam H. Shah, Jason Hom, Arnold Milstein, Adam Rodman, Jonathan H. Chen, Ethan Goh | cs.CY, cs.AI | 2025-12-01 |
| Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics | Jinu Lee, Kyoung-Woon On, Simeng Han, Arman Cohan, Julia Hockenmaier | cs.AI, cs.CL | 2025-11-30 |