Lu Wang, CSE, University of Michigan

Value-Conflict Diagnostics Reveal Widespread Alignment Faking in Language Models

Inderjeet Nair, Jie Ruan, and Lu Wang

Preprint Paper arXiv:2604.20995, 2026.

Think Through Uncertainty: Improving Long-Form Generation Factuality via Reasoning Calibration

Xin Liu and Lu Wang

Preprint Paper arXiv:2604.12046, 2026.

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Sungryull Sohn, Yunxiang Zhang, Moontae Lee, Hao Peng, Lu Wang*, and Honglak Lee*

Preprint Paper arXiv:2601.14691, 2026.

Countdown-Code: A Testbed for Studying The Emergence and Generalization of Reward Hacking in RLVR

Muhammad Khalifa, Zohaib Khan, Omer Tafveez, Hao Peng, and Lu Wang

Preprint Paper arXiv:2603.07084, 2026.

TSUBASA: Improving Long-Horizon Personalization via Evolving Memory and Self-Learning with Context Distillation

Xinliang Frederick Zhang and Lu Wang

Preprint Paper arXiv:2604.07894, 2026.

Do LLMs Really Need 10+ Thoughts for "Find the Time 1000 Days Later"? Towards Structural Understanding of LLM Overthinking

Xinliang Frederick Zhang, Anhad Mohananey, Alexandra Chronopoulou, Pinelopi Papalampidi, Somit Gupta, Tsendsuren Munkhdalai, Lu Wang, and Shyam Upadhyay

Conference Paper Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2026.

CASPER in the Machine: Insights into Character Variety in LLM-Generated Stories

Anneliese Brei, Abhisheik Sharma, Nicholas Sanaie, Lu Wang, and Snigdha Chaturvedi

Conference Paper Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2026.

Skill-Aware Data Selection and Fine-Tuning for Data-Efficient Reasoning Distillation

Lechen Zhang, Yunxiang Zhang, Wei Hu, and Lu Wang

Conference Paper Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2026.

Logit Arithmetic Elicits Long Reasoning Capabilities Without Training

Yunxiang Zhang, Muhammad Khalifa, Lechen Zhang, Xin Liu, Ayoung Lee, Xinliang Frederick Zhang, Farima Fatahi Bayat, and Lu Wang

Conference Paper Findings of the Association for Computational Linguistics (Findings of ACL), 2026.

Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths

Inderjeet Nair and Lu Wang

Conference Paper Findings of the Association for Computational Linguistics (Findings of ACL), 2026.

Process Reward Models That Think

Muhammad Khalifa, Rishabh Agarwal, Lajanugen Logeswaran, Jaekyeom Kim, Hao Peng, Moontae Lee, Honglak Lee, and Lu Wang

Journal Paper Transactions on Machine Learning Research (TMLR), 2026.

ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists

Jie Ruan, Inderjeet Nair, Shuyang Cao, Amy Liu, Sheza Munir, Micah Pollens-Dempsey, Tiffany Chiang, Lucy Kates, Nicholas David, Sihan Chen, Ruxin Yang, Yuqian Yang, Jasmine Gump, Tessa Bialek, Vivek Sankaran, Margo Schlanger, and Lu Wang

Conference Paper Proceedings of the International Conference on Learning Representations (ICLR), 2026.

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Ayoung Lee, Ryan Sungmo Kwon, Peter Railton, and Lu Wang

Conference Paper Proceedings of the International Conference on Learning Representations (ICLR), 2026.

LLMs as Rules Oracles: Exploring Real-World Multimodal Reasoning in Tabletop Strategy Game Environments

Joseph J. Peper, Sai Krishna Gandra, Yunxiang Zhang, Vaibhav Chennareddy, Shloki Jha, Ali Payani, and Lu Wang

Conference Paper Proceedings of the International Conference on Learning Representations (ICLR), 2026.

LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?

Kaijian Zou, Aaron Xiong, Yunxiang Zhang, Frederick Zhang, Yueqi Ren, Jirong Yang, Ayoung Lee, Shitanshu Bhushan, and Lu Wang

Preprint Paper arXiv:2510.09595, 2025.

Evaluation Framework for AI Systems in "the Wild"

Sarah Jabbour*, Trenton Chang*, Anindya Das Antar*, Joseph Peper, Insu Jang, Jiachen Liu, Jae-Won Chung, Shiqi He, Michael Wellman, Bryan Goodman, Elizabeth Bondi-Kelly, Kevin Samy, Rada Mihalcea, Mosharaf Chowdhury, David Jurgens*, and Lu Wang*

Preprint Paper arXiv:2504.16778, 2025.

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

Muhammad Khalifa, Yi-Chern Tan, Arash Ahmadian, Tom Hosking, Honglak Lee, Lu Wang, Ahmet Üstün, Tom Sherborne, and Matthias Gallé

Preprint Paper arXiv:2412.04144, 2025.

Evaluating the Retrieval Robustness of Large Language Models

Shuyang Cao, Karthik Radhakrishnan, David Rosenberg, Steven Lu, Pengxiang Cheng, Lu Wang, and Shiyue Zhang

Preprint Paper arXiv:2505.21870, 2025.

MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?

Yunxiang Zhang, Muhammad Khalifa, Shitanshu Bhushan, Grant D Murphy, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, and Lu Wang

Conference Paper Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS), Datasets & Benchmarks Track, 2025.