Lu Wang, CSE, University of Michigan

ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists

Jie Ruan, Inderjeet Nair, Shuyang Cao, Amy Liu, Sheza Munir, Micah Pollens-Dempsey, Tiffany Chiang, Lucy Kates, Nicholas David, Sihan Chen, Ruxin Yang, Yuqian Yang, Jasmine Gump, Tessa Bialek, Vivek Sankaran, Margo Schlanger, and Lu Wang

Preprint Paper arXiv:2506.01241, 2025.

VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts

Xin Liu, Lechen Zhang, Sheza Munir, Yiyang Gu, and Lu Wang

Preprint Paper arXiv:2505.09701, 2025.

Process Reward Models That Think

Muhammad Khalifa, Rishabh Agarwal, Lajanugen Logeswaran, Jaekyeom Kim, Hao Peng, Moontae Lee, Honglak Lee, and Lu Wang

Preprint Paper arXiv:2504.16828, 2025.

Logit Arithmetic Elicits Long Reasoning Capabilities Without Training

Yunxiang Zhang, Muhammad Khalifa, Lechen Zhang, Xin Liu, Ayoung Lee, Xinliang Frederick Zhang, Farima Fatahi Bayat, and Lu Wang

Preprint Paper arXiv:2507.12759, 2025.

MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?

Yunxiang Zhang, Muhammad Khalifa, Shitanshu Bhushan, Grant D Murphy, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, and Lu Wang

Preprint Paper arXiv:2504.09702, 2025.

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Ayoung Lee, Ryan Sungmo Kwon, Peter Railton, and Lu Wang

Preprint Paper arXiv:2504.10823, 2025.

Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths

Inderjeet Nair and Lu Wang

Preprint Paper arXiv:2506.02481, 2025.

PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes

Xinliang Frederick Zhang, Nick Beauchamp, and Lu Wang

Preprint Paper arXiv:2507.04607, 2025.

Answer Convergence as a Signal for Early Stopping in Reasoning

Xin Liu and Lu Wang

Preprint Paper arXiv:2506.02536, 2025.

Evaluation Framework for AI Systems in "the Wild"

Sarah Jabbour, Trenton Chang, Anindya Das Antar, Joseph Peper, Insu Jang, Jiachen Liu, Jae-Won Chung, Shiqi He, Michael Wellman, Bryan Goodman, Elizabeth Bondi-Kelly, Kevin Samy, Rada Mihalcea, Mosharaf Chowdhury, David Jurgens, and Lu Wang

Preprint Paper arXiv:2504.16778, 2025.

Structured Moral Reasoning in Language Models: A Value-Grounded Evaluation Framework

Mohna Chakraborty, Lu Wang, and David Jurgens

Preprint Paper arXiv:2506.14948, 2025.

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

Muhammad Khalifa, Yi-Chern Tan, Arash Ahmadian, Tom Hosking, Honglak Lee, Lu Wang, Ahmet Üstün, Tom Sherborne, and Matthias Gallé

Preprint Paper arXiv:2412.04144, 2025.

Evaluating the Retrieval Robustness of Large Language Models

Shuyang Cao, Karthik Radhakrishnan, David Rosenberg, Steven Lu, Pengxiang Cheng, Lu Wang, and Shiyue Zhang

Preprint Paper arXiv:2505.21870, 2025.

Unstructured Evidence Attribution for Long Context Query Focused Summarization

Dustin Wright, Zain Muhammad Mujahid, Lu Wang, Isabelle Augenstein, and David Jurgens

Preprint Paper arXiv:2502.14409, 2025.

FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation

Farima Fatahi Bayat, Lechen Zhang, Sheza Munir, and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2025.

On Many-Shot In-Context Learning for Long-Context Evaluation

Kaijian Zou, Muhammad Khalifa, and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2025.

Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets

Dongyue Li, Ziniu Zhang, Lu Wang, and Hongyang Zhang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2025.

MDBench: A Synthetic Multi-Document Reasoning Benchmark Generated with Knowledge Guidance

Joseph J Peper, Wenzhao Qiu, Ali Payani, and Lu Wang

Conference Paper Findings of the Annual Meeting of the Association for Computational Linguistics (Findings of ACL), 2025.

Evaluating Design Choices in Verifiable Generation with Open-source Models

Shuyang Cao and Lu Wang

Workshop Paper NAACL Workshop on Trustworthy NLP (TrustNLP), 2025.

Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding

Xin Liu, Farima Fatahi Bayat, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.

Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions

Inderjeet Nair, Jiaye Tan, Xiaotian Su, Anne Gere, Xu Wang, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.

Narrative-of-Thought: Improving Temporal Reasoning of Large Language Models via Recounted Narratives

Xinliang Frederick Zhang, Nick Beauchamp, and Lu Wang

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2024.

Shoes-ACOSI: A Dataset for Aspect-Based Sentiment Analysis with Implicit Opinion Extraction

Joseph Peper, Wenzhao Qiu, Ryan Bruggeman, Yi Han, Estefania Chehade, and Lu Wang

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2024.

Scalable Fine-tuning from Multiple Data Sources: A First-Order Approximation Approach

Dongyue Li, Ziniu Zhang, Lu Wang, and Hongyang Zhang

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2024.

Source-Aware Training Enables Knowledge Attribution in Language Models

Muhammad Khalifa, David Wadden, Emma Strubell, Honglak Lee, Lu Wang, Iz Beltagy, and Hao Peng

Conference Paper Conference on Language Modeling (COLM), 2024.

MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning

Inderjeet Nair and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2024. Area Chair Award.

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, and Lu Wang

Conference Paper Findings of the Annual Meeting of the Association for Computational Linguistics (Findings of ACL), 2024.

Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression

Farima Fatahi Bayat, Xin Liu, H. V. Jagadish, and Lu Wang

Conference Paper Findings of the Annual Meeting of the Association for Computational Linguistics (Findings of ACL), 2024.

Verifiable Generation with Subsentence-Level Fine-Grained Citations

Shuyang Cao and Lu Wang

Conference Paper Findings of the Annual Meeting of the Association for Computational Linguistics (Findings of ACL), 2024.

LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses

Xin Liu, Muhammad Khalifa, and Lu Wang

Conference Paper Proceedings of the International Conference on Learning Representations (ICLR), 2024.

MOKA: Moral Knowledge Augmentation for Moral Event Extraction

Xinliang Frederick Zhang, Winston Wu, Nick Beauchamp, and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024.

PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization

Joseph J. Peper, Wenzhao Qiu, and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024.

AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content

Shuyang Cao and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024.

Analyzing Occupational Distribution Representation in Japanese Language Models

Katsumi Ibaraki, Winston Wu, Lu Wang, and Rada Mihalcea

Conference Paper Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024.

Merging Generated and Retrieved Knowledge for Open-Domain QA

Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

All Things Considered: Detecting Partisan Events from News Media with Cross-Article Comparison

Yujian Liu, Xinliang Frederick Zhang, Kaijian Zou, Ruihong Huang, Nick Beauchamp, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Cross-Cultural Analysis of Human Values, Morals, and Biases in Folk Tales

Winston Wu, Lu Wang, and Rada Mihalcea

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

GRACE: Discriminator-Guided Chain-of-Thought Reasoning

Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, and Lu Wang

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2023.

Crossing the Aisle: Unveiling Partisan and Counter-Partisan Events in News Reporting

Kaijian Zou, Xinliang Frederick Zhang, Winston Wu, Nick Beauchamp, and Lu Wang

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2023.

You Are What You Annotate: Towards Better Models through Annotator Representations

Naihao Deng, Xinliang Frederick Zhang, Siyang Liu, Winston Wu, Lu Wang, and Rada Mihalcea

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2023.

Few-shot Reranking for Multi-hop QA via Language Model Prompting

Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2023.

BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases

Xin Liu, Muhammad Khalifa, and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2023.

General then Personal: Decoupling and Pre-training for Personalized Headline Generation

Yun-Zhu Song, Yi-Syuan Chen, Lu Wang, and Hong-Han Shuai

Journal Paper Transactions of the Association for Computational Linguistics (TACL), 2023.

Exploring Demonstration Ensembling for In-context Learning

Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, and Lu Wang

Workshop Paper ICLR Workshop on Mathematical and Empirical Understanding of Foundation Models (ME-FoMo), 2023.

Word Category Arcs in Literature Across Languages and Genres

Winston Wu, Lu Wang, and Rada Mihalcea

Workshop Paper ACL Workshop on Narrative Understanding (WNU), 2023.

ReadingQuizMaker: A Human-NLP Collaborative System to Support Instructors Design High Quality Reading Quiz Questions

Xinyi Lu, Simin Fan, Jessica Houghton, Lu Wang, and Xu Wang

Conference Paper Proceedings of the ACM CHI Conference on Human Factors in Computing Systems (CHI), 2023. Best Paper Honorable Mention Award.

Generative Entity-to-Entity Stance Detection with Knowledge Graph Augmentation

Xinliang Frederick Zhang, Nick Beauchamp, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.

Late Fusion with Triplet Margin Objective for Multimodal Ideology Prediction and Analysis

Changyuan Qiu, Winston Wu, Xinliang Frederick Zhang, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022. [Equal contribution by the first two authors]

Sentence-level Media Bias Analysis Informed by Discourse Structures

Yuanyuan Lei, Ruihong Huang, Lu Wang, and Nick Beauchamp

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.

Time-aware Prompting for Text Generation

Shuyang Cao and Lu Wang

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2022.

Generative Aspect-Based Sentiment Analysis with Contrastive Learning and Expressive Structure

Joseph J. Peper and Lu Wang

Conference Paper Findings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2022.

HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization

Shuyang Cao and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2022.

Efficient Argument Structure Extraction with Transfer Learning and Active Learning

Xinyu Hua and Lu Wang

Conference Paper Findings of the Association for Computational Linguistics (Findings of ACL), 2022.

Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs

Xu Wang, Simin Fan, Jessica Houghton, and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.

POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection

Yujian Liu, Xinliang Frederick Zhang, David Wegsman, Nick Beauchamp, and Lu Wang

Conference Paper Findings of the Conference of the North American Chapter of the Association for Computational Linguistics (Findings of NAACL), 2022. [Equal contribution by the first two authors]

CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization

Shuyang Cao and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.

Controllable Summarization with Constrained Markov Decision Process

Hou Pong Chan, Lu Wang, and Irwin King

Journal Paper Transactions of the Association for Computational Linguistics (TACL), 2021.

Controllable Open-ended Question Generation with A New Question Type Ontology

Shuyang Cao and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2021.

DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text Generation

Xinyu Hua, Ashwin Sreevatsa, and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2021.

Efficient Attentions for Long Document Summarization

Luyang Huang, Shuyang Cao, Nikolaus Parulian, Heng Ji, and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021.

Inference Time Style Control for Summarization

Shuyang Cao and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), short paper, 2021.

Attention Head Masking for Inference Time Content Selection in Abstractive Summarization

Shuyang Cao and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), short paper, 2021.

Learning to Segment Actions from Visual and Language Instructions via Differentiable Weak Sequence Alignment

Yuhan Shen, Lu Wang, and Ehsan Elhamifar

Conference Paper Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), oral presentation, 2021.

PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation

Xinyu Hua and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.

Modeling Content Importance for Summarization with Pre-trained Language Models

Liqiang Xiao, Lu Wang, Hao He, and Yaohui Jin

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), short paper, 2020.

Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward

Luyang Huang, Lingfei Wu, and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2020.

Discourse as a Function of Event: Profiling Discourse Structure in News Articles around the Main Event

Prafulla Kumar Choubey, Aaron Lee, Ruihong Huang, and Lu Wang

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2020.

Dynamic Online Conversation Recommendation

Xingshan Zeng, Jing Li, Lu Wang, Zhiming Mao, and Kam-Fai Wong

Conference Paper Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2020.

XREF: Entity Linking for Chinese News Comments with Supplementary Article Reference

Xinyu Hua, Lei Li, Lifeng Hua, and Lu Wang

Conference Paper Proceedings of Conference on Automated Knowledge Base Construction (AKBC), 2020.

Copy or Rewrite: Hybrid Summarization with Hierarchical Reinforcement Learning

Liqiang Xiao, Lu Wang, Hao He, and Yaohui Jin

Conference Paper Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2020.

DebateVis: Visualizing Political Debates for Non-Expert Users

Laura South, Michail Schwab, Nick Beauchamp, Lu Wang, John Wihbey, and Michelle A. Borkin

Conference Paper Proceedings of IEEE Visualization Conference (VIS), short paper, 2020.

An Entity-Driven Framework for Abstractive Summarization

Eva Sharma, Luyang Huang, Zhe Hu, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019. [Equal contribution by the first three authors]

Sentence-Level Content Planning and Style Specification for Neural Text Generation

Xinyu Hua and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.

In Plain Sight: Media Bias through the Lens of Factual Reporting

Lisa Fan, Marshall White, Eva Sharma, Ruisi Su, Prafulla Kumar Choubey, Ruihong Huang, and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), short paper, 2019. [Equal contribution by the first two authors]

Neural Conversation Recommendation with Online Interaction Modeling

Xingshan Zeng, Jing Li, Lu Wang, and Kam-Fai Wong

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.

Argument Generation with Retrieval, Planning, and Realization

Xinyu Hua, Zhe Hu, and Lu Wang

Conference Paper Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019.

BIGPATENT: A Large-Scale Dataset for Abstractive and Coherent Summarization

Eva Sharma, Chen Li, and Lu Wang

Conference Paper Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2019.

Jointly Learning Semantic Parser and Natural Language Generator via Dual Information Maximization

Hai Ye, Wenjie Li, and Lu Wang

Conference Paper Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019.

Neural Keyphrase Generation via Reinforcement Learning with Adaptive Rewards

Hou Pong Chan, Wang Chen, Lu Wang, and Irwin King

Conference Paper Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019.

Joint Effects of Context and User History for Predicting Online Conversation Re-entries

Xingshan Zeng, Jing Li, Lu Wang, and Kam-Fai Wong

Conference Paper Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019.

Argument Mining for Understanding Peer Reviews

Xinyu Hua, Mitko Nikolov, Nikhil Badugu, and Lu Wang

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), short paper, 2019.

Robust Neural Abstractive Summarization Systems and Evaluation against Adversarial Information

Lisa Fan, Dong Yu, and Lu Wang

Workshop Paper NeurIPS Workshop on Interpretability and Robustness in Audio, Speech, and Language (IRASL), 2018.

Semi-Supervised Learning for Neural Keyphrase Generation

Hai Ye and Lu Wang

Conference Paper Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018.

Neural Argument Generation Augmented with Externally Retrieved Evidence

Xinyu Hua and Lu Wang

Conference Paper Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), 2018.

Microblog Conversation Recommendation via Joint Modeling of Topics and Discourse

Xingshan Zeng, Jing Li, Lu Wang, Nick Beauchamp, Sarah Shugars, and Kam-Fai Wong

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018.

Joint Modeling of Content and Discourse Relations in Dialogues

Kechen Qin, Lu Wang, and Joseph Kim

Conference Paper Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), 2017.

Understanding and Detecting Supporting Arguments of Diverse Types

Xinyu Hua and Lu Wang

Conference Paper Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2017. ACL Outstanding Paper Award.

Winning on the Merits: The Joint Effects of Content and Style on Debate Outcomes

Lu Wang, Nick Beauchamp, Sarah Shugars, and Kechen Qin

Journal Paper Transactions of the Association for Computational Linguistics (TACL), 2017.

A Pilot Study of Domain Adaptation Effect for Neural Abstractive Summarization

Xinyu Hua and Lu Wang

Workshop Paper Proceedings of the EMNLP Workshop on New Frontiers in Summarization, 2017.

Weakly-Guided User Stance Prediction via Joint Modeling of Content and Social Interaction

Rui Dong, Yizhou Sun, Lu Wang, Yupeng Gu, and Yuan Zhong

Conference Paper Proceedings of International Conference on Information and Knowledge Management (CIKM), 2017.

Neural Network-Based Abstract Generation for Opinions and Arguments

Lu Wang and Wang Ling

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016.

Summarization and Sentiment Analysis for Understanding Socially-Generated Content

Lu Wang

Thesis Ph.D. Thesis, Cornell University, February 2016.

Socially-Informed Timeline Generation for Complex Events

Lu Wang, Claire Cardie, and Galen Marchetti

Conference Paper Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2015.

Query-Focused Opinion Summarization for User-Generated Content

Lu Wang, Hema Raghavan, Claire Cardie, and Vittorio Castelli

Conference Paper Proceedings of the 25th International Conference on Computational Linguistics (COLING), 2014.

A Piece of My Mind: A Sentiment Analysis Approach for Online Dispute Detection

Lu Wang and Claire Cardie

Conference Paper Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL), short paper, 2014.

Improving Agreement and Disagreement Identification in Online Discussions with A Socially-Tuned Sentiment Lexicon

Lu Wang and Claire Cardie

Workshop Paper Proceedings of the ACL Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2014.

Leveraging Semantic Web Search and Browse Sessions for Multi-Turn Spoken Dialog Systems

Lu Wang, Larry Heck, and Dilek Hakkani-Tur

Conference Paper Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014. One of the Two Award Papers of Spoken Language Processing Student Travel Award.

CornPittMich Sentiment Slot-Filling System at TAC 2014

Xilun Chen, Arzoo Katiyar, Xiaoan Yan, Lu Wang, Carmen Banea, Yoonjung Choi, Lingjia Deng, Claire Cardie, Rada Mihalcea, and Janyce Wiebe

Non-refereed Publication Proceedings of the TAC-KBP 2014 Workshop, 2014. Won Second Place in Sentiment Slot-Filling Track.

Cornell Expert Aided Query-focused Summarization (CEAQS): A Summarization Framework to PoliInformatics

Lu Wang, Parvaz Mahdabi, Joonsuk Park, Dinesh Puranam, Bishan Yang, and Claire Cardie

Non-refereed Publication NLP Unshared Task in PoliInformatics 2014.

A Sentence Compression Based Framework to Query-Focused Multi-Document Summarization

Lu Wang, Hema Raghavan, Vittorio Castelli, Radu Florian, and Claire Cardie

Conference Paper Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL), 2013.

Domain-Independent Abstract Generation for Focused Meeting Summarization

Lu Wang and Claire Cardie

Conference Paper Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL), 2013.

Unsupervised Topic Modeling Approaches to Decision Summarization in Spoken Meetings

Lu Wang and Claire Cardie

Conference Paper Proceedings of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2012. Best Paper Nomination.

Focused Meeting Summarization via Unsupervised Relation Extraction

Lu Wang and Claire Cardie

Conference Paper Proceedings of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2012.

Summarizing Decisions in Spoken Meetings

Lu Wang and Claire Cardie

Workshop Paper Proceedings of the ACL Workshop on Automatic Summarization for Different Genres, Media, and Languages, 2011.
