Aiwei Liu (刘瑷玮)

About Me

Hi! My name is Aiwei Liu (刘瑷玮). I am currently a researcher at WeChat AI, Tencent, working on Large Language Foundation Model.

I received my Ph.D. degree from the School of Software in Tsinghua University in June 2025, where I was advised by Prof. Lijie Wen. Before that, I received my B.Eng. degree from the Software Institue in Nanjing University in 2020.

Previously, I was fortunate to serve as a Visiting Scholar at the UIC BDSC Lab, working under the supervision of Prof. Philip S. Yu (ACM Fellow, IEEE Fellow).

I was also a research intern at Apple's AIML Group, where I worked under the supervision of Dr. Meng Cao.

Additionally, I was serving as a Visiting Scholar at CUHK MISC Lab, working under the supervision of Prof. Irwin King (ACM Fellow, IEEE Fellow).

Open to Collaboration: If you're interested in research collaboration on LLM watermarking, LLM alignment, LLM pretraining or other LLM topics, please feel free to contact me.

Education

Tsinghua University, China

Ph.D. in Software Engineering

Sept. 2020 – June 2025

Nanjing University, China

B.E. in Software Engineering

Sept. 2016 – June 2020

News

2025.12

Released WeDLM - The fastest diffusion language model, capable of outperforming vLLM-optimized autoregressive baselines.

2025.08

Two papers are accepted by EMNLP 2025.

2025.06

Honored to be awarded the Outstanding Graduates of Beijing.

2025.05

Invited as a reviewer for EMNLP 2025 and NeurIPS 2025.

2025.05

One paper accepted by ACL 2025, two papers accepted by ACL 2025 Findings.

2025.01

Two papers accepted by NAACL 2025, three papers accepted by ICLR 2025.

2024.10

MarkLLM is accepted by EMNLP 2024 Demo Track.

2024.08

"A Survey of Text Watermarking in the Era of LLMs" accepted by ACM Computing Surveys.

2024.05

Three papers accepted by ACL 2024 (Alignment & Watermarking).

2024.04

Tutorial "Preventing and Detecting Misinformation Generated by LLMs" accepted by SIGIR 2024.

Research Highlights

WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

Aiwei Liu, Minghua He, Shaoxun Zeng, Linhao Zhang, Chuhan Wu, Wei Jia, Yuan Liu, Yang Yu, Xiao Zhou, Jie Zhou

Preprint Fastest DLM

Paper Code Project Models

MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models

Leyi Pan, Sheng Guan, Zheyu Fu, Luyang Si, Zian Wang, Xuming Hu, Irwin King, Philip S. Yu, Aiwei Liu, Lijie Wen

Preprint

Paper Code

Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?

Leyi Pan, Aiwei Liu, Shiyu Huang, Yijian Lu, Xuming Hu, Lijie Wen, Irwin King, Philip S. Yu

Proceedings of ACL 2025

Paper Code

Can Watermarked LLMs be Identified by Users via Crafted Prompts?

Aiwei Liu, Sheng Guan, Yiming Liu, Leyi Pan, Yifei Zhang, Liancheng Fang, Lijie Wen, Philip S. Yu, Xuming Hu

Proceedings of ICLR 2025 Spotlight, top 5%

Paper Code Poster

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

Aiwei Liu, Haoping Bai, Zhiyun Lu, Yanchao Sun, Xiang Kong, Simon Wang, Jiulong Shan, Albin Madappally Jose, Xiaojiang Liu, Lijie Wen, Philip S. Yu, Meng Cao

Proceedings of ICLR 2025

Paper Code

MarkLLM: An Open-Source Toolkit for LLM Watermarking

Leyi Pan, Aiwei Liu*(Project Lead), Zhiwei He, Zitian Gao, Xuandong Zhao, Yijian Lu, Binglin Zhou, Shuliang Liu, Xuming Hu, Lijie Wen, Irwin King, Philip S. Yu

Proceedings of EMNLP 2024 Demo

Paper Code 机器之心

A Survey of Text Watermarking in the Era of Large Language Models

Aiwei Liu, Leyi Pan, Yijian Lu, Jingjing Li, Xuming Hu, Xi Zhang, Lijie Wen, Irwin King, Hui Xiong, Philip S. Yu

ACM Computing Surveys IF: 23.8

Paper Home

Preventing and Detecting Misinformation Generated by Large Language Models

Aiwei Liu, Qiang Sheng, Xuming Hu

Proceedings of SIGIR 2024 Tutorial

Paper Home Slides

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen

Proceedings of ACL 2024

Paper Code

A Semantic Invariant Robust Watermark for Large Language Models

Aiwei Liu, Leyi Pan, Xuming Hu, Shiao Meng, Lijie Wen

Proceedings of ICLR 2024

Paper Code

An Unforgeable Publicly Verifiable Watermark for Large Language Models

Aiwei Liu, Leyi Pan, Xuming Hu, Shuang Li, Lijie Wen, Irwin King, Philip S. Yu

Proceedings of ICLR 2024

Paper Code

Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing

Aiwei Liu, Wei Liu, Xuming Hu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen

Findings of ACL 2023

Paper Code

Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution

Aiwei Liu, Honghai Yu, Xuming Hu, Shu'ang Li, Li Lin, Fukun Ma, Yawen Yang, Lijie Wen

Proceedings of EMNLP 2022

Paper Code

Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph

Aiwei Liu, Xuming Hu, Li Lin, Lijie Wen

Proceedings of SIGKDD 2022

Paper Code

View Full Publication List →

Publications

Conference Papers

Exploring Response Uncertainty in MLLMs: An Empirical Evaluation Under Misleading Scenarios

Yunkai Dang, Mengxi Gao, Yibo Yan, Xin Zou, Yanggan Gu, Aiwei Liu, Xuming Hu

Proceedings of EMNLP 2025 [Paper] [Code]

VLA-Mark: A cross modal watermark for large vision-language alignment model

Shuliang Liu, Qi Zheng, Jesse Jiaxi Xu, Yibo Yan, He Geng, Aiwei Liu, Peijie Jiang, Jia Liu, Yik-Cheung Tam, Xuming Hu

Proceedings of EMNLP 2025 [Paper] [Code]

TabGen-ICL: Residual-Aware In-Context Example Selection for Tabular Data Generation

Liancheng Fang, Aiwei Liu, Hengrui Zhang, Henry Peng Zou, Weizhi Zhang, Philip S. Yu

Findings of ACL 2025 [Paper] [Code]

A Survey on Proactive Defense Strategies Against Misinformation in LLMs

Shuliang Liu, Hongyi Liu, Aiwei Liu, Duan Bingchen, Zheng Qi, Yibo Yan, He Geng, Peijie Jiang, Jia Liu, Xuming Hu

Findings of ACL 2025 [Paper]

Mitigating Modality Prior-induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality

Guanyu Zhou, Yibo Yan, Xin Zou, Kun Wang, Aiwei Liu, Xuming Hu

Proceedings of ICLR 2025 [Paper] [Code]

WaterSeeker: Efficient Detection of Watermarked Segments in Large Documents

Leyi Pan, Aiwei Liu, Yijian Lu, Zitian Gao, Yichen Di, Lijie Wen, Irwin King, Philip S. Yu

Findings of NAACL 2025 [Paper] [Code]

Entropy-Based Decoding for Retrieval-Augmented Large Language Models

Zexuan Qiu, Zijing Ou, Bin Wu, Jingjing Li, Aiwei Liu, Irwin King

Proceedings of NAACL 2025 [Paper]

ChatCite: LLM Agent with Human Workflow Guidance for Comparative Literature Summary

Yutong Li, Lu Chen, Aiwei Liu, Kai Yu, Lijie Wen

Proceedings of COLING 2024 [Paper]

Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities

Zhonghao Li, Xuming Hu, Aiwei Liu, Kening Zheng, Sirui Huang, Hui Xiong

Findings of EMNLP 2024 [Paper] [Code]

Preventing and Detecting Misinformation Generated by Large Language Models

Aiwei Liu, Qiang Sheng, Xuming Hu

SIGIR 2024 Tutorial [Paper] [Website]

On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations

Shiao Meng, Xuming Hu, Aiwei Liu, Fukun Ma, Yawen Yang, Shuang Li, Lijie Wen

Findings of ACL 2024 [Paper] [Code]

An Entropy-based Text Watermarking Detection Method

Yijian Lu, Aiwei Liu, Dianzhi Yu, Jingjing Li, Irwin King

Proceedings of ACL 2024 [Paper] [Code]

Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models

Zhiwei He, Binglin Zhou, Hongkun Hao, Aiwei Liu, Xing Wang, Zhaopeng Tu, Zhuosheng Zhang, Rui Wang

Proceedings of ACL 2024 [Paper] [Code]

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen

Proceedings of ACL 2024 [Paper] [Code]

A Semantic Invariant Robust Watermark for Large Language Models

Aiwei Liu, Leyi Pan, Xuming Hu, Shiao Meng, Lijie Wen

Proceedings of ICLR 2024 [Paper] [Code]

An Unforgeable Publicly Verifiable Watermark for Large Language Models

Aiwei Liu, Leyi Pan, Xuming Hu, Shu'ang Li, Lijie Wen, Irwin King, Philip S. Yu

Proceedings of ICLR 2024 [Paper] [Code]

RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction

Shiao Meng, Xuming Hu, Aiwei Liu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen

Proceedings of EMNLP 2023 [Paper] [Code]

Prompt me up: Unleashing the power of alignments for multimodal entity and relation extraction

Xuming Hu, Junzhe Chen, Aiwei Liu, Shiao Meng, Lijie Wen, Philip S Yu

Proceedings of MM 2023 [Paper] [Code]

EnTDA: Entity-to-Text based Data Augmentation with Semantic Coherence and Entity Preserving for various NER Tasks

Xuming Hu, Yong Jiang, Aiwei Liu, Zhongqiang Huang, Pengjun Xie, Fei Huang, Lijie Wen and Philip S. Yu

Findings of ACL 2023 [Paper]

GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks

Xuming Hu, Aiwei Liu, Zeqi Tan, Xin Zhang, Chenwei Zhang, Irwin King, Philip S. Yu

Findings of ACL 2023 [Paper]

Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing

Aiwei Liu, Wei Liu, Xuming Hu, Shuang Li, Fukun Ma, Yawen Yang and Lijie Wen

Findings of ACL 2023 [Paper] [Code]

Enhancing Cross-lingual Natural Language Inference by Soft Prompting with Multilingual Verbalizer

Shuang Li, Xuming Hu, Aiwei Liu, Yawen Yang, Fukun Ma, Philip S. Yu and Lijie Wen

Findings of ACL 2023 [Paper] [Code]

Semantics Matters: AMR-based Path Aggregation Relational Network for Aspect-based Sentiment Analysis

Fukun Ma, Xuming Hu, Aiwei Liu, Yawen Yang, Shuang Li, Philip S. Yu and Lijie Wen

Proceedings of ACL 2023 [Paper] [Code]

Gaussian Prior Reinforcement Learning for Nested Named Entity Recognition

Yawen Yang, Xuming Hu, Fukun Ma, Shu'ang Li, Aiwei Liu, Lijie Wen and Philip S. Yu

Proceedings of ICASSP 2023 [Paper] [Code]

Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph

Aiwei Liu, Xuming Hu, Li Lin, Lijie Wen

Proceedings of SIGKDD 2022 [Paper] [Code]

Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution

Aiwei Liu, Honghai Yu, Xuming Hu, Shu'ang Li, Li Lin, Fukun Ma, Yawen Yang, Lijie Wen

Proceedings of EMNLP 2022 [Paper] [Code]

CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking

Xuming Hu, Zhijiang Guo, Guanyu Wu, Aiwei Liu, Lijie Wen, Philip S. Yu

Proceedings of NAACL 2022 [Paper] [Code]

Journal Papers

Reading Broadly to Open Your Mind: Improving Open Relation Extraction With Search Documents Under Self-Supervisions

Xuming Hu, Zhaochen Hong, Chenwei Zhang, Aiwei Liu, Shiao Meng, Lijie Wen, Irwin King, Philip S. Yu

TKDE [Paper]

A Multi-level Supervised Contrastive Learning Framework for Low-Resource Natural Language Inference

Shu'ang Li, Xuming Hu, Li Lin, Aiwei Liu, Lijie Wen, Philip S Yu

TASLP 2023 [Paper] [Code]

A Survey of AIOps for Failure Management in the Era of Large Language Models

Lingzhe Zhang, Tong Jia, Mengxi Jia, Yifan Wu, Aiwei Liu, Yong Yang, Zhonghai Wu, Xuming Hu, Philip S. Yu, Ying Li

ACM Computing Surveys [Paper]

Preprints

A Comprehensive Evaluation of ChatGPT's Zero-Shot Text-to-SQL Capability

Aiwei Liu, Xuming Hu, Lijie Wen, Philip S. Yu

Preprint [Paper] [Code]

Interpretable Contrastive Monte Carlo Tree Search Reasoning

Zitian Gao, Boye Niu, Xuzheng He, Haotian Xu, Hongzhang Liu, Aiwei Liu, Xuming Hu, Lijie Wen

Preprint [Paper] [Code]

Recent Advances of Multimodal Continual Learning: A Comprehensive Survey

Dianzhi Yu, Xinni Zhang, Yankai Chen, Aiwei Liu, Yifei Zhang, Philip S Yu, Irwin King

Preprint [Paper] [Code]

Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs

Yifei Zhang, Hao Zhu, Aiwei Liu, Han Yu, Piotr Koniusz, Irwin King

Preprint [Paper]

Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap

Weizhi Zhang, Yuanchen Bei, Liangwei Yang, Henry Peng Zou, Peilin Zhou, Aiwei Liu, Yinghui Li, Hao Chen, Jianling Wang, Yu Wang, Feiran Huang, Sheng Zhou, Jiajun Bu, Allen Lin, James Caverlee, Fakhri Karray, Irwin King, Philip S Yu

Preprint [Paper] [Code]