- [2025.04] Our survey on LRMs Safety is on arxiv now, check out the paper and repo!
- [2025.01] Our paper is accepted to NAACL 2025 Main Conference.
- [2025.01] I started my internship at Tiktok as an Algorithm Engineer Intern.
- [2024.11] Our paper is accepted to COLING 2025.
|
📑 Publications
* denotes equal contribution, see full list in Google Scholar.
|
2025
|
When Audio and Text Disagree: Benchmarking Text Bias in Large Audio-Language Models under Cross-Modal Inconsistencies
Cheng Wang, Gelei Deng, Xianglin Yang, Tianwei Zhang
Under Review, 2025
|
|
Mitigating Hallucinations in Large Vision-Language Models without Performance Degradation
Xingyu Zhu, Junfeng Fang, Xinfeng Li, Cheng Wang, Shuo Wang, Beier Zhu, Xiangnan He
Under Review, 2025
|
|
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning
Yue Liu, Shengfang Zhai, Mingzhe Du, Yulin Chen, Tri Cao, Hongcheng Gao, Cheng Wang, Xinfeng Li, Kun Wang, Junfeng Fang, Jiaheng Zhang, Bryan Hooi
Under Review, 2025
Paper / Code
|
|
Safety in Large Reasoning Models: A Survey
Cheng Wang*, Yue Liu, Baolong Bi, Duzhen Zhang, Zhongzhi Li, Junfeng Fang, Bryan Hooi
Under Review, 2025
Paper / GitHub
|
|
Beyond the Last Layer: Improving Sentence Embeddings Elicited from LLMs through Contrastive Layer Information Fusion
Cheng Wang, Yiwei Wang, Yujun Cai, Bryan Hooi
Under Review, 2025
|
|
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack
Cheng Wang, Yiwei Wang, Yujun Cai, Bryan Hooi
NAACL 2025 Main Track
Paper
|
|
Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding
Cheng Wang, Yiwei Wang, Bryan Hooi, Yujun Cai, Nanyun Peng, Kai-Wei Chang
COLING 2025
Paper
|
|
National University of Singapore (NUS)
Period: 2022 - Present
Major: Computer Science & Math
|
💼 Professional & Industry Experience
|
Tiktok | Singapore
Algorithm Engineer Intern
Period: Jan 2025 - June 2025
|
National University of Singapore | Singapore
Teaching Assistant, Introduction to AI and Machine Learning
Period: Jan 2024 - May 2024
|
Template is from Jon Barron. Last update: Apr 2025
|
|