
Hanbo Huang, Yiran Zhang, Hao Zheng, Xuan Gong, Yihan Li, Lin Liu, Zhuotao Liu, Shiyu Liang# (# corresponding author)
ICML 2026
RLCracker studies adaptive reinforcement-learning attacks against LLM watermarks, exposing watermark vulnerabilities under learned black-box attack policies.
Hanbo Huang, Yiran Zhang, Hao Zheng, Xuan Gong, Yihan Li, Lin Liu, Zhuotao Liu, Shiyu Liang# (# corresponding author)
ICML 2026
RLCracker studies adaptive reinforcement-learning attacks against LLM watermarks, exposing watermark vulnerabilities under learned black-box attack policies.

Xuan Gong, Senmiao Wang, Hanbo Huang, Ruoyu Sun, Shiyu Liang# (# corresponding author)
ACL 2026 Main
VCORE introduces variance-controlled optimization-based reweighting for chain-of-thought supervision, improving how reasoning traces contribute to model training.
Xuan Gong, Senmiao Wang, Hanbo Huang, Ruoyu Sun, Shiyu Liang# (# corresponding author)
ACL 2026 Main
VCORE introduces variance-controlled optimization-based reweighting for chain-of-thought supervision, improving how reasoning traces contribute to model training.
Hanbo Huang*, Xuan Gong*, Jing Wang, Lei Bai, Xiang Xiao, Weishu Zhao, Shiyu Liang# (* equal contribution, # corresponding author)
Preprint 2026
GGBound presents a genome-grounded agent for predicting microbial life boundaries by combining genomic evidence with agentic reasoning.
Hanbo Huang*, Xuan Gong*, Jing Wang, Lei Bai, Xiang Xiao, Weishu Zhao, Shiyu Liang# (* equal contribution, # corresponding author)
Preprint 2026
GGBound presents a genome-grounded agent for predicting microbial life boundaries by combining genomic evidence with agentic reasoning.

Xuan Gong, Hanbo Huang, Hao Zheng, Yiran Zhang, Wenbin Dai, Weishu Zhao, Shiyu Liang# (# corresponding author)
CompLearn Workshop @ ICML 2026
This work introduces reflection anchors for propagation-aware visual retention, targeting long-chain multimodal reasoning where visual evidence must remain reliable across extended inference.
Xuan Gong, Hanbo Huang, Hao Zheng, Yiran Zhang, Wenbin Dai, Weishu Zhao, Shiyu Liang# (# corresponding author)
CompLearn Workshop @ ICML 2026
This work introduces reflection anchors for propagation-aware visual retention, targeting long-chain multimodal reasoning where visual evidence must remain reliable across extended inference.
Hanbo Huang, Xuan Gong, Yiran Zhang, Hao Zheng, Shiyu Liang# (# corresponding author)
Preprint 2026
RLSpoofer proposes a lightweight evaluation framework for testing LLM watermark spoofing resilience with reinforcement-learning-based attack behavior.
Hanbo Huang, Xuan Gong, Yiran Zhang, Hao Zheng, Shiyu Liang# (# corresponding author)
Preprint 2026
RLSpoofer proposes a lightweight evaluation framework for testing LLM watermark spoofing resilience with reinforcement-learning-based attack behavior.

Xuan Gong*, Hanbo Huang*, Yiran Zhang*, Shiyu Liang# (* equal contribution, # corresponding author)
ICASSP 2026
We revisit how supervised fine-tuning affects factual knowledge in LLMs, revealing a factuality gap between known and unknown knowledge. This gap can be mitigated at inference via in-context learning (ICL) or out-of-distribution prompts. Our theoretical and empirical results show that test-time prompts can overshadow fine-tuning data, suggesting ICL can compensate for poor fine-tuning and should be considered in evaluating fine-tuning strategies.
Xuan Gong*, Hanbo Huang*, Yiran Zhang*, Shiyu Liang# (* equal contribution, # corresponding author)
ICASSP 2026
We revisit how supervised fine-tuning affects factual knowledge in LLMs, revealing a factuality gap between known and unknown knowledge. This gap can be mitigated at inference via in-context learning (ICL) or out-of-distribution prompts. Our theoretical and empirical results show that test-time prompts can overshadow fine-tuning data, suggesting ICL can compensate for poor fine-tuning and should be considered in evaluating fine-tuning strategies.
Xuan Gong, Hanbo Huang, Hao Zheng, Yiran Zhang, Wenbin Dai, Weishu Zhao, Shiyu Liang# (# corresponding author)
Preprint 2026
This work studies reflection anchors for interpretable compositional visual reasoning in multimodal reinforcement learning.
Xuan Gong, Hanbo Huang, Hao Zheng, Yiran Zhang, Wenbin Dai, Weishu Zhao, Shiyu Liang# (# corresponding author)
Preprint 2026
This work studies reflection anchors for interpretable compositional visual reasoning in multimodal reinforcement learning.

Hanbo Huang, Xuan Gong, Yiran Zhang, Hao Zheng, Wenbin Dai, Jie Kuang, Shiyu Liang# (# corresponding author)
Trustworthy AI for Good (AI4GOOD) Workshop @ ICML 2026
This workshop paper studies sample-efficient black-box spoofing attacks for stress-testing the robustness of LLM watermarks.
Hanbo Huang, Xuan Gong, Yiran Zhang, Hao Zheng, Wenbin Dai, Jie Kuang, Shiyu Liang# (# corresponding author)
Trustworthy AI for Good (AI4GOOD) Workshop @ ICML 2026
This workshop paper studies sample-efficient black-box spoofing attacks for stress-testing the robustness of LLM watermarks.

Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei# (# corresponding author)
EMNLP 2024 Main
We propose DAMRO, a training-free method to reduce object hallucination in LVLMs by filtering misleading high-attention background tokens using the ViT CLS token. DAMRO significantly improves hallucination control on models like LLaVA and InstructBLIP across multiple benchmarks.
Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei# (# corresponding author)
EMNLP 2024 Main
We propose DAMRO, a training-free method to reduce object hallucination in LVLMs by filtering misleading high-attention background tokens using the ViT CLS token. DAMRO significantly improves hallucination control on models like LLaVA and InstructBLIP across multiple benchmarks.