Picture for Wenwen Qiang

Wenwen Qiang

Group Causal Policy Optimization for Post-Training Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon

Hacking Hallucinations of MLLMs with Causal Sufficiency and Necessity

Add code
Aug 06, 2025
Viaarxiv icon

Causal Reward Adjustment: Mitigating Reward Hacking in External Reasoning via Backdoor Correction

Add code
Aug 06, 2025
Viaarxiv icon

Multi-Modal Learning with Bayesian-Oriented Gradient Calibration

Add code
May 29, 2025
Viaarxiv icon

On the Transferability and Discriminability of Repersentation Learning in Unsupervised Domain Adaptation

Add code
May 28, 2025
Viaarxiv icon

Reward Model Generalization for Compute-Aware Test-Time Reasoning

Add code
May 23, 2025
Viaarxiv icon

CAIFormer: A Causal Informed Transformer for Multivariate Time Series Forecasting

Add code
May 22, 2025
Viaarxiv icon

Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs

Add code
May 15, 2025
Viaarxiv icon

Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation

Add code
May 10, 2025
Viaarxiv icon

Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation

Add code
Mar 30, 2025
Figure 1 for Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation
Figure 2 for Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation
Figure 3 for Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation
Figure 4 for Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation
Viaarxiv icon