Wenwen Qiang

Group Causal Policy Optimization for Post-Training Large Language Models

Aug 07, 2025

Hacking Hallucinations of MLLMs with Causal Sufficiency and Necessity

Aug 06, 2025

Causal Reward Adjustment: Mitigating Reward Hacking in External Reasoning via Backdoor Correction

Aug 06, 2025

Multi-Modal Learning with Bayesian-Oriented Gradient Calibration

May 29, 2025

On the Transferability and Discriminability of Representation Learning in Unsupervised Domain Adaptation

May 28, 2025

Reward Model Generalization for Compute-Aware Test-Time Reasoning

May 23, 2025

CAIFormer: A Causal Informed Transformer for Multivariate Time Series Forecasting

May 22, 2025

Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs

May 15, 2025

Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation

May 10, 2025

Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation

Mar 30, 2025