Picture for Jingang Wang

Jingang Wang

Making Mathematical Reasoning Adaptive

Add code
Oct 06, 2025
Figure 1 for Making Mathematical Reasoning Adaptive
Figure 2 for Making Mathematical Reasoning Adaptive
Figure 3 for Making Mathematical Reasoning Adaptive
Figure 4 for Making Mathematical Reasoning Adaptive
Viaarxiv icon

Fine-tuning Done Right in Model Editing

Add code
Sep 26, 2025
Figure 1 for Fine-tuning Done Right in Model Editing
Figure 2 for Fine-tuning Done Right in Model Editing
Figure 3 for Fine-tuning Done Right in Model Editing
Figure 4 for Fine-tuning Done Right in Model Editing
Viaarxiv icon

IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method

Add code
Sep 26, 2025
Figure 1 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 2 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 3 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Figure 4 for IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Viaarxiv icon

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Add code
Sep 10, 2025
Figure 1 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 2 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 3 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 4 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Viaarxiv icon

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Add code
Sep 03, 2025
Figure 1 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Figure 2 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Figure 3 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Figure 4 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Viaarxiv icon

Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit

Add code
Aug 25, 2025
Figure 1 for Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit
Figure 2 for Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit
Figure 3 for Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit
Figure 4 for Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit
Viaarxiv icon

Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs

Add code
May 23, 2025
Viaarxiv icon

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

Add code
May 23, 2025
Figure 1 for Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
Figure 2 for Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
Figure 3 for Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
Figure 4 for Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
Viaarxiv icon

Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

Add code
May 19, 2025
Viaarxiv icon

Dynamic Fisher-weighted Model Merging via Bayesian Optimization

Add code
Apr 26, 2025
Figure 1 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Figure 2 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Figure 3 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Figure 4 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Viaarxiv icon