Picture for Zheming Yang

Zheming Yang

When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning

Add code
Apr 08, 2026
Viaarxiv icon

Not All Negative Samples Are Equal: LLMs Learn Better from Plausible Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

From Atoms to Chains: Divergence-Guided Reasoning Curriculum for Unlabeled LLM Domain Adaptation

Add code
Jan 27, 2026
Viaarxiv icon

Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding

Add code
Jan 12, 2026
Viaarxiv icon

ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving

Add code
Jan 08, 2026
Viaarxiv icon

AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection

Add code
Jan 08, 2026
Viaarxiv icon

SearchAttack: Red-Teaming LLMs against Real-World Threats via Framing Unsafe Web Information-Seeking Tasks

Add code
Jan 07, 2026
Viaarxiv icon

Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models

Add code
Apr 30, 2025
Figure 1 for Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models
Figure 2 for Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models
Figure 3 for Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models
Figure 4 for Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models
Viaarxiv icon

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

Add code
Mar 15, 2024
Figure 1 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 2 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 3 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 4 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Viaarxiv icon