Picture for Dong Yu

Dong Yu

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Add code
Dec 17, 2025
Figure 1 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Figure 2 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Figure 3 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Figure 4 for Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Viaarxiv icon

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

Add code
Dec 14, 2025
Figure 1 for MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
Figure 2 for MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
Figure 3 for MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
Figure 4 for MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
Viaarxiv icon

Auden-Voice: General-Purpose Voice Encoder for Speech and Language Understanding

Add code
Nov 19, 2025
Viaarxiv icon

TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation

Add code
Nov 18, 2025
Viaarxiv icon

DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains

Add code
Oct 31, 2025
Viaarxiv icon

Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling

Add code
Oct 30, 2025
Viaarxiv icon

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Add code
Oct 16, 2025
Viaarxiv icon

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Add code
Oct 02, 2025
Figure 1 for CLUE: Non-parametric Verification from Experience via Hidden-State Clustering
Figure 2 for CLUE: Non-parametric Verification from Experience via Hidden-State Clustering
Figure 3 for CLUE: Non-parametric Verification from Experience via Hidden-State Clustering
Figure 4 for CLUE: Non-parametric Verification from Experience via Hidden-State Clustering
Viaarxiv icon

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Add code
Oct 01, 2025
Figure 1 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Figure 2 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Figure 3 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Figure 4 for VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
Viaarxiv icon

UniGist: Towards General and Hardware-aligned Sequence-level Long Context Compression

Add code
Sep 19, 2025
Viaarxiv icon