Yang Yue

Shenzhen University

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Apr 18, 2025
Via arXiv

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

Apr 18, 2025
Via arXiv

EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance

Apr 17, 2025
Via arXiv

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

Apr 09, 2025
Via arXiv

Vision-to-Music Generation: A Survey

Mar 27, 2025
Via arXiv

Hierarchical Context Transformer for Multi-level Semantic Scene Understanding

Feb 21, 2025
Via arXiv

3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation

Feb 06, 2025
Via arXiv

Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition

Dec 15, 2024
Via arXiv

How Far is Video Generation from World Model: A Physical Law Perspective

Nov 04, 2024
Via arXiv

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Nov 04, 2024
Via arXiv