Picture for Shiji Song

Shiji Song

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Add code
Mar 19, 2026
Viaarxiv icon

UltraStar: Semantic-Aware Star Graph Modeling for Echocardiography Navigation

Add code
Mar 02, 2026
Viaarxiv icon

Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception

Add code
Sep 18, 2025
Viaarxiv icon

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Add code
Apr 18, 2025
Viaarxiv icon

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

Add code
Apr 18, 2025
Figure 1 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Figure 2 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Figure 3 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Figure 4 for CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Viaarxiv icon

EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance

Add code
Apr 17, 2025
Figure 1 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Figure 2 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Figure 3 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Figure 4 for EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Viaarxiv icon

Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition

Add code
Dec 15, 2024
Figure 1 for Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Figure 2 for Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Figure 3 for Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Figure 4 for Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Viaarxiv icon

Bridging the Divide: Reconsidering Softmax and Linear Attention

Add code
Dec 09, 2024
Figure 1 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 2 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 3 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 4 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Viaarxiv icon

A Unified Interaction Control Framework for Safe Robotic Ultrasound Scanning with Human-Intention-Aware Compliance

Add code
Nov 29, 2024
Figure 1 for A Unified Interaction Control Framework for Safe Robotic Ultrasound Scanning with Human-Intention-Aware Compliance
Figure 2 for A Unified Interaction Control Framework for Safe Robotic Ultrasound Scanning with Human-Intention-Aware Compliance
Figure 3 for A Unified Interaction Control Framework for Safe Robotic Ultrasound Scanning with Human-Intention-Aware Compliance
Figure 4 for A Unified Interaction Control Framework for Safe Robotic Ultrasound Scanning with Human-Intention-Aware Compliance
Viaarxiv icon

Advancing Generalization in PINNs through Latent-Space Representations

Add code
Nov 28, 2024
Figure 1 for Advancing Generalization in PINNs through Latent-Space Representations
Figure 2 for Advancing Generalization in PINNs through Latent-Space Representations
Figure 3 for Advancing Generalization in PINNs through Latent-Space Representations
Figure 4 for Advancing Generalization in PINNs through Latent-Space Representations
Viaarxiv icon