Picture for Yang Song

Yang Song

MR Research Collaboration Team, Siemens Healthineers, Shanghai, China

When Rules Fall Short: Agent-Driven Discovery of Emerging Content Issues in Short Video Platforms

Add code
Jan 14, 2026
Viaarxiv icon

Reinforcement Learning for Follow-the-Leader Robotic Endoscopic Navigation via Synthetic Data

Add code
Jan 06, 2026
Viaarxiv icon

A Universal and Robust Framework for Multiple Gas Recognition Based-on Spherical Normalization-Coupled Mahalanobis Algorithm

Add code
Jan 05, 2026
Viaarxiv icon

Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations

Add code
Dec 24, 2025
Figure 1 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 2 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 3 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 4 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Viaarxiv icon

Principles2Plan: LLM-Guided System for Operationalising Ethical Principles into Plans

Add code
Dec 09, 2025
Viaarxiv icon

A Physics-Driven Neural Network with Parameter Embedding for Generating Quantitative MR Maps from Weighted Images

Add code
Aug 11, 2025
Viaarxiv icon

IPBA: Imperceptible Perturbation Backdoor Attack in Federated Self-Supervised Learning

Add code
Aug 11, 2025
Figure 1 for IPBA: Imperceptible Perturbation Backdoor Attack in Federated Self-Supervised Learning
Figure 2 for IPBA: Imperceptible Perturbation Backdoor Attack in Federated Self-Supervised Learning
Figure 3 for IPBA: Imperceptible Perturbation Backdoor Attack in Federated Self-Supervised Learning
Figure 4 for IPBA: Imperceptible Perturbation Backdoor Attack in Federated Self-Supervised Learning
Viaarxiv icon

SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment

Add code
Jul 10, 2025
Viaarxiv icon

From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding

Add code
Jul 03, 2025
Viaarxiv icon

Memory-Augmented Incomplete Multimodal Survival Prediction via Cross-Slide and Gene-Attentive Hypergraph Learning

Add code
Jun 24, 2025
Viaarxiv icon