Tonghua Su

CogPortrait: Fine-Grained Eye-Region Control in Portrait Animation via Hierarchical Agent Planning

May 27, 2026

WorldMAP: Bootstrapping Vision-Language Navigation Trajectory Prediction with Generative World Models

Apr 09, 2026

Chain of World: World Model Thinking in Latent Motion

Mar 03, 2026

HMVLA: Hyperbolic Multimodal Fusion for Vision-Language-Action Models

Jan 28, 2026

A Comprehensive Evaluation of LLM Reasoning: From Single-Model to Multi-Agent Paradigms

Jan 19, 2026

Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance

Nov 19, 2025

Memory-Augmented Incomplete Multimodal Survival Prediction via Cross-Slide and Gene-Attentive Hypergraph Learning

Jun 24, 2025

Multimodal Cancer Survival Analysis via Hypergraph Learning with Cross-Modality Rebalance

May 17, 2025

DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning

Apr 09, 2025

Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow

Mar 17, 2025