Picture for Siming Fu

Siming Fu

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Add code
Mar 17, 2026
Viaarxiv icon

OmniForcing: Unleashing Real-time Joint Audio-Visual Generation

Add code
Mar 12, 2026
Viaarxiv icon

SAIL: Self-Amplified Iterative Learning for Diffusion Model Alignment with Minimal Human Feedback

Add code
Feb 05, 2026
Viaarxiv icon

Exchange Is All You Need for Remote Sensing Change Detection

Add code
Jan 12, 2026
Viaarxiv icon

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Add code
Feb 10, 2025
Viaarxiv icon

A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences

Add code
Jan 19, 2025
Figure 1 for A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences
Figure 2 for A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences
Figure 3 for A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences
Figure 4 for A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences
Viaarxiv icon

RestorerID: Towards Tuning-Free Face Restoration with ID Preservation

Add code
Nov 21, 2024
Figure 1 for RestorerID: Towards Tuning-Free Face Restoration with ID Preservation
Figure 2 for RestorerID: Towards Tuning-Free Face Restoration with ID Preservation
Figure 3 for RestorerID: Towards Tuning-Free Face Restoration with ID Preservation
Figure 4 for RestorerID: Towards Tuning-Free Face Restoration with ID Preservation
Viaarxiv icon

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Add code
Sep 26, 2024
Figure 1 for Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Figure 2 for Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Figure 3 for Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Figure 4 for Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Viaarxiv icon

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Add code
Aug 28, 2024
Viaarxiv icon

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

Add code
Jul 11, 2024
Figure 1 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 2 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 3 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Figure 4 for MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Viaarxiv icon