Picture for Fan Ma

Fan Ma

PhyEdit: Towards Real-World Object Manipulation via Physically-Grounded Image Editing

Add code
Apr 09, 2026
Viaarxiv icon

LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization

Add code
Mar 30, 2026
Viaarxiv icon

FoleyDirector: Fine-Grained Temporal Steering for Video-to-Audio Generation via Structured Scripts

Add code
Mar 20, 2026
Viaarxiv icon

EHRNavigator: A Multi-Agent System for Patient-Level Clinical Question Answering over Heterogeneous Electronic Health Records

Add code
Jan 15, 2026
Viaarxiv icon

Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time

Add code
Sep 26, 2025
Figure 1 for Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time
Figure 2 for Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time
Figure 3 for Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time
Figure 4 for Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time
Viaarxiv icon

From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment

Add code
Mar 26, 2025
Viaarxiv icon

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Add code
Feb 05, 2025
Figure 1 for DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
Figure 2 for DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
Figure 3 for DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
Figure 4 for DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
Viaarxiv icon

TV-Dialogue: Crafting Theme-Aware Video Dialogues with Immersive Interaction

Add code
Jan 31, 2025
Figure 1 for TV-Dialogue: Crafting Theme-Aware Video Dialogues with Immersive Interaction
Figure 2 for TV-Dialogue: Crafting Theme-Aware Video Dialogues with Immersive Interaction
Figure 3 for TV-Dialogue: Crafting Theme-Aware Video Dialogues with Immersive Interaction
Figure 4 for TV-Dialogue: Crafting Theme-Aware Video Dialogues with Immersive Interaction
Viaarxiv icon

BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities

Add code
Jan 24, 2025
Figure 1 for BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
Figure 2 for BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
Figure 3 for BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
Figure 4 for BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
Viaarxiv icon

InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation

Add code
Nov 27, 2024
Figure 1 for InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
Figure 2 for InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
Figure 3 for InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
Figure 4 for InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
Viaarxiv icon