Fangzhou Mu

Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance

Jun 11, 2024
Via arXiv

SnAG: Scalable and Accurate Video Grounding

Apr 05, 2024
Via arXiv

Towards 3D Vision with Low-Cost Single-Photon Cameras

Mar 29, 2024
Via arXiv

Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning

Feb 22, 2024
Via arXiv

FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition

Dec 12, 2023
Via arXiv

Towards 4D Human Video Stylization

Dec 07, 2023
Via arXiv

NMS Threshold matters for Ego4D Moment Queries -- 2nd place solution to the Ego4D Moment Queries Challenge 2023

Jul 05, 2023
Via arXiv

SimHaze: game engine simulated data for real-world dehazing

May 25, 2023
Via arXiv

GLIGEN: Open-Set Grounded Text-to-Image Generation

Jan 17, 2023
Via arXiv

A Simple Transformer-Based Model for Ego4D Natural Language Queries Challenge

Nov 16, 2022
Via arXiv