Lei Zhang

Sid

TAPTR: Tracking Any Point with Transformers as Detection

Mar 19, 2024, via arXiv

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility

Mar 18, 2024, via arXiv

Robust Overfitting Does Matter: Test-Time Adversarial Purification With FGSM

Mar 18, 2024, via arXiv

Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

Mar 17, 2024, via arXiv

Self-Supervised Video Desmoking for Laparoscopic Surgery

Mar 17, 2024, via arXiv

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Mar 16, 2024, via arXiv

Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription

Mar 16, 2024, via arXiv

Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis

Mar 11, 2024, via arXiv

Parameterized quantum comb and simpler circuits for reversing unknown qubit-unitary operations

Mar 06, 2024, via arXiv

Optimizing Mobile-Friendly Viewport Prediction for Live 360-Degree Video Streaming

Mar 05, 2024, via arXiv