Picture for Bryan Russell

Bryan Russell

ResidualViT for Efficient Temporally Dense Video Encoding

Add code
Sep 16, 2025
Viaarxiv icon

Discovering Divergent Representations between Text-to-Image Models

Add code
Sep 10, 2025
Viaarxiv icon

Improving Personalized Search with Regularized Low-Rank Parameter Updates

Add code
Jun 11, 2025
Viaarxiv icon

Video-Guided Foley Sound Generation with Multimodal Controls

Add code
Nov 26, 2024
Viaarxiv icon

Generative Timelines for Instructed Visual Assembly

Add code
Nov 19, 2024
Viaarxiv icon

Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval

Add code
May 06, 2024
Figure 1 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Figure 2 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Figure 3 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Figure 4 for Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Viaarxiv icon

Koala: Key frame-conditioned long video-LLM

Add code
Apr 05, 2024
Figure 1 for Koala: Key frame-conditioned long video-LLM
Figure 2 for Koala: Key frame-conditioned long video-LLM
Figure 3 for Koala: Key frame-conditioned long video-LLM
Figure 4 for Koala: Key frame-conditioned long video-LLM
Viaarxiv icon

Customizing Motion in Text-to-Video Diffusion Models

Add code
Dec 07, 2023
Figure 1 for Customizing Motion in Text-to-Video Diffusion Models
Figure 2 for Customizing Motion in Text-to-Video Diffusion Models
Figure 3 for Customizing Motion in Text-to-Video Diffusion Models
Figure 4 for Customizing Motion in Text-to-Video Diffusion Models
Viaarxiv icon

Meta-Personalizing Vision-Language Models to Find Named Instances in Video

Add code
Jun 16, 2023
Viaarxiv icon

Language-Guided Music Recommendation for Video via Prompt Analogies

Add code
Jun 15, 2023
Viaarxiv icon