Ami Baid

Personal Visual Context Learning in Large Multimodal Models

May 11, 2026

Don't Let the Video Speak: Audio-Contrastive Preference Optimization for Audio-Visual Language Models

Apr 15, 2026

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos

Jun 13, 2024