Picture for Rishabh Saraf

Rishabh Saraf

Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data

Add code
Sep 09, 2025
Viaarxiv icon

Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions

Add code
Jun 06, 2021
Figure 1 for Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions
Figure 2 for Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions
Figure 3 for Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions
Figure 4 for Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions
Viaarxiv icon