Picture for Mohan Kankanhalli

Mohan Kankanhalli

STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting

Add code
Jun 07, 2024
Viaarxiv icon

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR

Add code
May 27, 2024
Figure 1 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Figure 2 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Figure 3 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Figure 4 for Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
Viaarxiv icon

Multi-Modal Recommendation Unlearning

Add code
May 24, 2024
Viaarxiv icon

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

Add code
May 22, 2024
Figure 1 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Figure 2 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Figure 3 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Figure 4 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Viaarxiv icon

Bridging the Intent Gap: Knowledge-Enhanced Visual Generation

Add code
May 21, 2024
Figure 1 for Bridging the Intent Gap: Knowledge-Enhanced Visual Generation
Figure 2 for Bridging the Intent Gap: Knowledge-Enhanced Visual Generation
Figure 3 for Bridging the Intent Gap: Knowledge-Enhanced Visual Generation
Figure 4 for Bridging the Intent Gap: Knowledge-Enhanced Visual Generation
Viaarxiv icon

Cluster-based Graph Collaborative Filtering

Add code
Apr 16, 2024
Figure 1 for Cluster-based Graph Collaborative Filtering
Figure 2 for Cluster-based Graph Collaborative Filtering
Figure 3 for Cluster-based Graph Collaborative Filtering
Figure 4 for Cluster-based Graph Collaborative Filtering
Viaarxiv icon

S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing

Add code
Apr 11, 2024
Viaarxiv icon

How to Understand Named Entities: Using Common Sense for News Captioning

Add code
Mar 11, 2024
Figure 1 for How to Understand Named Entities: Using Common Sense for News Captioning
Figure 2 for How to Understand Named Entities: Using Common Sense for News Captioning
Figure 3 for How to Understand Named Entities: Using Common Sense for News Captioning
Figure 4 for How to Understand Named Entities: Using Common Sense for News Captioning
Viaarxiv icon

EcoVal: An Efficient Data Valuation Framework for Machine Learning

Add code
Feb 15, 2024
Viaarxiv icon

Diffusion Facial Forgery Detection

Add code
Jan 29, 2024
Viaarxiv icon