Picture for Shyamgopal Karthik

Shyamgopal Karthik

A Good CREPE needs more than just Sugar: Investigating Biases in Compositional Vision-Language Benchmarks

Add code
Jun 09, 2025
Viaarxiv icon

Concept-Guided Interpretability via Neural Chunking

Add code
May 16, 2025
Viaarxiv icon

Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models

Add code
Apr 03, 2025
Viaarxiv icon

Post-hoc Probabilistic Vision-Language Models

Add code
Dec 08, 2024
Figure 1 for Post-hoc Probabilistic Vision-Language Models
Figure 2 for Post-hoc Probabilistic Vision-Language Models
Figure 3 for Post-hoc Probabilistic Vision-Language Models
Figure 4 for Post-hoc Probabilistic Vision-Language Models
Viaarxiv icon

Scalable Ranked Preference Optimization for Text-to-Image Generation

Add code
Oct 23, 2024
Figure 1 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Figure 2 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Figure 3 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Figure 4 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Viaarxiv icon

EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

Add code
Jul 23, 2024
Viaarxiv icon

ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

Add code
Jun 06, 2024
Figure 1 for ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Figure 2 for ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Figure 3 for ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Figure 4 for ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Viaarxiv icon

Vision-by-Language for Training-Free Compositional Image Retrieval

Add code
Oct 13, 2023
Figure 1 for Vision-by-Language for Training-Free Compositional Image Retrieval
Figure 2 for Vision-by-Language for Training-Free Compositional Image Retrieval
Figure 3 for Vision-by-Language for Training-Free Compositional Image Retrieval
Figure 4 for Vision-by-Language for Training-Free Compositional Image Retrieval
Viaarxiv icon

ProbVLM: Probabilistic Adapter for Frozen Vison-Language Models

Add code
Jul 01, 2023
Viaarxiv icon

If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection

Add code
May 22, 2023
Figure 1 for If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Figure 2 for If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Figure 3 for If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Figure 4 for If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Viaarxiv icon