Picture for Mohit Bansal

Mohit Bansal

Shammie

SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data

Add code
Mar 11, 2024
Figure 1 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 2 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 3 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 4 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Viaarxiv icon

Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training

Add code
Mar 04, 2024
Figure 1 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Figure 2 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Figure 3 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Figure 4 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Viaarxiv icon

NewsQs: Multi-Source Question Generation for the Inquiring Mind

Add code
Feb 28, 2024
Figure 1 for NewsQs: Multi-Source Question Generation for the Inquiring Mind
Figure 2 for NewsQs: Multi-Source Question Generation for the Inquiring Mind
Figure 3 for NewsQs: Multi-Source Question Generation for the Inquiring Mind
Figure 4 for NewsQs: Multi-Source Question Generation for the Inquiring Mind
Viaarxiv icon

Evaluating Very Long-Term Conversational Memory of LLM Agents

Add code
Feb 27, 2024
Viaarxiv icon

Soft Self-Consistency Improves Language Model Agents

Add code
Feb 20, 2024
Viaarxiv icon

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

Add code
Feb 19, 2024
Viaarxiv icon

Rethinking Machine Unlearning for Large Language Models

Add code
Feb 15, 2024
Viaarxiv icon

Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings

Add code
Feb 09, 2024
Figure 1 for Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
Figure 2 for Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
Figure 3 for Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
Figure 4 for Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
Viaarxiv icon

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

Add code
Feb 08, 2024
Viaarxiv icon

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Add code
Feb 07, 2024
Figure 1 for VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Figure 2 for VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Figure 3 for VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Figure 4 for VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Viaarxiv icon