Hritik Bansal

Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation

Jul 02, 2024
Via arXiv

DataComp-LM: In search of the next generation of training sets for language models

Jun 18, 2024
Via arXiv

VideoPhy: Evaluating Physical Commonsense for Video Generation

Jun 05, 2024
Via arXiv

TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation

May 07, 2024
Via arXiv

GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling

Apr 07, 2024
Via arXiv

Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation

Apr 02, 2024
Via arXiv

Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization

Mar 31, 2024
Via arXiv

Improving Event Definition Following For Zero-Shot Event Detection

Mar 05, 2024
Via arXiv

ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models

Jan 24, 2024
Via arXiv

Scaling transformer neural networks for skillful and reliable medium-range weather forecasting

Dec 06, 2023
Via arXiv