Mohit Bansal

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

Feb 19, 2024
Jinhao Duan, Renming Zhang, James Diffenderfer, Bhavya Kailkhura, Lichao Sun, Elias Stengel-Eskin, Mohit Bansal, Tianlong Chen, Kaidi Xu

Rethinking Machine Unlearning for Large Language Models

Feb 15, 2024
Sijia Liu, Yuanshun Yao, Jinghan Jia, Stephen Casper, Nathalie Baracaldo, Peter Hase, Xiaojun Xu, Yuguang Yao, Hang Li, Kush R. Varshney, Mohit Bansal, Sanmi Koyejo, Yang Liu

Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings

Feb 09, 2024
Yichen Jiang, Xiang Zhou, Mohit Bansal

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

Feb 08, 2024
Shoubin Yu, Jaehong Yoon, Mohit Bansal

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Feb 07, 2024
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme, Mohit Bansal

MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models

Feb 02, 2024
Justin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin, Mohit Bansal

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Jan 29, 2024
Elias Stengel-Eskin, Archiki Prasad, Mohit Bansal

TrustLLM: Trustworthiness in Large Language Models

Jan 25, 2024
Lichao Sun, Yue Huang, Haoran Wang, Siyuan Wu, Qihui Zhang, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, Huan Zhang, Huaxiu Yao, Manolis Kellis, Marinka Zitnik, Meng Jiang, Mohit Bansal, James Zou, Jian Pei, Jian Liu, Jianfeng Gao, Jiawei Han, Jieyu Zhao, Jiliang Tang, Jindong Wang, John Mitchell, Kai Shu, Kaidi Xu, Kai-Wei Chang, Lifang He, Lifu Huang, Michael Backes, Neil Zhenqiang Gong, Philip S. Yu, Pin-Yu Chen, Quanquan Gu, Ran Xu, Rex Ying, Shuiwang Ji, Suman Jana, Tianlong Chen, Tianming Liu, Tianyi Zhou, William Wang, Xiang Li, Xiangliang Zhang, Xiao Wang, Xing Xie, Xun Chen, Xuyu Wang, Yan Liu, Yanfang Ye, Yinzhi Cao, Yong Chen, Yue Zhao

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Jan 25, 2024
Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang
