Picture for Anirban Das

Anirban Das

Project Auto-World: Towards Automated Benchmarking of Neural Relational Reasoners

Add code
Jun 23, 2026
Viaarxiv icon

T1-Bench: Benchmarking Multi-Scenario Agents in Real-World Domains

Add code
Jun 09, 2026
Viaarxiv icon

AVSD: Adaptive-View Self-Distillation by Balancing Consensus and Teacher-Specific Privileged Signals

Add code
May 20, 2026
Viaarxiv icon

Your Model Diversity, Not Method, Determines Reasoning Strategy

Add code
Apr 12, 2026
Viaarxiv icon

Can Large Language Models Understand, Reason About, and Generate Code-Switched Text?

Add code
Jan 12, 2026
Viaarxiv icon

T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning

Add code
May 22, 2025
Figure 1 for T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
Figure 2 for T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
Figure 3 for T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
Figure 4 for T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
Viaarxiv icon

Continual Pre-training of MoEs: How robust is your router?

Add code
Mar 06, 2025
Figure 1 for Continual Pre-training of MoEs: How robust is your router?
Figure 2 for Continual Pre-training of MoEs: How robust is your router?
Figure 3 for Continual Pre-training of MoEs: How robust is your router?
Figure 4 for Continual Pre-training of MoEs: How robust is your router?
Viaarxiv icon

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Add code
Oct 16, 2024
Figure 1 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 2 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 3 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 4 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Viaarxiv icon

RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization

Add code
Oct 05, 2024
Figure 1 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Figure 2 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Figure 3 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Figure 4 for RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Viaarxiv icon

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

Add code
Sep 17, 2024
Figure 1 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Figure 2 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Figure 3 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Figure 4 for Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Viaarxiv icon