Picture for Shravan Nayak

Shravan Nayak

LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs

Add code
Jan 31, 2026
Viaarxiv icon

Grammar Search for Multi-Agent Systems

Add code
Dec 16, 2025
Figure 1 for Grammar Search for Multi-Agent Systems
Figure 2 for Grammar Search for Multi-Agent Systems
Figure 3 for Grammar Search for Multi-Agent Systems
Figure 4 for Grammar Search for Multi-Agent Systems
Viaarxiv icon

Grounding Computer Use Agents on Human Demonstrations

Add code
Nov 10, 2025
Figure 1 for Grounding Computer Use Agents on Human Demonstrations
Figure 2 for Grounding Computer Use Agents on Human Demonstrations
Figure 3 for Grounding Computer Use Agents on Human Demonstrations
Figure 4 for Grounding Computer Use Agents on Human Demonstrations
Viaarxiv icon

Value Drifts: Tracing Value Alignment During LLM Post-Training

Add code
Oct 30, 2025
Viaarxiv icon

CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics

Add code
Jun 10, 2025
Viaarxiv icon

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Add code
Mar 19, 2025
Viaarxiv icon

Exploiting Domain-Specific Parallel Data on Multilingual Language Models for Low-resource Language Translation

Add code
Dec 27, 2024
Viaarxiv icon

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Add code
Dec 05, 2024
Figure 1 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 2 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 3 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 4 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Viaarxiv icon

Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques

Add code
Jul 15, 2024
Viaarxiv icon

Benchmarking Vision Language Models for Cultural Understanding

Add code
Jul 15, 2024
Figure 1 for Benchmarking Vision Language Models for Cultural Understanding
Figure 2 for Benchmarking Vision Language Models for Cultural Understanding
Figure 3 for Benchmarking Vision Language Models for Cultural Understanding
Figure 4 for Benchmarking Vision Language Models for Cultural Understanding
Viaarxiv icon