Yueqi Song

Grounding Multilingual Multimodal LLMs With Cultural Knowledge

Aug 12, 2025

Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics

Jun 14, 2025

FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks

May 26, 2025

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Apr 15, 2025

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Apr 09, 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Mar 10, 2025

Beyond Browsing: API-Based Web Agents

Oct 21, 2024

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Oct 21, 2024

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Jul 23, 2024

An image speaks a thousand words, but can everyone listen? On translating images for cultural relevance

Apr 01, 2024