Picture for Lianhui Qin

Lianhui Qin

Shammie

DeliveryBench: Can Agents Earn Profit in Real World?

Add code
Dec 22, 2025
Figure 1 for DeliveryBench: Can Agents Earn Profit in Real World?
Figure 2 for DeliveryBench: Can Agents Earn Profit in Real World?
Figure 3 for DeliveryBench: Can Agents Earn Profit in Real World?
Figure 4 for DeliveryBench: Can Agents Earn Profit in Real World?
Viaarxiv icon

SimWorld-Robotics: Synthesizing Photorealistic and Dynamic Urban Environments for Multimodal Robot Navigation and Collaboration

Add code
Dec 10, 2025
Viaarxiv icon

Multi-agent Self-triage System with Medical Flowcharts

Add code
Nov 16, 2025
Viaarxiv icon

Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation

Add code
Oct 23, 2025
Viaarxiv icon

LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

Add code
Oct 06, 2025
Figure 1 for LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Figure 2 for LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Figure 3 for LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Figure 4 for LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Viaarxiv icon

ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory

Add code
Sep 04, 2025
Figure 1 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Figure 2 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Figure 3 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Figure 4 for ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory
Viaarxiv icon

Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning

Add code
Apr 24, 2025
Viaarxiv icon

Political-LLM: Large Language Models in Political Science

Add code
Dec 09, 2024
Figure 1 for Political-LLM: Large Language Models in Political Science
Figure 2 for Political-LLM: Large Language Models in Political Science
Figure 3 for Political-LLM: Large Language Models in Political Science
Figure 4 for Political-LLM: Large Language Models in Political Science
Viaarxiv icon

Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors

Add code
Aug 15, 2024
Viaarxiv icon

WeatherQA: Can Multimodal Language Models Reason about Severe Weather?

Add code
Jun 17, 2024
Figure 1 for WeatherQA: Can Multimodal Language Models Reason about Severe Weather?
Figure 2 for WeatherQA: Can Multimodal Language Models Reason about Severe Weather?
Figure 3 for WeatherQA: Can Multimodal Language Models Reason about Severe Weather?
Figure 4 for WeatherQA: Can Multimodal Language Models Reason about Severe Weather?
Viaarxiv icon