Tianshi Zheng

PatchWorld: Gradient-Free Optimization of Executable World Models

May 29, 2026

HypoAgent: An Agentic Framework for Interactive Abductive Hypothesis Generation over Knowledge Graphs

May 29, 2026

Can LLMs Time Travel? Enhancing Temporal Consistency in Legal Agentic Search through Reinforcement Learning

May 25, 2026

MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

May 14, 2026

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Jan 16, 2026

NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents

Oct 08, 2025

The Cognitive Bandwidth Bottleneck: Shifting Long-Horizon Agent from Planning with Actions to Planning with Schemas

Oct 08, 2025

LLM-Hanabi: Evaluating Multi-Agent Gameplays with Theory-of-Mind and Rationale Inference in Imperfect Information Collaboration Game

Oct 06, 2025

Structuring the Unstructured: A Systematic Review of Text-to-Structure Generation for Agentic AI with a Universal Evaluation Framework

Aug 17, 2025

Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty

Aug 12, 2025