Picture for Tianyi Men

Tianyi Men

Empowering GUI Agents via Autonomous Experience Exploration and Hindsight Experience Utilization for Task Planning

Add code
Jun 25, 2026
Viaarxiv icon

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Add code
Jun 10, 2026
Viaarxiv icon

Large Language Models for Planning: A Comprehensive and Systematic Survey

Add code
May 26, 2025
Figure 1 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Figure 2 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Figure 3 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Figure 4 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Viaarxiv icon

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

Add code
Dec 18, 2024
Figure 1 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 2 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 3 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 4 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Viaarxiv icon

A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns

Add code
Oct 21, 2024
Viaarxiv icon

Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models

Add code
Jun 23, 2024
Viaarxiv icon