Picture for Jinyoung Yeo

Jinyoung Yeo

On Training Large Language Models for Long-Horizon Tasks: An Empirical Study of Horizon Length

Add code
May 04, 2026
Viaarxiv icon

PAC-BENCH: Evaluating Multi-Agent Collaboration under Privacy Constraints

Add code
Apr 13, 2026
Viaarxiv icon

CONDESION-BENCH: Conditional Decision-Making of Large Language Models in Compositional Action Space

Add code
Apr 10, 2026
Viaarxiv icon

AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

Add code
Feb 12, 2026
Viaarxiv icon

Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning

Add code
Sep 18, 2025
Viaarxiv icon

Quantifying Self-Awareness of Knowledge in Large Language Models

Add code
Sep 18, 2025
Viaarxiv icon

Designing Memory-Augmented AR Agents for Spatiotemporal Reasoning in Personalized Task Assistance

Add code
Aug 12, 2025
Viaarxiv icon

ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions

Add code
May 29, 2025
Viaarxiv icon

LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study

Add code
May 26, 2025
Figure 1 for LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study
Figure 2 for LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study
Figure 3 for LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study
Figure 4 for LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study
Viaarxiv icon

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Add code
May 22, 2025
Viaarxiv icon