Ying Shen

UPN

EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration

Jan 23, 2026

TangramPuzzle: Evaluating Multimodal Large Language Models with Compositional Spatial Reasoning

Jan 23, 2026

Enhancing Multimodal Retrieval via Complementary Information Extraction and Alignment

Jan 08, 2026

SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution Images

Dec 23, 2025

SuperFlow: Training Flow Matching Models with RL on the Fly

Dec 17, 2025

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Nov 12, 2025

M3HG: Multimodal, Multi-scale, and Multi-type Node Heterogeneous Graph for Emotion Cause Triplet Extraction in Conversations

Aug 26, 2025

Attention Basin: Why Contextual Position Matters in Large Language Models

Aug 07, 2025

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

Jun 08, 2025

R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation

May 29, 2025