Picture for Junwen Chen

Junwen Chen

Hierarchical Visual Agent: Managing Contexts in Joint Image-Text Space for Advanced Chart Reasoning

Add code
May 05, 2026
Viaarxiv icon

STORM: End-to-End Referring Multi-Object Tracking in Videos

Add code
Apr 12, 2026
Viaarxiv icon

When Rules Fall Short: Agent-Driven Discovery of Emerging Content Issues in Short Video Platforms

Add code
Jan 14, 2026
Viaarxiv icon

PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing

Add code
Aug 24, 2025
Viaarxiv icon

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning

Add code
Jun 12, 2025
Viaarxiv icon

PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models

Add code
May 28, 2025
Viaarxiv icon

Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities

Add code
Oct 30, 2023
Figure 1 for Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities
Figure 2 for Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities
Figure 3 for Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities
Figure 4 for Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities
Viaarxiv icon

ATM: Action Temporality Modeling for Video Question Answering

Add code
Sep 05, 2023
Figure 1 for ATM: Action Temporality Modeling for Video Question Answering
Figure 2 for ATM: Action Temporality Modeling for Video Question Answering
Figure 3 for ATM: Action Temporality Modeling for Video Question Answering
Figure 4 for ATM: Action Temporality Modeling for Video Question Answering
Viaarxiv icon

Defending Adversarial Patches via Joint Region Localizing and Inpainting

Add code
Jul 26, 2023
Figure 1 for Defending Adversarial Patches via Joint Region Localizing and Inpainting
Figure 2 for Defending Adversarial Patches via Joint Region Localizing and Inpainting
Figure 3 for Defending Adversarial Patches via Joint Region Localizing and Inpainting
Figure 4 for Defending Adversarial Patches via Joint Region Localizing and Inpainting
Viaarxiv icon

Focusing on what to decode and what to train: Efficient Training with HOI Split Decoders and Specific Target Guided DeNoising

Add code
Jul 05, 2023
Viaarxiv icon