Picture for Chi Chen

Chi Chen

Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

Add code
Mar 13, 2026
Viaarxiv icon

PolyCrysDiff: Controllable Generation of Three-Dimensional Computable Polycrystalline Material Structures

Add code
Mar 12, 2026
Viaarxiv icon

Imagination Helps Visual Reasoning, But Not Yet in Latent Space

Add code
Feb 26, 2026
Viaarxiv icon

MM-UAVBench: How Well Do Multimodal Large Language Models See, Think, and Plan in Low-Altitude UAV Scenarios?

Add code
Dec 29, 2025
Viaarxiv icon

Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation

Add code
Jun 10, 2025
Viaarxiv icon

SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds

Add code
May 30, 2025
Figure 1 for SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds
Figure 2 for SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds
Figure 3 for SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds
Figure 4 for SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds
Viaarxiv icon

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding

Add code
May 27, 2025
Viaarxiv icon

Visual Abstract Thinking Empowers Multimodal Reasoning

Add code
May 26, 2025
Viaarxiv icon

ChartEdit: How Far Are MLLMs From Automating Chart Analysis? Evaluating MLLMs' Capability via Chart Editing

Add code
May 17, 2025
Viaarxiv icon

Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model

Add code
May 10, 2025
Figure 1 for Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model
Figure 2 for Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model
Figure 3 for Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model
Figure 4 for Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model
Viaarxiv icon