Zhongjiang He

WAT: Online Video Understanding Needs Watching Before Thinking

Mar 12, 2026

Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing

Mar 02, 2026

Geometric Image Editing via Effects-Sensitive In-Context Inpainting with Diffusion Transformers

Feb 09, 2026

ReasonTabQA: A Comprehensive Benchmark for Table Question Answering from Real World Industrial Scenarios

Jan 12, 2026

MR-UIE: Multi-Perspective Reasoning with Reinforcement Learning for Universal Information Extraction

Sep 11, 2025

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Jul 24, 2025

Technical Report of TeleChat2, TeleChat2.5 and T1

Jul 24, 2025

BoSS: Beyond-Semantic Speech

Jul 23, 2025

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Jul 03, 2025

Detailed Object Description with Controllable Dimensions

Nov 28, 2024