Picture for Jun Du

Jun Du

Beyond Monologue: Interactive Talking-Listening Avatar Generation with Conversational Audio Context-Aware Kernels

Add code
Apr 11, 2026
Viaarxiv icon

KAT-Coder-V2 Technical Report

Add code
Mar 29, 2026
Viaarxiv icon

WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing

Add code
Mar 26, 2026
Viaarxiv icon

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

Add code
Mar 24, 2026
Viaarxiv icon

VSD-MOT: End-to-End Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Distillation

Add code
Mar 21, 2026
Viaarxiv icon

EARTalking: End-to-end GPT-style Autoregressive Talking Head Synthesis with Frame-wise Control

Add code
Mar 19, 2026
Viaarxiv icon

The USTC-NERCSLIP Systems for the CHiME-9 MCoRec Challenge

Add code
Mar 02, 2026
Viaarxiv icon

Step Potential Advantage Estimation: Harnessing Intermediate Confidence and Correctness for Efficient Mathematical Reasoning

Add code
Jan 07, 2026
Viaarxiv icon

Rethinking Secure Semantic Communications in the Age of Generative and Agentic AI: Threats and Opportunities

Add code
Jan 06, 2026
Viaarxiv icon

REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Add code
Dec 12, 2025
Viaarxiv icon