Picture for Hongkai Chen

Hongkai Chen

GeoSVG-RL: Geometry-Aware Reinforcement Learning for Layout-Constrained Text-to-SVG Diagram Generation

Add code
May 25, 2026
Viaarxiv icon

Pro$^2$Assist: Continuous Step-Aware Proactive Assistance with Multimodal Egocentric Perception for Long-Horizon Procedural Tasks

Add code
May 05, 2026
Viaarxiv icon

AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning

Add code
Feb 11, 2026
Viaarxiv icon

OptiSQL: Executable SQL Generation from Optical Tokens

Add code
Jan 21, 2026
Viaarxiv icon

DeepFeature: Iterative Context-aware Feature Generation for Wearable Biosignals

Add code
Dec 09, 2025
Viaarxiv icon

Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation

Add code
Oct 09, 2025
Viaarxiv icon

ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions

Add code
May 20, 2025
Viaarxiv icon

An LLM-Empowered Low-Resolution Vision System for On-Device Human Behavior Understanding

Add code
May 03, 2025
Viaarxiv icon

OpenTCM: A GraphRAG-Empowered LLM-based System for Traditional Chinese Medicine Knowledge Retrieval and Diagnosis

Add code
Apr 28, 2025
Viaarxiv icon

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Add code
Apr 07, 2025
Viaarxiv icon