Li Liu

3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding

Mar 24, 2026

Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards

Mar 05, 2026

AG-REPA: Causal Layer Selection for Representation Alignment in Audio Flow Matching

Mar 01, 2026

Resp-Agent: An Agent-Based System for Multimodal Respiratory Sound Generation and Disease Diagnosis

Feb 19, 2026

PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

Dec 30, 2025

How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline

Dec 08, 2025

A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases

Nov 18, 2025

Hi-Reco: High-Fidelity Real-Time Conversational Digital Humans

Nov 16, 2025

Fifty Years of SAR Automatic Target Recognition: The Road Forward

Sep 26, 2025

LAVA: Language Model Assisted Verbal Autopsy for Cause-of-Death Determination

Sep 11, 2025