Picture for Hong Jia

Hong Jia

RAIL: Rethinking Auditory Intelligence in Large Audio-Language Models with a CHC-Grounded Benchmark

Add code
Jun 09, 2026
Viaarxiv icon

Titans-as-a-Layer: Test-Time Memory for Conversational Speech Emotion Recognition

Add code
Jun 07, 2026
Viaarxiv icon

Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models

Add code
May 28, 2026
Viaarxiv icon

VitalAgent: A Tool-Augmented Agent for Reactive and Proactive Physiological Monitoring over Wearable Health Data

Add code
May 28, 2026
Viaarxiv icon

Why Can't They Remember? Uncovering Representation and Retrieval Bottlenecks in Multi-Turn Acoustic Memory

Add code
May 26, 2026
Viaarxiv icon

Do LLMs Need to See Everything? A Benchmark and Study of Failures in LLM-driven Smartphone Automation using Screentext vs. Screenshots

Add code
Apr 20, 2026
Viaarxiv icon

Adaptive Federated Fine-Tuning of Self-Supervised Speech Representations

Add code
Mar 23, 2026
Viaarxiv icon

Localizing and Editing Knowledge in Large Audio-Language Models

Add code
Mar 15, 2026
Viaarxiv icon

Disentangling Reasoning in Large Audio-Language Models for Ambiguous Emotion Prediction

Add code
Mar 09, 2026
Viaarxiv icon

Spiking Graph Predictive Coding for Reliable OOD Generalization

Add code
Feb 22, 2026
Viaarxiv icon