
Jun Gao

NVIDIA, University of Toronto, Vector Institute

StuPASE: Towards Low-Hallucination Studio-Quality Generative Speech Enhancement

Mar 10, 2026

AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning

Feb 25, 2026

Amber-Image: Efficient Compression of Large-Scale Diffusion Transformers

Feb 19, 2026

Improving Medical Visual Reinforcement Fine-Tuning via Perception and Reasoning Augmentation

Feb 11, 2026

SoulX-FlashHead: Oracle-guided Generation of Infinite Real-time Streaming Talking Heads

Feb 07, 2026

HUMANLLM: Benchmarking and Reinforcing LLM Anthropomorphism via Human Cognitive Patterns

Jan 15, 2026

Motion Attribution for Video Generation

Jan 13, 2026

The Semantic Architect: How FEAML Bridges Structured Data and LLMs for Multi-Label Tasks

Dec 17, 2025

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Oct 05, 2025

SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding

Sep 18, 2025