Jun Yu

Lehigh University

Anchoring Emotions in Text: Robust Multimodal Fusion for Mimicry Intensity Estimation

Mar 16, 2026 (via arXiv)

Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity

Mar 11, 2026 (via arXiv)

Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity

Mar 11, 2026 (via arXiv)

Hierarchical Granularity Alignment and State Space Modeling for Robust Multimodal AU Detection in the Wild

Mar 11, 2026 (via arXiv)

See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation

Mar 10, 2026 (via arXiv)

Solution to the 10th ABAW Expression Recognition Challenge: A Robust Multimodal Framework with Safe Cross-Attention and Modality Dropout

Mar 09, 2026 (via arXiv)

DiffTrans: Differentiable Geometry-Materials Decomposition for Reconstructing Transparent Objects

Feb 28, 2026 (via arXiv)

DMP-3DAD: Cross-Category 3D Anomaly Detection via Realistic Depth Map Projection with Few Normal Samples

Feb 11, 2026 (via arXiv)

DeltaKV: Residual-Based KV Cache Compression via Long-Range Similarity

Feb 08, 2026 (via arXiv)

ERNIE 5.0 Technical Report

Feb 04, 2026 (via arXiv)