Lingsi Zhu

Anchoring Emotions in Text: Robust Multimodal Fusion for Mimicry Intensity Estimation

Mar 16, 2026
Via arXiv

Hierarchical Granularity Alignment and State Space Modeling for Robust Multimodal AU Detection in the Wild

Mar 11, 2026
Via arXiv

Solution to the 10th ABAW Expression Recognition Challenge: A Robust Multimodal Framework with Safe Cross-Attention and Modality Dropout

Mar 09, 2026
Via arXiv

Solution for 8th Competition on Affective & Behavior Analysis in-the-wild

Mar 14, 2025
Via arXiv

Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation

Mar 13, 2025
Via arXiv

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance

Mar 05, 2025
Via arXiv