Picture for Feng Li

Feng Li

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Add code
Oct 27, 2025
Viaarxiv icon

Probing Social Identity Bias in Chinese LLMs with Gendered Pronouns and Social Groups

Add code
Oct 08, 2025
Viaarxiv icon

Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication

Add code
Jul 17, 2025
Viaarxiv icon

SignAligner: Harmonizing Complementary Pose Modalities for Coherent Sign Language Generation

Add code
Jun 13, 2025
Viaarxiv icon

CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation

Add code
May 22, 2025
Viaarxiv icon

Emerging Properties in Unified Multimodal Pretraining

Add code
May 20, 2025
Figure 1 for Emerging Properties in Unified Multimodal Pretraining
Figure 2 for Emerging Properties in Unified Multimodal Pretraining
Figure 3 for Emerging Properties in Unified Multimodal Pretraining
Figure 4 for Emerging Properties in Unified Multimodal Pretraining
Viaarxiv icon

InstanceBEV: Unifying Instance and BEV Representation for Global Modeling

Add code
May 20, 2025
Figure 1 for InstanceBEV: Unifying Instance and BEV Representation for Global Modeling
Figure 2 for InstanceBEV: Unifying Instance and BEV Representation for Global Modeling
Figure 3 for InstanceBEV: Unifying Instance and BEV Representation for Global Modeling
Figure 4 for InstanceBEV: Unifying Instance and BEV Representation for Global Modeling
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Viaarxiv icon

EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events

Add code
May 07, 2025
Figure 1 for EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events
Figure 2 for EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events
Figure 3 for EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events
Figure 4 for EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events
Viaarxiv icon

NTIRE 2025 Challenge on Event-Based Image Deblurring: Methods and Results

Add code
Apr 16, 2025
Viaarxiv icon