Sen Xing

MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost

Dec 02, 2024

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

Jun 12, 2024

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Jan 15, 2024

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Nov 06, 2023

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Dec 07, 2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Nov 17, 2022