Picture for Xinyu Wei

Xinyu Wei

Separation and Collaboration: Two-Level Routing Grouped Mixture-of-Experts for Multi-Domain Continual Learning

Add code
Aug 11, 2025
Viaarxiv icon

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

Add code
Jun 05, 2025
Viaarxiv icon

Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO

Add code
May 22, 2025
Viaarxiv icon

Are Large Language Models Good In-context Learners for Financial Sentiment Analysis?

Add code
Mar 06, 2025
Viaarxiv icon

MAVIS: Mathematical Visual Instruction Tuning

Add code
Jul 11, 2024
Figure 1 for MAVIS: Mathematical Visual Instruction Tuning
Figure 2 for MAVIS: Mathematical Visual Instruction Tuning
Figure 3 for MAVIS: Mathematical Visual Instruction Tuning
Figure 4 for MAVIS: Mathematical Visual Instruction Tuning
Viaarxiv icon

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception

Add code
Jun 22, 2024
Figure 1 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 2 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 3 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 4 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Viaarxiv icon

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Add code
Apr 01, 2024
Figure 1 for Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Figure 2 for Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Figure 3 for Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Figure 4 for Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Viaarxiv icon

IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models

Add code
Mar 21, 2024
Viaarxiv icon

Cloud-Device Collaborative Learning for Multimodal Large Language Models

Add code
Dec 26, 2023
Figure 1 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 2 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 3 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 4 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Viaarxiv icon

Accretionary Learning with Deep Neural Networks

Add code
Nov 21, 2021
Figure 1 for Accretionary Learning with Deep Neural Networks
Figure 2 for Accretionary Learning with Deep Neural Networks
Figure 3 for Accretionary Learning with Deep Neural Networks
Figure 4 for Accretionary Learning with Deep Neural Networks
Viaarxiv icon