Picture for Dahua Lin

Dahua Lin

Eric

Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering

Add code
May 22, 2025
Figure 1 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 2 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 3 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 4 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Viaarxiv icon

Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models

Add code
May 22, 2025
Figure 1 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Figure 2 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Figure 3 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Figure 4 for Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Viaarxiv icon

Visual Agentic Reinforcement Fine-Tuning

Add code
May 20, 2025
Viaarxiv icon

MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design

Add code
May 09, 2025
Figure 1 for MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design
Figure 2 for MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design
Figure 3 for MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design
Figure 4 for MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design
Viaarxiv icon

Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation

Add code
Apr 17, 2025
Viaarxiv icon

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Add code
Apr 15, 2025
Viaarxiv icon

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Add code
Apr 10, 2025
Viaarxiv icon

MM-IFEngine: Towards Multimodal Instruction Following

Add code
Apr 10, 2025
Viaarxiv icon

HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

Add code
Apr 08, 2025
Viaarxiv icon

Multi-identity Human Image Animation with Structural Video Diffusion

Add code
Apr 05, 2025
Viaarxiv icon