Picture for Xu Luo

Xu Luo

Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach

Add code
May 22, 2025
Viaarxiv icon

InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning

Add code
May 20, 2025
Viaarxiv icon

Policy Contrastive Decoding for Robotic Foundation Models

Add code
May 19, 2025
Viaarxiv icon

Semantic Data Augmentation Enhanced Invariant Risk Minimization for Medical Image Domain Generalization

Add code
Feb 08, 2025
Viaarxiv icon

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

Add code
May 09, 2024
Figure 1 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 2 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 3 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Figure 4 for Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Viaarxiv icon

CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model

Add code
Mar 13, 2024
Figure 1 for CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
Figure 2 for CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
Figure 3 for CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
Figure 4 for CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
Viaarxiv icon

Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval

Add code
Dec 16, 2023
Viaarxiv icon

3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V

Add code
Dec 15, 2023
Viaarxiv icon

Less is More: On the Feature Redundancy of Pretrained Models When Transferring to Few-shot Tasks

Add code
Oct 05, 2023
Viaarxiv icon

Language-Enhanced Session-Based Recommendation with Decoupled Contrastive Learning

Add code
Jul 20, 2023
Figure 1 for Language-Enhanced Session-Based Recommendation with Decoupled Contrastive Learning
Figure 2 for Language-Enhanced Session-Based Recommendation with Decoupled Contrastive Learning
Figure 3 for Language-Enhanced Session-Based Recommendation with Decoupled Contrastive Learning
Figure 4 for Language-Enhanced Session-Based Recommendation with Decoupled Contrastive Learning
Viaarxiv icon