Picture for Jie Luo

Jie Luo

Drive Any Mesh: 4D Latent Diffusion for Mesh Deformation from Video

Add code
Jun 09, 2025
Viaarxiv icon

VoQA: Visual-only Question Answering

Add code
May 20, 2025
Viaarxiv icon

Enhancing the Efficiency of Complex Systems Crystal Structure Prediction by Active Learning Guided Machine Learning Potential

Add code
May 13, 2025
Viaarxiv icon

An Empirical Study of Qwen3 Quantization

Add code
May 04, 2025
Viaarxiv icon

1-Tb/s/λ Transmission over Record 10714-km AR-HCF

Add code
Apr 02, 2025
Viaarxiv icon

Coarse-to-Fine Semantic Communication Systems for Text Transmission

Add code
Apr 02, 2025
Viaarxiv icon

Bridging Domain Gaps between Pretrained Multimodal Models and Recommendations

Add code
Feb 21, 2025
Viaarxiv icon

Clustering Properties of Self-Supervised Learning

Add code
Jan 30, 2025
Viaarxiv icon

MMedAgent: Learning to Use Medical Tools with Multi-modal Agent

Add code
Jul 02, 2024
Figure 1 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
Figure 2 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
Figure 3 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
Figure 4 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
Viaarxiv icon

TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models

Add code
May 20, 2024
Figure 1 for TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
Figure 2 for TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
Figure 3 for TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
Viaarxiv icon