Xinyang Li

VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?

Nov 19, 2024

Look a Group at Once: Multi-Slide Modeling for Survival Prediction

Nov 18, 2024

Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching

Oct 08, 2024

CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration

Aug 05, 2024

MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval

Aug 05, 2024

R-SFLLM: Jamming Resilient Framework for Split Federated Learning with Large Language Models

Jul 16, 2024

FAGhead: Fully Animate Gaussian Head from Monocular Videos

Jun 27, 2024

Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text

Jun 25, 2024

GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane

May 27, 2024

GGAvatar: Geometric Adjustment of Gaussian Head Avatar

May 20, 2024