Lu Zhang

Tony

3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding

Jan 14, 2025

AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation

Jan 14, 2025

Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes

Jan 13, 2025

Large Language Models for Bioinformatics

Jan 10, 2025

From Dense to Sparse: Event Response for Enhanced Residential Load Forecasting

Jan 08, 2025

SAT-LDM: Provably Generalizable Image Watermarking for Latent Diffusion Models with Self-Augmented Training

Dec 31, 2024

MRP-LLM: Multitask Reflective Large Language Models for Privacy-Preserving Next POI Recommendation

Dec 03, 2024

Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding

Nov 29, 2024

DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Nov 26, 2024

Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios

Nov 16, 2024