Picture for Bin Zhu

Bin Zhu

RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models

Add code
Jul 17, 2024
Viaarxiv icon

Model Inversion Attacks Through Target-Specific Conditional Diffusion Models

Add code
Jul 16, 2024
Figure 1 for Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
Figure 2 for Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
Figure 3 for Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
Figure 4 for Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
Viaarxiv icon

EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation

Add code
Apr 19, 2024
Figure 1 for EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation
Figure 2 for EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation
Figure 3 for EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation
Figure 4 for EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation
Viaarxiv icon

AURORA: Navigating UI Tarpits via Automated Neural Screen Understanding

Add code
Apr 01, 2024
Viaarxiv icon

From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios

Add code
Mar 12, 2024
Figure 1 for From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
Figure 2 for From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
Figure 3 for From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
Figure 4 for From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
Viaarxiv icon

LLMBind: A Unified Modality-Task Integration Framework

Add code
Mar 08, 2024
Viaarxiv icon

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Add code
Feb 04, 2024
Viaarxiv icon

Video Editing for Video Retrieval

Add code
Feb 04, 2024
Viaarxiv icon

FoodLMM: A Versatile Food Assistant using Large Multi-modal Model

Add code
Dec 22, 2023
Viaarxiv icon

Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon