Picture for Letian Zhang

Letian Zhang

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Add code
Jul 28, 2025
Viaarxiv icon

EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models

Add code
May 24, 2025
Viaarxiv icon

FedALT: Federated Fine-Tuning through Adaptive Local Training with Rest-of-the-World LoRA

Add code
Mar 14, 2025
Viaarxiv icon

Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis

Add code
Mar 13, 2025
Viaarxiv icon

LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement

Add code
Nov 22, 2024
Figure 1 for LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
Figure 2 for LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
Figure 3 for LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
Figure 4 for LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
Viaarxiv icon

Learning the Optimal Path and DNN Partition for Collaborative Edge Inference

Add code
Oct 02, 2024
Viaarxiv icon

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

Add code
Jun 18, 2024
Figure 1 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 2 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 3 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 4 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Viaarxiv icon

ImageNet3D: Towards General-Purpose Object-Level 3D Understanding

Add code
Jun 13, 2024
Figure 1 for ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Figure 2 for ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Figure 3 for ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Figure 4 for ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
Viaarxiv icon

Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains

Add code
Mar 14, 2024
Figure 1 for Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains
Figure 2 for Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains
Figure 3 for Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains
Figure 4 for Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains
Viaarxiv icon

IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment

Add code
Dec 10, 2023
Viaarxiv icon