Lifu Huang

UC Davis

UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers

Oct 26, 2024

RoRA-VLM: Robust Retrieval-Augmented Vision Language Models

Oct 11, 2024

DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models

Oct 09, 2024

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

Oct 04, 2024

SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models

Oct 04, 2024

DiPT: Enhancing LLM reasoning through diversified perspective-taking

Sep 10, 2024

Advancing Chart Question Answering with Robust Chart Component Recognition

Jul 19, 2024

AMD: Automatic Multi-step Distillation of Large-scale Vision Models

Jul 05, 2024

Lateralization LoRA: Interleaved Instruction Tuning with Modality-Specialized Adaptations

Jul 04, 2024

Holistic Evaluation for Interleaved Text-and-Image Generation

Jun 20, 2024