Gen Luo

Deep Instruction Tuning for Segment Anything Model

Mar 31, 2024
Via arXiv

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization

Mar 11, 2024
Via arXiv

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

Mar 05, 2024
Via arXiv

Towards Omni-supervised Referring Expression Segmentation

Nov 01, 2023
Via arXiv

3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation

Aug 31, 2023
Via arXiv

Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models

May 24, 2023
Via arXiv

Active Teacher for Semi-Supervised Object Detection

Mar 15, 2023
Via arXiv

Towards End-to-end Semi-supervised Learning for One-stage Object Detection

Feb 22, 2023
Via arXiv

Towards Efficient Visual Adaption via Structural Re-parameterization

Feb 16, 2023
Via arXiv

What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study

Apr 17, 2022
Via arXiv