Picture for Ling Shao

Ling Shao

Terminus Group, Beijing, China

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Figure 1 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 2 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 3 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 4 for Historical Test-time Prompt Tuning for Vision Foundation Models
Viaarxiv icon

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models

Add code
Oct 15, 2024
Figure 1 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 2 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 3 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 4 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Viaarxiv icon

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

Add code
May 13, 2024
Viaarxiv icon

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

Add code
Mar 12, 2024
Figure 1 for StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting
Figure 2 for StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting
Figure 3 for StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting
Figure 4 for StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting
Viaarxiv icon

Latent Semantic Consensus For Deterministic Geometric Model Fitting

Add code
Mar 11, 2024
Figure 1 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Figure 2 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Figure 3 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Figure 4 for Latent Semantic Consensus For Deterministic Geometric Model Fitting
Viaarxiv icon

Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

Add code
Feb 05, 2024
Figure 1 for Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Figure 2 for Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Figure 3 for Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Figure 4 for Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Viaarxiv icon

Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation

Add code
Jan 15, 2024
Viaarxiv icon

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception

Add code
Jan 13, 2024
Viaarxiv icon

Domain Adaptation for Large-Vocabulary Object Detectors

Add code
Jan 13, 2024
Viaarxiv icon

ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection

Add code
Sep 15, 2023
Figure 1 for ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection
Figure 2 for ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection
Figure 3 for ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection
Figure 4 for ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection
Viaarxiv icon