Yi Zhang

Carnegie Mellon University

Lite-SAM Is Actually What You Need for Segment Everything

Jul 12, 2024

NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning

Jul 11, 2024

Language Models Encode Collaborative Signals in Recommendation

Jul 07, 2024

Conceptual Codebook Learning for Vision-Language Models

Jul 02, 2024

Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Jun 27, 2024

Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation

Jun 27, 2024

Deep-learning-based groupwise registration for motion correction of cardiac $T_1$ mapping

Jun 21, 2024

Recurrent Inference Machine for Medical Image Registration

Jun 19, 2024

PCA-Relax: Deep-learning-based groupwise registration for motion correction of cardiac $T_1$ mapping

Jun 18, 2024

TourRank: Utilizing Large Language Models for Documents Ranking with a Tournament-Inspired Strategy

Jun 17, 2024