Linjie Yang

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Mar 05, 2024
Weizhi Wang, Khalil Mrini, Linjie Yang, Sateesh Kumar, Yu Tian, Xifeng Yan, Heng Wang

Via arXiv

Video Recognition in Portrait Mode

Dec 21, 2023
Mingfei Han, Linjie Yang, Xiaojie Jin, Jiashi Feng, Xiaojun Chang, Heng Wang

Via arXiv

Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos

Dec 19, 2023
Mingfei Han, Linjie Yang, Xiaojun Chang, Heng Wang

Via arXiv

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling

Oct 11, 2023
Haogeng Liu, Qihang Fan, Tingkai Liu, Linjie Yang, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Via arXiv

Selective Feature Adapter for Dense Vision Transformers

Oct 03, 2023
Xueqing Deng, Qi Fan, Xiaojie Jin, Linjie Yang, Peng Wang

Via arXiv

The Devil is in the Details: A Deep Dive into the Rabbit Hole of Data Filtering

Sep 27, 2023
Haichao Yu, Yu Tian, Sateesh Kumar, Linjie Yang, Heng Wang

Via arXiv

Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation

Jul 27, 2023
Yiming Cui, Linjie Yang, Haichao Yu

Via arXiv

DQ-Det: Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation

Jul 23, 2023
Yiming Cui, Linjie Yang, Haichao Yu

Via arXiv

Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?

Jul 22, 2023
Cheng-En Wu, Yu Tian, Haichao Yu, Heng Wang, Pedro Morgado, Yu Hen Hu, Linjie Yang

Via arXiv

Exploring the Role of Audio in Video Captioning

Jun 21, 2023
Yuhan Shen, Linjie Yang, Longyin Wen, Haichao Yu, Ehsan Elhamifar, Heng Wang

Via arXiv