Yulei Niu

Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

Jun 16, 2024
Via arXiv

WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization

May 28, 2024
Via arXiv

RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos

Mar 27, 2024
Via arXiv

SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos

Mar 03, 2024
Via arXiv

Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering

Apr 07, 2023
Via arXiv

DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection

Mar 16, 2023
Via arXiv

Debiased Fine-Tuning for Vision-language Models by Prompt Regularization

Jan 29, 2023
Via arXiv

In Defense of Structural Symbolic Representation for Video Event-Relation Prediction

Jan 06, 2023
Via arXiv

Respecting Transfer Gap in Knowledge Distillation

Oct 23, 2022
Via arXiv

Explicit Image Caption Editing

Jul 20, 2022
Via arXiv