Wei Ji

Grounding is All You Need? Dual Temporal Grounding for Video Dialog

Oct 08, 2024
Via arXiv

Personalized Knowledge Tracing through Student Representation Reconstruction and Class Imbalance Mitigation

Sep 10, 2024
Via arXiv

Semantic Alignment for Multimodal Large Language Models

Aug 23, 2024
Via arXiv

Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation

Jul 29, 2024
Via arXiv

DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Jul 22, 2024
Via arXiv

Described Spatial-Temporal Video Detection

Jul 08, 2024
Via arXiv

Spider: A Unified Framework for Context-dependent Concept Understanding

May 02, 2024
Via arXiv

GOOD: Towards Domain Generalized Orientated Object Detection

Feb 20, 2024
Via arXiv

Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization

Jan 16, 2024
Via arXiv

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback

Nov 25, 2023
Via arXiv