Wei-Shi Zheng

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning

Feb 04, 2025
Via arXiv

LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models

Jan 31, 2025
Via arXiv

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

Jan 24, 2025
Via arXiv

Learning Implicit Features with Flow Infused Attention for Realistic Virtual Try-On

Dec 16, 2024
Via arXiv

Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation

Dec 15, 2024
Via arXiv

TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching

Nov 26, 2024
Via arXiv

InTraGen: Trajectory-controlled Video Generation for Object Interactions

Nov 25, 2024
Via arXiv

Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models

Oct 25, 2024
Via arXiv

Real-to-Sim Grasp: Rethinking the Gap between Simulation and Real World in Grasp Detection

Oct 09, 2024
Via arXiv

Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization

Aug 25, 2024
Via arXiv