Xiang Zhang

Victor

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Jul 25, 2024
Via arXiv

Guidelines for Augmentation Selection in Contrastive Learning for Time Series Classification

Jul 12, 2024
Via arXiv

HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

Jul 08, 2024
Via arXiv

Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations

Jul 04, 2024
Via arXiv

Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning

Jul 01, 2024
Via arXiv

Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models

Jun 19, 2024
Via arXiv

OmniControlNet: Dual-stage Integration for Conditional Image Generation

Jun 09, 2024
Via arXiv

History-Aware Planning for Risk-free Autonomous Navigation on Unknown Uneven Terrain

Jun 04, 2024
Via arXiv

LLM and GNN are Complementary: Distilling LLM for Multimodal Graph Learning

Jun 03, 2024
Via arXiv

UnitNorm: Rethinking Normalization for Transformers in Time Series

May 24, 2024
Via arXiv