Picture for Xiaoshui Huang

Xiaoshui Huang

Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation

Add code
Sep 04, 2024
Figure 1 for Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation
Figure 2 for Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation
Figure 3 for Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation
Figure 4 for Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation
Viaarxiv icon

COMOGen: A Controllable Text-to-3D Multi-object Generation Framework

Add code
Sep 01, 2024
Figure 1 for COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Figure 2 for COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Figure 3 for COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Figure 4 for COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Viaarxiv icon

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

Add code
Jul 13, 2024
Figure 1 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Figure 2 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Figure 3 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Figure 4 for TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
Viaarxiv icon

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Add code
Jun 11, 2024
Figure 1 for Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
Figure 2 for Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
Figure 3 for Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
Figure 4 for Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
Viaarxiv icon

Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch

Add code
May 25, 2024
Figure 1 for Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch
Figure 2 for Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch
Figure 3 for Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch
Figure 4 for Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch
Viaarxiv icon

3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset

Add code
Apr 23, 2024
Figure 1 for 3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset
Figure 2 for 3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset
Figure 3 for 3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset
Figure 4 for 3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset
Viaarxiv icon

Taming Stable Diffusion for Text to 360° Panorama Image Generation

Add code
Apr 11, 2024
Viaarxiv icon

GVGEN: Text-to-3D Generation with Volumetric Representation

Add code
Mar 19, 2024
Figure 1 for GVGEN: Text-to-3D Generation with Volumetric Representation
Figure 2 for GVGEN: Text-to-3D Generation with Volumetric Representation
Figure 3 for GVGEN: Text-to-3D Generation with Volumetric Representation
Figure 4 for GVGEN: Text-to-3D Generation with Volumetric Representation
Viaarxiv icon

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention

Add code
Mar 17, 2024
Figure 1 for THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
Figure 2 for THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
Figure 3 for THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
Figure 4 for THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
Viaarxiv icon

NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection

Add code
Feb 22, 2024
Figure 1 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Figure 2 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Figure 3 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Figure 4 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Viaarxiv icon