Picture for Wanli Ouyang

Wanli Ouyang

School of Electrical and Information Engineering, The University of Sydney, Australia

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Add code
Mar 19, 2024
Viaarxiv icon

GVGEN: Text-to-3D Generation with Volumetric Representation

Add code
Mar 19, 2024
Figure 1 for GVGEN: Text-to-3D Generation with Volumetric Representation
Figure 2 for GVGEN: Text-to-3D Generation with Volumetric Representation
Figure 3 for GVGEN: Text-to-3D Generation with Volumetric Representation
Figure 4 for GVGEN: Text-to-3D Generation with Volumetric Representation
Viaarxiv icon

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Add code
Mar 18, 2024
Figure 1 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Figure 2 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Figure 3 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Figure 4 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Viaarxiv icon

HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation

Add code
Mar 18, 2024
Viaarxiv icon

PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest

Add code
Mar 14, 2024
Figure 1 for PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Figure 2 for PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Figure 3 for PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Figure 4 for PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Viaarxiv icon

LOCR: Location-Guided Transformer for Optical Character Recognition

Add code
Mar 04, 2024
Viaarxiv icon

Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation

Add code
Mar 02, 2024
Figure 1 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Figure 2 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Figure 3 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Figure 4 for Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation
Viaarxiv icon

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models

Add code
Feb 23, 2024
Figure 1 for ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Figure 2 for ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Figure 3 for ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Figure 4 for ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Viaarxiv icon

NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection

Add code
Feb 22, 2024
Figure 1 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Figure 2 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Figure 3 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Figure 4 for NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Viaarxiv icon

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

Add code
Feb 22, 2024
Figure 1 for MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Figure 2 for MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Figure 3 for MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Figure 4 for MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Viaarxiv icon