Picture for Hao Guo

Hao Guo

Chalmers University of Technology

Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training

Add code
Jan 06, 2026
Viaarxiv icon

Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark

Add code
Dec 23, 2025
Viaarxiv icon

BrepLLM: Native Boundary Representation Understanding with Large Language Models

Add code
Dec 18, 2025
Figure 1 for BrepLLM: Native Boundary Representation Understanding with Large Language Models
Figure 2 for BrepLLM: Native Boundary Representation Understanding with Large Language Models
Figure 3 for BrepLLM: Native Boundary Representation Understanding with Large Language Models
Figure 4 for BrepLLM: Native Boundary Representation Understanding with Large Language Models
Viaarxiv icon

Skillful Subseasonal-to-Seasonal Forecasting of Extreme Events with a Multi-Sphere Coupled Probabilistic Model

Add code
Dec 14, 2025
Viaarxiv icon

Action is All You Need: Dual-Flow Generative Ranking Network for Recommendation

Add code
May 22, 2025
Viaarxiv icon

Cross-layer Integrated Sensing and Communication: A Joint Industrial and Academic Perspective

Add code
May 16, 2025
Figure 1 for Cross-layer Integrated Sensing and Communication: A Joint Industrial and Academic Perspective
Figure 2 for Cross-layer Integrated Sensing and Communication: A Joint Industrial and Academic Perspective
Figure 3 for Cross-layer Integrated Sensing and Communication: A Joint Industrial and Academic Perspective
Figure 4 for Cross-layer Integrated Sensing and Communication: A Joint Industrial and Academic Perspective
Viaarxiv icon

EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation

Add code
May 13, 2025
Figure 1 for EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation
Figure 2 for EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation
Figure 3 for EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation
Figure 4 for EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation
Viaarxiv icon

BRepFormer: Transformer-Based B-rep Geometric Feature Recognition

Add code
Apr 10, 2025
Figure 1 for BRepFormer: Transformer-Based B-rep Geometric Feature Recognition
Figure 2 for BRepFormer: Transformer-Based B-rep Geometric Feature Recognition
Figure 3 for BRepFormer: Transformer-Based B-rep Geometric Feature Recognition
Figure 4 for BRepFormer: Transformer-Based B-rep Geometric Feature Recognition
Viaarxiv icon

Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding

Add code
Mar 25, 2025
Figure 1 for Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
Figure 2 for Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
Figure 3 for Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
Figure 4 for Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
Viaarxiv icon

TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes

Add code
Feb 04, 2025
Figure 1 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 2 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 3 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Figure 4 for TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes
Viaarxiv icon