Hao Zhang

refer to the report for detailed contributions

GameArena: Evaluating LLM Reasoning through Live Computer Games

Dec 09, 2024

Large Language Models show both individual and collective creativity comparable to humans

Dec 04, 2024

FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes

Dec 04, 2024

CPA: Camera-pose-awareness Diffusion Transformer for Video Generation

Dec 02, 2024

Learning Adaptive Lighting via Channel-Aware Guidance

Dec 02, 2024

CRAYM: Neural Field Optimization via Camera RAY Matching

Dec 02, 2024

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Dec 01, 2024

Cross-modal Medical Image Generation Based on Pyramid Convolutional Attention Network

Nov 26, 2024

An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture

Nov 21, 2024

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Nov 21, 2024