Picture for Zhaoxiang Zhang

Zhaoxiang Zhang

OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction

Add code
Oct 30, 2024
Viaarxiv icon

FreeVS: Generative View Synthesis on Free Driving Trajectory

Add code
Oct 23, 2024
Figure 1 for FreeVS: Generative View Synthesis on Free Driving Trajectory
Figure 2 for FreeVS: Generative View Synthesis on Free Driving Trajectory
Figure 3 for FreeVS: Generative View Synthesis on Free Driving Trajectory
Figure 4 for FreeVS: Generative View Synthesis on Free Driving Trajectory
Viaarxiv icon

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Add code
Oct 17, 2024
Figure 1 for A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Figure 2 for A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Figure 3 for A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Figure 4 for A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Viaarxiv icon

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

Add code
Oct 15, 2024
Figure 1 for MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
Figure 2 for MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
Figure 3 for MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
Figure 4 for MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
Viaarxiv icon

DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model

Add code
Oct 14, 2024
Figure 1 for DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
Figure 2 for DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
Figure 3 for DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
Figure 4 for DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
Viaarxiv icon

Reconstructive Visual Instruction Tuning

Add code
Oct 12, 2024
Viaarxiv icon

MIO: A Foundation Model on Multimodal Tokens

Add code
Sep 26, 2024
Figure 1 for MIO: A Foundation Model on Multimodal Tokens
Figure 2 for MIO: A Foundation Model on Multimodal Tokens
Figure 3 for MIO: A Foundation Model on Multimodal Tokens
Figure 4 for MIO: A Foundation Model on Multimodal Tokens
Viaarxiv icon

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Add code
Sep 24, 2024
Figure 1 for HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Figure 2 for HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Figure 3 for HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Figure 4 for HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Viaarxiv icon

SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality

Add code
Sep 12, 2024
Figure 1 for SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
Figure 2 for SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
Figure 3 for SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
Figure 4 for SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality
Viaarxiv icon

Enhancing Sound Source Localization via False Negative Elimination

Add code
Aug 29, 2024
Figure 1 for Enhancing Sound Source Localization via False Negative Elimination
Figure 2 for Enhancing Sound Source Localization via False Negative Elimination
Figure 3 for Enhancing Sound Source Localization via False Negative Elimination
Figure 4 for Enhancing Sound Source Localization via False Negative Elimination
Viaarxiv icon