Picture for Jiaxin Fan

Jiaxin Fan

VisionPangu: A Compact and Fine-Grained Multimodal Assistant with 1.7B Parameters

Add code
Mar 05, 2026
Viaarxiv icon

Reinforced Reasoning for Embodied Planning

Add code
May 28, 2025
Figure 1 for Reinforced Reasoning for Embodied Planning
Figure 2 for Reinforced Reasoning for Embodied Planning
Figure 3 for Reinforced Reasoning for Embodied Planning
Figure 4 for Reinforced Reasoning for Embodied Planning
Viaarxiv icon

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Add code
Mar 16, 2025
Figure 1 for CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
Figure 2 for CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
Figure 3 for CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
Figure 4 for CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
Viaarxiv icon

Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism

Add code
Aug 07, 2023
Figure 1 for Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism
Figure 2 for Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism
Figure 3 for Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism
Figure 4 for Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism
Viaarxiv icon

PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis

Add code
Jun 30, 2022
Figure 1 for PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis
Figure 2 for PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis
Viaarxiv icon