Picture for Jie Ma

Jie Ma

SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction

Add code
Jul 22, 2025
Viaarxiv icon

ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding

Add code
May 25, 2025
Viaarxiv icon

Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs

Add code
May 21, 2025
Viaarxiv icon

NTIRE 2025 Challenge on UGC Video Enhancement: Methods and Results

Add code
May 05, 2025
Viaarxiv icon

Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement

Add code
Apr 02, 2025
Figure 1 for Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement
Figure 2 for Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement
Figure 3 for Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement
Figure 4 for Multimodal Point Cloud Semantic Segmentation With Virtual Point Enhancement
Viaarxiv icon

FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning

Add code
Apr 02, 2025
Figure 1 for FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
Figure 2 for FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
Figure 3 for FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
Figure 4 for FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
Viaarxiv icon

Single-Step Latent Consistency Model for Remote Sensing Image Super-Resolution

Add code
Mar 25, 2025
Figure 1 for Single-Step Latent Consistency Model for Remote Sensing Image Super-Resolution
Figure 2 for Single-Step Latent Consistency Model for Remote Sensing Image Super-Resolution
Figure 3 for Single-Step Latent Consistency Model for Remote Sensing Image Super-Resolution
Figure 4 for Single-Step Latent Consistency Model for Remote Sensing Image Super-Resolution
Viaarxiv icon

Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection

Add code
Mar 12, 2025
Figure 1 for Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
Figure 2 for Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
Figure 3 for Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
Figure 4 for Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
Viaarxiv icon

Positioning-Aided Channel Estimation for Multi-LEO Satellite Downlink Communications

Add code
Feb 09, 2025
Figure 1 for Positioning-Aided Channel Estimation for Multi-LEO Satellite Downlink Communications
Figure 2 for Positioning-Aided Channel Estimation for Multi-LEO Satellite Downlink Communications
Figure 3 for Positioning-Aided Channel Estimation for Multi-LEO Satellite Downlink Communications
Figure 4 for Positioning-Aided Channel Estimation for Multi-LEO Satellite Downlink Communications
Viaarxiv icon

Integrated Positioning and Communication via LEO Satellites: Opportunities and Challenges

Add code
Nov 21, 2024
Figure 1 for Integrated Positioning and Communication via LEO Satellites: Opportunities and Challenges
Figure 2 for Integrated Positioning and Communication via LEO Satellites: Opportunities and Challenges
Figure 3 for Integrated Positioning and Communication via LEO Satellites: Opportunities and Challenges
Figure 4 for Integrated Positioning and Communication via LEO Satellites: Opportunities and Challenges
Viaarxiv icon