Zijian Kang

SAIL-VL2 Technical Report

Sep 18, 2025

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Jul 10, 2025

SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement

Jul 02, 2025

VGR: Visual Grounded Reasoning

Jun 16, 2025

LEREL: Lipschitz Continuity-Constrained Emotion Recognition Ensemble Learning For Electroencephalography

Apr 12, 2025

Information Bottleneck-Guided Heterogeneous Graph Learning for Interpretable Neurodevelopmental Disorder Diagnosis

Feb 28, 2025

Scalable Vision Language Model Training via High Quality Data Curation

Jan 10, 2025

Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding

Dec 23, 2024

Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration

Feb 07, 2024

SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection

Sep 13, 2023