Zhen Peng

FusionBERT: Multi-View Image-3D Retrieval via Cross-Attention Visual Fusion and Normal-Aware 3D Encoder

Apr 02, 2026
Via arXiv

TLC-Plan: A Two-Level Codebook Based Network for End-to-End Vector Floorplan Generation

Feb 06, 2026
Via arXiv

BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models

Feb 04, 2026
Via arXiv

A novel VAE-DML fusion framework for causal analysis of greenwashing in the mining industry

Jan 31, 2026
Via arXiv

Stable Time Series Prediction of Enterprise Carbon Emissions Based on Causal Inference

Jan 31, 2026
Via arXiv

A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning

Sep 04, 2025
Via arXiv

LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning

May 24, 2025
Via arXiv

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Jan 30, 2025
Via arXiv

StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation

Jan 10, 2025
Via arXiv

Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey

Aug 23, 2024
Via arXiv