Picture for Zhiwei Zhang

Zhiwei Zhang

Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation

Add code
Nov 14, 2025
Figure 1 for Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation
Figure 2 for Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation
Figure 3 for Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation
Figure 4 for Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation
Viaarxiv icon

EMAformer: Enhancing Transformer through Embedding Armor for Time Series Forecasting

Add code
Nov 11, 2025
Viaarxiv icon

On Continuous Optimization for Constraint Satisfaction Problems

Add code
Oct 06, 2025
Figure 1 for On Continuous Optimization for Constraint Satisfaction Problems
Figure 2 for On Continuous Optimization for Constraint Satisfaction Problems
Figure 3 for On Continuous Optimization for Constraint Satisfaction Problems
Figure 4 for On Continuous Optimization for Constraint Satisfaction Problems
Viaarxiv icon

A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives

Add code
Aug 20, 2025
Figure 1 for A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives
Figure 2 for A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives
Figure 3 for A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives
Figure 4 for A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives
Viaarxiv icon

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary

Add code
Jul 10, 2025
Viaarxiv icon

Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models

Add code
Jun 14, 2025
Viaarxiv icon

Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind

Add code
May 18, 2025
Viaarxiv icon

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

Add code
May 17, 2025
Figure 1 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 2 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 3 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 4 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Viaarxiv icon

When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs

Add code
May 16, 2025
Viaarxiv icon

MediAug: Exploring Visual Augmentation in Medical Imaging

Add code
Apr 26, 2025
Viaarxiv icon