Fengqing Zhu

Not Your Stereo-Typical Estimator: Combining Vision and Language for Volume Perception

Apr 10, 2026
Via arXiv

DietDelta: A Vision-Language Approach for Dietary Assessment via Before-and-After Images

Apr 07, 2026
Via arXiv

Adaptive Greedy Frame Selection for Long Video Understanding

Mar 20, 2026
Via arXiv

Can You Hear, Localize, and Segment Continually? An Exemplar-Free Continual Learning Benchmark for Audio-Visual Segmentation

Mar 09, 2026
Via arXiv

Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning

Mar 02, 2026
Via arXiv

Implicit-Scale 3D Reconstruction for Multi-Food Volume Estimation from Monocular Images

Feb 13, 2026
Via arXiv

Food Portion Estimation: From Pixels to Calories

Feb 04, 2026
Via arXiv

Leveraging Second-Order Curvature for Efficient Learned Image Compression: Theory and Empirical Evidence

Jan 29, 2026
Via arXiv

Size Matters: Reconstructing Real-Scale 3D Models from Monocular Images for Food Portion Estimation

Jan 27, 2026
Via arXiv

Training-Free Text-to-Image Compositional Food Generation via Prompt Grafting

Jan 25, 2026
Via arXiv