Picture for Hehe Fan

Hehe Fan

Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues

Add code
Mar 22, 2026
Viaarxiv icon

Variational Rectification Inference for Learning with Noisy Labels

Add code
Mar 18, 2026
Viaarxiv icon

Let's Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments

Add code
Mar 10, 2026
Viaarxiv icon

Super Research: Answering Highly Complex Questions with Large Language Models through Super Deep and Super Wide Research

Add code
Mar 03, 2026
Viaarxiv icon

Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models

Add code
Feb 16, 2026
Viaarxiv icon

Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL

Add code
Feb 13, 2026
Viaarxiv icon

4DPC$^2$hat: Towards Dynamic Point Cloud Understanding with Failure-Aware Bootstrapping

Add code
Feb 03, 2026
Viaarxiv icon

Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies

Add code
Feb 02, 2026
Viaarxiv icon

DRFormer: A Dual-Regularized Bidirectional Transformer for Person Re-identification

Add code
Feb 01, 2026
Viaarxiv icon

TransNormal: Dense Visual Semantics for Diffusion-based Transparent Object Normal Estimation

Add code
Jan 31, 2026
Viaarxiv icon