Picture for Fengjiao Chen

Fengjiao Chen

LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment

Add code
Apr 13, 2026
Viaarxiv icon

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Add code
Mar 29, 2026
Viaarxiv icon

UniHetero: Could Generation Enhance Understanding for Vision-Language-Model at Large Data Scale?

Add code
Dec 30, 2025
Viaarxiv icon