Picture for Chen Wei

Chen Wei

PyVision: Agentic Vision with Dynamic Tooling

Add code
Jul 10, 2025
Viaarxiv icon

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Add code
Jul 09, 2025
Viaarxiv icon

Play to Generalize: Learning to Reason Through Game Play

Add code
Jun 09, 2025
Viaarxiv icon

AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists

Add code
Jun 09, 2025
Viaarxiv icon

Hi-VAE: Efficient Video Autoencoding with Global and Detailed Motion

Add code
Jun 08, 2025
Viaarxiv icon

SMAR: Soft Modality-Aware Routing Strategy for MoE-based Multimodal Large Language Models Preserving Language Capabilities

Add code
Jun 06, 2025
Viaarxiv icon

Synthesizing Images on Perceptual Boundaries of ANNs for Uncovering and Manipulating Human Perceptual Variability

Add code
May 06, 2025
Viaarxiv icon

A Report on the llms evaluating the high school questions

Add code
Apr 30, 2025
Viaarxiv icon

FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation

Add code
Apr 22, 2025
Viaarxiv icon

Perception Encoder: The best visual embeddings are not at the output of the network

Add code
Apr 17, 2025
Viaarxiv icon