Picture for Dong Chen

Dong Chen

MAFIG: Multi-agent Driven Formal Instruction Generation Framework

Add code
Apr 13, 2026
Viaarxiv icon

From UAV Imagery to Agronomic Reasoning: A Multimodal LLM Benchmark for Plant Phenotyping

Add code
Apr 10, 2026
Viaarxiv icon

A Persistent Homology Design Space for 3D Point Cloud Deep Learning

Add code
Apr 05, 2026
Viaarxiv icon

LACON: Training Text-to-Image Model from Uncurated Data

Add code
Mar 27, 2026
Viaarxiv icon

VideoWeaver: Multimodal Multi-View Video-to-Video Transfer for Embodied Agents

Add code
Mar 26, 2026
Viaarxiv icon

Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting

Add code
Mar 15, 2026
Viaarxiv icon

ReMem-VLA: Empowering Vision-Language-Action Model with Memory via Dual-Level Recurrent Queries

Add code
Mar 13, 2026
Viaarxiv icon

A Novel Modular Cable-Driven Soft Robotic Arm with Multi-Segment Reconfigurability

Add code
Mar 04, 2026
Viaarxiv icon

Clinical-Prior Guided Multi-Modal Learning with Latent Attention Pooling for Gait-Based Scoliosis Screening

Add code
Feb 06, 2026
Viaarxiv icon

Token Entropy Regularization for Multi-modal Antenna Affiliation Identification

Add code
Jan 29, 2026
Viaarxiv icon