Picture for Yijia Fan

Yijia Fan

FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models

Add code
Dec 23, 2025
Viaarxiv icon

LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction

Add code
Dec 21, 2025
Viaarxiv icon

PTTA: A Pure Text-to-Animation Framework for High-Quality Creation

Add code
Dec 21, 2025
Viaarxiv icon

MM-CoT:A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models

Add code
Dec 09, 2025
Viaarxiv icon

HybridToken-VLM: Hybrid Token Compression for Vision-Language Models

Add code
Dec 09, 2025
Viaarxiv icon

3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale

Add code
Nov 17, 2025
Viaarxiv icon

Cost-Effective Communication: An Auction-based Method for Language Agent Interaction

Add code
Nov 17, 2025
Viaarxiv icon

Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization

Add code
Oct 26, 2025
Viaarxiv icon

RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability

Add code
Oct 26, 2025
Viaarxiv icon

Backward-Friendly Optimization: Training Large Language Models with Approximate Gradients under Memory Constraints

Add code
Oct 26, 2025
Viaarxiv icon