Picture for Lin Ma

Lin Ma

Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput

Add code
May 14, 2025
Viaarxiv icon

TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation

Add code
May 14, 2025
Viaarxiv icon

ScaleTrack: Scaling and back-tracking Automated GUI Agents

Add code
May 01, 2025
Viaarxiv icon

MoE-Lens: Towards the Hardware Limit of High-Throughput MoE LLM Serving Under Resource Constraints

Add code
Apr 12, 2025
Viaarxiv icon

InstructionBench: An Instructional Video Understanding Benchmark

Add code
Apr 07, 2025
Viaarxiv icon

UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding

Add code
Apr 06, 2025
Viaarxiv icon

UniViTAR: Unified Vision Transformer with Native Resolution

Add code
Apr 02, 2025
Viaarxiv icon

AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline

Add code
Apr 01, 2025
Viaarxiv icon

DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data

Add code
Mar 25, 2025
Viaarxiv icon

Variational Bayesian Personalized Ranking

Add code
Mar 14, 2025
Viaarxiv icon